Probing the Deep, Invisible Web

Please note, this is a STATIC archive of website moz.com from 05 Jul 2018, cach3.com does not collect or store any user information, there is no "phishing" involved.

By: Rand Fishkin June 8th, 2005

Probing the Deep, Invisible Web

Mobile

The author's views are entirely his or her own (excluding the unlikely event of hypnosis) and may not always reflect the views of Moz.

As most of us who are in search know, the web's repository of knowledge is vastly larger than what is accessible to the major commercial search engines. Hundreds of thousands of databases cannot be probed and their documents remain invisible to all but th specialized search tools used to dig them out.

Fortunately, several projects, including one pointed to by Xan Porter (called Qprober - PDF warning) use automatic text extraction followed by intelligent, trained querying of a system to extract as many of the available documents as possible. In the future, this type of intelligent access will provide a gateway for access to much of what is currently 'invisible'.

Today, for example, I used another resource Xan pointed out, called Complete Planet to find an art project that was hidden in the database of the Library of Fine Arts in Dresden, Germany. The search revealed an installation artist's work - GÃ¼nther Hornig - who has a great eye for the whimsical.

Comments 0

Please keep your comments TAGFEE by following the community etiquette.

E-mail me when new comments are posted

Sort by:

Comments are closed on posts more than 30 days old. Got a burning question? Head to our Q&A section to start a new conversation.

Post Analytics

Comments 0

Log in to Moz

Don't have an account?