New Search Technologies Mine the Web More Deeply - NYTimes.com
-
Beyond the realm of consumer searches, Deep Web technologies may eventually let businesses use data in new ways. For example, a health site could cross-reference data from pharmaceutical companies with the latest findings from medical researchers, or a local news site could extend its coverage by letting users tap into public records stored in government databases.
-
The article discusses the new technologies that major search engines such as Google are using to shrink the 99% of data that is hidden and largely unsearchable. This information is public, but search engines still struggle to find a way to access it. With these new technologies they will be able to explore beyond their current reach, which will likely improve the quality of the information delivered to online users. One such technology is software developed by Kosmix that matches databases containing meaningful information to the queries being made, and so delivers a summary of important topics drawn from numerous internet sources. Google, on the other hand, uses a web search strategy in which programs determine the contents of every website they encounter. DeepPeep is yet another technology; it sends out spiders to crawl the web and index every database on it. This seems quite difficult to accomplish, as the volume of information on the web is so vast that it takes more than a crawler to penetrate beyond the tiny surface of the web that is presently being searched. Indexing every website will be a challenge, as many website owners build their sites to greatly limit or block indexing by search engines. Website integration technology has also been explored. Websites cross-reference each other, an approach quite similar to the semantic web. However, in my view, if the semantic web remains unrealized and little known, and many online users are unfamiliar with its potential to interconnect data, how will another program built on the same or a similar platform perform the task that the semantic web never did? While the article was written a few years back, it goes to show that efforts to penetrate unsearchable databases have long been under way, and presently ther