"Censys is a search engine that allows computer scientists to ask questions about the devices and networks that compose the Internet. Driven by Internet-wide scanning, Censys lets researchers find specific hosts and create aggregate reports on how devices, websites, and certificates are configured and deployed. [more information]"
I've put together a list of 150+ general web search engines; that is to say, search engines that search the web. It is not a complete listing, nor is it a 'best of'; it's just a collection of engines that I know about and that I've blogged about as well. There's a few specialist ones too! If you know of a general search engine that I've not included, please email me!
"BabelNet is both a multilingual encyclopedic dictionary, with lexicographic and encyclopedic coverage of terms, and a semantic network which connects concepts and named entities in a very large network of semantic relations, made up of about 15 million entries, called Babel synsets. Each Babel synset represents a given meaning and contains all the synonyms which express that meaning in a range of different languages. Its evolution, BabelNet live, is a new, continuously growing resource, .."
"Introduction to the Knowledge Graph API
The Google Knowledge Graph API reveals entity information related to a keyword, that Google knows about. This information can be very useful for SEO - discovering related topics and what Google believes is relevant. It can also help when trying to claim/win a Knowledge Graph box on search results. The API requires a high level of technical understanding, so this tool creates a simple public interface, with the ability to export data into spreadsheets."
"Anna's Archive is a project that aims to catalog all the books in existence, by aggregating data from various sources. We also track humanity's progress toward making all these books easily available in digital form, through "shadow libraries". Learn more about us."
"Stract is an open source search engine where the user has the ability to see exactly what is going on and customize almost everything about their search results. It's a search engine made for hackers and tinkerers just like ourselves. No more searches where some of the terms in the query arent used, and the engine tries to guess what you really meant. You get what you search for."
Spokeo is a search engine specialized in organizing people-related information from phone books, social networks, marketing lists, business sites, and other public sources. Most of this data is publicly available on the Web. For example, you can find people's name, phone, and address on Whitepages.com, and you can get home values from Zillow.com. That said, only Spokeo's algorithm can piece together the scattered data into coherent people profiles, giving you the most comprehensive intelligence about anyone you want to find.
Your friends make the news more often than you think. But right now, you usually miss it.
With Newsle, you'll find out when a friend wins an award...or gets arrested.
Our mission is to create a working search engine for indexing, searching and cataloguing content present on the Tor network. Furthermore, we are creating an environment to share meaningful statistics, insights and news about the Tor network itself. Striving to support the Tor project, we are running exit nodes and tor2web nodes (tor2web.fi). We believe that the Tor network is an important, anonymous, resilient, censorship-resistant and distributed platform that can provide easy-to-implement anonymity to websites and other web services. Servers configured to receive inbound connections through Tor are called hidden services: rather than revealing a server's real IP address, an hi
The University Library has launched a beta version (which means it may be changing!) of a new product to search articles -- and more! ArticlesPlus is perhaps best described as "Google for the library's online content -- without the ads!". From a simple single search box, ArticlesPlus searches full-text content as well as metadata from a wide variety of sources and returns a list of relevancy-ranked results.