Archive-It is a subscription web archiving service from the Internet Archive that helps organizations to harvest, build, and preserve collections of digital content. Through our user friendly web application Archive-It partners can collect, catalog, and manage their collections of archived content with 24/7 access and full text search available for their use as well as their patrons. Content is hosted and stored at the Internet Archive data centers.
"Currently making 1.67TB of research data available.
Sharing data is hard. Emails have size limits, and setting up servers is too much work. We've designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds."
"Find used books, out of print books, textbooks, rare books and new books for sale. Search hundreds of millions of books from over 100,000 booksellers and 60+ websites worldwide"
NEIL (Never Ending Image Learner) is a computer program that runs 24 hours per day and 7 days per week to automatically extract visual knowledge from Internet data. It is an effort to build the world's largest visual knowledge base with minimum human labeling effort - one that would be useful to many computer vision and AI efforts. See current statistics about how much NEIL knows about our world!!
"Sciencenet - Towards a global search and share engine for all scientific knowledge
Modern biological experiments create vast amounts of data which are geographically distributed. These datasets consist of petabytes of raw data and billions of documents. Yet to the best of our knowledge, a search engine technology that searches and crosslinks all different data types in life sciences does not exist."