LAWA | Longitudinal Analytics of Web Archive Data - 0 views
-
Janos Haits on 30 Nov 12LAWA will federate distributed FIRE facilities with the rich Web repository of the European Archive, to create a Virtual Web Observatory and use Web data analytics as a use case study to validate our design. The outcome of our work will enable Internet-scale analysis of data, and bring the content aspect of the Internet on the roadmap of Future Internet Research. In four work packages we will extend the open-source Hadoop software by novel methods for wide-area data access, distributed storage and indexing, scalable data aggregation and data analysis along the time dimension, and automatic classification of Web contents.