Skip to main content

Home/ attuneuniversity/ Group items tagged nutch

Rss Feed Group items tagged

Online Training  at Attune University

Web Crawling and Data Mining with Apache Nutch - Book is Out - 0 views

  •  
    Team Attune and Packt Publishing has joined hands and published book on "Web Crawling and Data Mining with Apache Nutch" few days back. Attune Infocom is an open source consulting and development company with a strength of open source specialist team, While Packt Publishing is well known name in open source technologies publication house.
Online Training  at Attune University

How To Build And Deploy Plugin With Apache Nutch - 0 views

  •  
    I simply would like to add a new field to the index. This new field should indicate the length of the parsed content of the respective web page and therefore be called pageLength. As a first step, you need to create all the necessary new files. Lets say, we call the plugin "TestPlugin".
Online Training  at Attune University

Crawling in Apache Nutch using Eclispe - 0 views

  •  
    Prerequisites You need to have Apache Ant installed and configured on your system. Grab the newest version of Eclipse available http://www.eclipse.org/downloads/. All of the following should be available from the Eclipse MarketPlace. However if not, you can download them throughout Eclipse as follows. Once you've set up Eclipse, download Subclipse from per http://subclipse.tigris.org/.
1 - 3 of 3
Showing 20 items per page