comirva.web.crawling.agmis
Class GoldenRetriever_ProcessedIndexCorrector
java.lang.Object
comirva.web.crawling.agmis.GoldenRetriever_ProcessedIndexCorrector
public class GoldenRetriever_ProcessedIndexCorrector
extends Object
This class analyzes crawling.txt and writes the file processed_idx.txt,
which contains the indices (with respect to crawling.txt) of all URLs that
have actually been retrieved. It determines this by checking whether the
corresponding files reside on the hard disk.
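The check described above might look roughly like the following sketch. It is not the class's actual implementation. It assumes that crawling.txt holds one URL per line, that indices are zero-based line numbers, and that the page for index i is stored as "<i>.html" in a download directory. The class name ProcessedIndexSketch and the helpers localFileFor, findRetrievedIndices and writeIndices are hypothetical names introduced only for illustration.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only; names and file layout are assumptions, not the CoMIRVA API.
class ProcessedIndexSketch {

    // ASSUMPTION: the page for line `index` of crawling.txt is stored as "<index>.html".
    static Path localFileFor(Path downloadDir, int index) {
        return downloadDir.resolve(index + ".html");
    }

    // Returns the zero-based line indices of crawling.txt whose page file exists on disk.
    static List<Integer> findRetrievedIndices(Path crawlingFile, Path downloadDir)
            throws IOException {
        List<String> urls = Files.readAllLines(crawlingFile);
        List<Integer> retrieved = new ArrayList<>();
        for (int i = 0; i < urls.size(); i++) {
            if (urls.get(i).trim().isEmpty()) {
                continue; // skip blank lines, they carry no URL
            }
            if (Files.isRegularFile(localFileFor(downloadDir, i))) {
                retrieved.add(i);
            }
        }
        return retrieved;
    }

    // Writes one index per line, e.g. to a file named processed_idx.txt.
    static void writeIndices(List<Integer> indices, Path processedIdxFile)
            throws IOException {
        List<String> lines = new ArrayList<>();
        for (int idx : indices) {
            lines.add(Integer.toString(idx));
        }
        Files.write(processedIdxFile, lines);
    }
}
```

Under these assumptions, calling findRetrievedIndices and passing its result to writeIndices would regenerate the index file from whatever is actually present on disk, so entries for pages that were listed but never downloaded are dropped.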
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
PROCESSED_IDX_FILE
public static File PROCESSED_IDX_FILE
GoldenRetriever_ProcessedIndexCorrector
public GoldenRetriever_ProcessedIndexCorrector()
main
public static void main(String[] args)