comirva.web.crawling.agmis
Class SearchResultsAnalyzer

java.lang.Object
  extended by comirva.web.crawling.agmis.SearchResultsAnalyzer

public class SearchResultsAnalyzer
extends Object

Analyzes the info.xml files stored for a crawl and prints out some statistical measures. This class can further be used to generate subsets of the crawled artists.


Field Summary
static boolean OUTPUT_PC_DISTRIBUTIONS_FOR_EACH_GENRE
           
static boolean PERFORM_TOP_X_SELECTION
           
static File ROOT_DIR
           
static float SELECT_TOP_X_PERCENT_PER_GENRE
           
static File SUBSET_FILE
           
 
Constructor Summary
SearchResultsAnalyzer()
           
 
Method Summary
 void analyzeSearchResults()
          Walks through all directories under ROOT_DIR searching for crawl info filex (info.xml) and analyzes them.
static void main(String[] args)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

ROOT_DIR

public static File ROOT_DIR

PERFORM_TOP_X_SELECTION

public static final boolean PERFORM_TOP_X_SELECTION
See Also:
Constant Field Values

SELECT_TOP_X_PERCENT_PER_GENRE

public static final float SELECT_TOP_X_PERCENT_PER_GENRE
See Also:
Constant Field Values

SUBSET_FILE

public static File SUBSET_FILE

OUTPUT_PC_DISTRIBUTIONS_FOR_EACH_GENRE

public static final boolean OUTPUT_PC_DISTRIBUTIONS_FOR_EACH_GENRE
See Also:
Constant Field Values
Constructor Detail

SearchResultsAnalyzer

public SearchResultsAnalyzer()
Method Detail

analyzeSearchResults

public void analyzeSearchResults()
Walks through all directories under ROOT_DIR searching for crawl info filex (info.xml) and analyzes them. If desired by user (set PERFORM_TOP_X_SELECTION, a subset of the top-X%-artists is created and written to SUBSET_FILE.


main

public static void main(String[] args)