comirva.web.crawling.agmis
Class SearchResultsAnalyzer
java.lang.Object
comirva.web.crawling.agmis.SearchResultsAnalyzer
public class SearchResultsAnalyzer
- extends Object
Analyzes the info.xml files stored for a crawl and prints out some statistical measures.
This class can further be used to generate subsets of the crawled artists.
Method Summary |
void |
analyzeSearchResults()
Walks through all directories under ROOT_DIR searching for crawl info filex (info.xml) and
analyzes them. |
static void |
main(String[] args)
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
ROOT_DIR
public static File ROOT_DIR
PERFORM_TOP_X_SELECTION
public static final boolean PERFORM_TOP_X_SELECTION
- See Also:
- Constant Field Values
SELECT_TOP_X_PERCENT_PER_GENRE
public static final float SELECT_TOP_X_PERCENT_PER_GENRE
- See Also:
- Constant Field Values
SUBSET_FILE
public static File SUBSET_FILE
OUTPUT_PC_DISTRIBUTIONS_FOR_EACH_GENRE
public static final boolean OUTPUT_PC_DISTRIBUTIONS_FOR_EACH_GENRE
- See Also:
- Constant Field Values
SearchResultsAnalyzer
public SearchResultsAnalyzer()
analyzeSearchResults
public void analyzeSearchResults()
- Walks through all directories under ROOT_DIR searching for crawl info filex (info.xml) and
analyzes them. If desired by user (set PERFORM_TOP_X_SELECTION, a subset of the top-X%-artists is created and written to SUBSET_FILE.
main
public static void main(String[] args)