comirva.web.crawling
Class WebCrawling

java.lang.Object
  extended by java.lang.Thread
      extended by comirva.web.crawling.WebCrawling
All Implemented Interfaces:
Runnable

public class WebCrawling
extends Thread

This class implements a simple web crawler to fetch top-ranked Web pages according to a search engine. It accesses a search engine to get a list of web pages (using class AnySearch) and crawls them (using wget). The results are stored in a local path. This class uses AnySearch for querying the search engine!


Nested Class Summary
 
Nested classes/interfaces inherited from class java.lang.Thread
Thread.State, Thread.UncaughtExceptionHandler
 
Field Summary
 
Fields inherited from class java.lang.Thread
MAX_PRIORITY, MIN_PRIORITY, NORM_PRIORITY
 
Constructor Summary
WebCrawling(WebCrawlingConfig wcCfg, Vector searchWords, JLabel statusBar)
          Creates a new WebCrawling-instance.
 
Method Summary
 void run()
          Initializes and starts the web crawl.
 
Methods inherited from class java.lang.Thread
activeCount, checkAccess, countStackFrames, currentThread, destroy, dumpStack, enumerate, getAllStackTraces, getContextClassLoader, getDefaultUncaughtExceptionHandler, getId, getName, getPriority, getStackTrace, getState, getThreadGroup, getUncaughtExceptionHandler, holdsLock, interrupt, interrupted, isAlive, isDaemon, isInterrupted, join, join, join, resume, setContextClassLoader, setDaemon, setDefaultUncaughtExceptionHandler, setName, setPriority, setUncaughtExceptionHandler, sleep, sleep, start, stop, stop, suspend, toString, yield
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

WebCrawling

public WebCrawling(WebCrawlingConfig wcCfg,
                   Vector searchWords,
                   JLabel statusBar)
Creates a new WebCrawling-instance.

Parameters:
wcCfg - a WebCrawlingConfig-instance containing the settings for the web crawl
searchWords - a Vector containing all search queries
statusBar - the JLabel of CoMIRVA's status bar to inform the user
Method Detail

run

public void run()
Initializes and starts the web crawl.

Specified by:
run in interface Runnable
Overrides:
run in class Thread
See Also:
Thread.run()