comirva.web.crawling
Class WebCrawling
java.lang.Object
java.lang.Thread
comirva.web.crawling.WebCrawling
- All Implemented Interfaces:
- Runnable
public class WebCrawling
- extends Thread
This class implements a simple web crawler that fetches the top-ranked
web pages returned by a search engine.
It queries the search engine for a list of web pages
(using class AnySearch) and downloads them (using wget).
The results are stored under a local path.
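The download step described above can be sketched as follows. This is a minimal illustration of calling wget from Java, not the actual WebCrawling code: the class WgetCommandSketch, its methods, the chosen wget options and the directory name are all assumptions made for the example.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of CoMIRVA.
public class WgetCommandSketch {

    // Builds the argument list for one wget call that saves `url` under `targetDir`.
    public static List<String> buildCommand(String url, Path targetDir) {
        List<String> cmd = new ArrayList<>();
        cmd.add("wget");
        cmd.add("--directory-prefix=" + targetDir); // where wget stores the page
        cmd.add("--tries=2");                       // give up after two attempts
        cmd.add("--timeout=30");                    // network timeout in seconds
        cmd.add(url);
        return cmd;
    }

    // Runs wget for a single URL and returns its exit code (0 means success).
    // Requires wget to be installed and on the PATH.
    public static int fetch(String url, Path targetDir)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(url, targetDir))
                .inheritIO() // show wget's own output on the console
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) {
        // Only prints the command; call fetch(...) to actually download.
        Path dir = Paths.get("crawl", "query0");
        System.out.println(String.join(" ", buildCommand("http://www.example.com/", dir)));
    }
}
```

Passing the arguments as a list to ProcessBuilder avoids shell quoting problems with URLs that contain characters such as & or ?.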
Method Summary
void run()
Initializes and starts the web crawl.
Methods inherited from class java.lang.Thread
activeCount, checkAccess, countStackFrames, currentThread, destroy, dumpStack, enumerate, getAllStackTraces, getContextClassLoader, getDefaultUncaughtExceptionHandler, getId, getName, getPriority, getStackTrace, getState, getThreadGroup, getUncaughtExceptionHandler, holdsLock, interrupt, interrupted, isAlive, isDaemon, isInterrupted, join, join, join, resume, setContextClassLoader, setDaemon, setDefaultUncaughtExceptionHandler, setName, setPriority, setUncaughtExceptionHandler, sleep, sleep, start, stop, stop, suspend, toString, yield
WebCrawling
public WebCrawling(WebCrawlingConfig wcCfg,
Vector searchWords,
JLabel statusBar)
- Creates a new WebCrawling instance.
- Parameters:
wcCfg - a WebCrawlingConfig instance containing the settings for the web crawl
searchWords - a Vector containing all search queries
statusBar - the JLabel of CoMIRVA's status bar, used to inform the user
run
public void run()
- Initializes and starts the web crawl.
- Specified by:
run in interface Runnable
- Overrides:
run in class Thread
- See Also:
Thread.run()
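Because WebCrawling extends Thread, run() executes on a separate thread only when the crawl is launched through start(). Calling run() directly executes it on the calling thread. The sketch below shows the difference with a hypothetical stand-in class (CrawlStandIn), since constructing a real WebCrawling requires a WebCrawlingConfig whose setup is not documented here. All names in the example are assumptions.

```java
public class StartVersusRunSketch {

    // Hypothetical stand-in for a Thread subclass such as WebCrawling.
    static class CrawlStandIn extends Thread {
        volatile String executedOn; // name of the thread that ran run()

        @Override
        public void run() {
            // A real crawler would query the search engine and fetch pages here.
            executedOn = Thread.currentThread().getName();
        }
    }

    // Launches the work with start() and waits for it to finish with join().
    public static String runViaStart() {
        CrawlStandIn crawl = new CrawlStandIn();
        crawl.setName("crawl-worker");
        crawl.start();
        try {
            crawl.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt status
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return crawl.executedOn;
    }

    // Calls run() directly: no new thread is created.
    public static String runDirectly() {
        CrawlStandIn crawl = new CrawlStandIn();
        crawl.setName("crawl-worker");
        crawl.run();
        return crawl.executedOn;
    }

    public static void main(String[] args) {
        System.out.println("start(): ran on " + runViaStart());
        System.out.println("run():   ran on " + runDirectly());
    }
}
```

In a Swing application such as CoMIRVA, launching the crawl with start() keeps the long-running download off the thread that called it, so the caller can continue while the crawl proceeds.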