| Interface | Description |
|---|---|
| Crawler | |
| CrawlerAccess |
A call back interface for the
WebCrawler. |
| Class | Description |
|---|---|
| AbstractCrawler | |
| HTMLParser | |
| LuceneCrawler | |
| ReplaceFileInputStream | |
| URLMask |
URLMask is used by WebCrawler to decide whether or not to crawl an url.
|
| WebCrawler |
A generic class for crawling web pages (or possibly other objects/files too).
|
Copyright 2004-2015 Wandora Team