Interface | Description |
---|---|
Frontier | An interface for URI Frontiers. |
Frontier.FrontierGroup | Generic interface representing the internal groupings of a Frontier's URIs, usually queues. |

Class | Description |
---|---|
ActionDirectory | Directory watched for new files. |
BeanLookupBindings | Provides syntactic sugar for H3 scripts to reference beans without adding a line like def scope = appCtx.getBean("scope"); |
CheckpointService | Executes checkpoints, and offers convenience methods for enumerating available Checkpoints and injecting a recovery Checkpoint after build and before launch (setRecoveryCheckpointByName). |
CheckpointSuccessEvent | Reports the success of a Checkpoint (so that it may be reported by the CrawlJob to the job log). |
CheckpointValidator | |
CrawlController | CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl. |
CrawlController.StopCompleteEvent | |
CrawlJob | CrawlJob represents a crawl configuration, including its configuration files, instantiated/running ApplicationContext, and disk output, potentially across multiple runs. |
CrawlLimitEnforcer | Bean to enforce limits on the size of a crawl in URI count, byte count, or elapsed time. |
Engine | Implementation for Engine. |
Scoper | Base class for Scopers. |
ToePool | A collection of ToeThreads. |
ToeThread | One "worker thread"; asks for CrawlURIs, processes them, repeats unless told otherwise. |

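
The ToeThread description above (ask for a CrawlURI, process it, repeat unless told otherwise) is a common worker-loop pattern. The following is a minimal sketch of that pattern only, not Heritrix's implementation: UriSource, Worker, requestStop, and runDemo are hypothetical names invented for illustration, plain strings stand in for CrawlURIs, and "processing" is reduced to recording the URI.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Illustrative only: none of these types are Heritrix classes.
final class WorkerLoopSketch {

    /** Hypothetical stand-in for a frontier: hands queued URIs to workers. */
    static final class UriSource {
        private final BlockingQueue<String> queue = new LinkedBlockingQueue<>();

        void schedule(String uri) {
            queue.add(uri);
        }

        /** Returns the next URI, or null if none arrives within the timeout. */
        String next(long timeoutMillis) throws InterruptedException {
            return queue.poll(timeoutMillis, TimeUnit.MILLISECONDS);
        }
    }

    /** Hypothetical worker: ask for a URI, process it, repeat until stopped. */
    static final class Worker implements Runnable {
        private final UriSource source;
        private final List<String> processed = new CopyOnWriteArrayList<>();
        private volatile boolean stopRequested = false;

        Worker(UriSource source) {
            this.source = source;
        }

        void requestStop() {
            stopRequested = true;
        }

        /** Returns a snapshot of the URIs handled so far. */
        List<String> processed() {
            return new ArrayList<>(processed);
        }

        @Override
        public void run() {
            try {
                while (!stopRequested) {
                    String uri = source.next(50);
                    if (uri == null) {
                        continue; // nothing available yet; re-check the stop flag
                    }
                    processed.add(uri); // placeholder for real fetch/extract work
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // keep interrupt status, then exit
            }
        }
    }

    /** Runs one worker over three example URIs and returns what it processed. */
    static List<String> runDemo() {
        UriSource source = new UriSource();
        source.schedule("http://example.com/a");
        source.schedule("http://example.com/b");
        source.schedule("http://example.com/c");

        Worker worker = new Worker(source);
        Thread thread = new Thread(worker, "worker-sketch");
        thread.start();
        try {
            long deadline = System.currentTimeMillis() + 5000;
            while (worker.processed().size() < 3
                    && System.currentTimeMillis() < deadline) {
                Thread.sleep(10);
            }
            worker.requestStop(); // the "told otherwise" signal
            thread.join(5000);
        } catch (InterruptedException e) {
            worker.requestStop();
            Thread.currentThread().interrupt();
        }
        return worker.processed();
    }

    public static void main(String[] args) {
        System.out.println(runDemo());
    }
}
```

The timed poll is a design choice in this sketch: it lets the worker notice a stop request even when no URIs are waiting, instead of blocking indefinitely on an empty queue.
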
Enum | Description |
---|---|
CrawlController.State | |
CrawlStatus | |
Frontier.State | Enumeration of possible target states. |
ToeThread.Step | |

Copyright © 2003-2014 Internet Archive. All Rights Reserved.