Package | Description |
---|---|
org.archive.crawler |
Introduction to Heritrix.
|
org.archive.crawler.deciderules |
Provides classes for a simple decision rules framework.
|
org.archive.crawler.event | |
org.archive.crawler.framework | |
org.archive.crawler.frontier | |
org.archive.crawler.monitor |
This package consists of modules that monitor an ongoing crawl by various means,
typically interceding if certain limits/thresholds/conditions are met.
|
org.archive.crawler.postprocessor | |
org.archive.crawler.prefetch | |
org.archive.crawler.processor | |
org.archive.crawler.reporting | |
org.archive.crawler.restlet | |
org.archive.crawler.restlet.models |
Class and Description |
---|
Engine
Implementation for Engine.
|
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Class and Description |
---|
CrawlController.State |
Class and Description |
---|
CheckpointService
Executes checkpoints, and offers convenience methods for enumerating
available Checkpoints and injecting a recovery-Checkpoint after
build and before launch (setRecoveryCheckpointByName).
|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
CrawlController.State |
CrawlJob
CrawlJob represents a crawl configuration, including its
configuration files, instantiated/running ApplicationContext, and
disk output, potentially across multiple runs.
|
CrawlStatus |
Frontier
An interface for URI Frontiers.
|
Frontier.FrontierGroup
Generic interface representing the internal groupings
of a Frontier's URIs -- usually queues.
|
Frontier.State
Enumeration of possible target states.
|
ToePool
A collection of ToeThreads.
|
ToeThread.Step |
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Frontier
An interface for URI Frontiers.
|
Frontier.FrontierGroup
Generic interface representing the internal groupings
of a Frontier's URIs -- usually queues.
|
Frontier.State
Enumeration of possible target states.
|
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Frontier
An interface for URI Frontiers.
|
Scoper
Base class for Scopers.
|
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Frontier
An interface for URI Frontiers.
|
Scoper
Base class for Scopers.
|
Class and Description |
---|
Frontier
An interface for URI Frontiers.
|
Class and Description |
---|
CrawlController
CrawlController collects all the classes which cooperate to
perform a crawl and provides a high-level interface to the
running crawl.
|
Class and Description |
---|
CrawlJob
CrawlJob represents a crawl configuration, including its
configuration files, instantiated/running ApplicationContext, and
disk output, potentially across multiple runs.
|
Engine
Implementation for Engine.
|
Class and Description |
---|
CrawlJob
CrawlJob represents a crawl configuration, including its
configuration files, instantiated/running ApplicationContext, and
disk output, potentially across multiple runs.
|
Engine
Implementation for Engine.
|
Copyright © 2003-2014 Internet Archive. All Rights Reserved.