Package | Description |
---|---|
org.archive.modules | The beginnings of a refactored settings framework. |
org.archive.modules.extractor | |
Modifier and Type | Method and Description |
---|---|
CrawlURI | CrawlURI.makeConsequentCandidate(String destination, LinkContext lc, Hop hop) Create a consequent CrawlURI from this one, given the additional parameters |
Modifier and Type | Method and Description |
---|---|
Hop | Link.getHopType() |
static Hop | Hop.valueOf(String name) Returns the enum constant of this type with the specified name. |
static Hop[] | Hop.values() Returns an array containing the constants of this enum type, in the order they are declared. |
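The `valueOf` and `values` methods listed above are the ones Java generates for every enum type. The sketch below shows how they behave, using a hypothetical stand-in enum `DemoHop` whose constant names are placeholders and not the constants of `Hop`. The helper `parseOrDefault` is likewise an illustration only and is not part of this API.

```java
import java.util.Arrays;

class HopEnumSketch {
    // Hypothetical stand-in for an enum such as Hop; these constant
    // names are placeholders, not the library's actual constants.
    enum DemoHop { FIRST, SECOND, THIRD }

    // Illustrative helper (an assumption, not part of the documented API):
    // resolve a name to a constant, falling back when the name is null
    // or does not exactly match a declared constant.
    static DemoHop parseOrDefault(String name, DemoHop fallback) {
        if (name == null) {
            return fallback; // valueOf(null) would throw NullPointerException
        }
        try {
            return DemoHop.valueOf(name); // exact, case-sensitive match
        } catch (IllegalArgumentException e) {
            return fallback; // no constant with that name
        }
    }

    public static void main(String[] args) {
        // values() returns the constants in declaration order
        System.out.println(Arrays.toString(DemoHop.values()));
        System.out.println(parseOrDefault("SECOND", DemoHop.FIRST));
        System.out.println(parseOrDefault("second", DemoHop.FIRST));
    }
}
```

Because `valueOf` matches names exactly, callers that accept names from configuration or user input may want a guard of this kind rather than calling `valueOf` directly.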
Modifier and Type | Method and Description |
---|---|
static void | Link.add(CrawlURI uri, int max, String newUri, LinkContext context, Hop hop) |
protected void | ExtractorHTML.addLinkFromString(CrawlURI curi, CharSequence uri, CharSequence context, Hop hop) |
protected void | Extractor.addOutlink(CrawlURI curi, String uri, LinkContext context, Hop hop) Create and add a 'Link' to the CrawlURI with given URI/context/hop-type |
static void | Link.addRelativeToBase(CrawlURI uri, int max, String newUri, LinkContext context, Hop hop) |
static void | Link.addRelativeToVia(CrawlURI uri, int max, String newUri, LinkContext context, Hop hop) |
protected void | ExtractorHTML.considerIfLikelyUri(CrawlURI curi, CharSequence candidate, CharSequence valueContext, Hop hop) Consider whether a given string is URI-like. |
protected void | ExtractorHTML.considerQueryStringValues(CrawlURI curi, CharSequence queryString, CharSequence valueContext, Hop hop) Consider a query-string-like collection of key=value[&key=value] pairs for URI-like strings in the values. |
protected void | ExtractorHTML.processEmbed(CrawlURI curi, CharSequence value, CharSequence context, Hop hop) |
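The description of `considerQueryStringValues` mentions scanning key=value[&key=value] pairs for URI-like values. The following is a minimal, self-contained sketch of that general idea only. It does not reproduce the library's implementation: the class `QueryValueSketch`, both method names, and the "URI-like" heuristic (a value starting with `http://`, `https://`, or `/`) are all assumptions made for illustration, and no URL decoding is attempted.

```java
import java.util.ArrayList;
import java.util.List;

class QueryValueSketch {
    // Assumed heuristic for illustration; the real extractor's test for
    // "URI-like" strings may differ.
    static boolean looksLikeUri(String value) {
        return value.startsWith("http://")
                || value.startsWith("https://")
                || value.startsWith("/");
    }

    // Split a query-string-like input on '&', then on the first '=' of
    // each pair, and collect the values that pass the heuristic.
    static List<String> uriLikeValues(String queryString) {
        List<String> found = new ArrayList<>();
        for (String pair : queryString.split("&")) {
            int eq = pair.indexOf('=');
            if (eq < 0) {
                continue; // a bare key has no value to inspect
            }
            String value = pair.substring(eq + 1);
            if (looksLikeUri(value)) {
                found.add(value);
            }
        }
        return found;
    }

    public static void main(String[] args) {
        String query = "a=1&next=/home&img=http://example.com/x.png&flag";
        for (String candidate : uriLikeValues(query)) {
            System.out.println(candidate);
        }
    }
}
```

In a real extractor, each value that passes the check would presumably be handed on as a candidate link together with the supplied context and `Hop`; the sketch stops at collecting the strings.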
Constructor and Description |
---|
Link(CharSequence source, CharSequence destination, LinkContext context, Hop hop) Create a Link with the given fields. |
Copyright © 2003-2014 Internet Archive. All Rights Reserved.