|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Uses of Pluggable in org.apache.nutch.analysis.lang |
---|
Classes in org.apache.nutch.analysis.lang that implement Pluggable | |
---|---|
class |
HTMLLanguageParser
Adds metadata identifying language of document if found We could also run statistical analysis here but we'd miss all other formats |
class |
LanguageIndexingFilter
An IndexingFilter that adds a
lang (language) field to the document. |
Uses of Pluggable in org.apache.nutch.collection |
---|
Classes in org.apache.nutch.collection that implement Pluggable | |
---|---|
class |
Subcollection
SubCollection represents a subset of index, you can define url patterns that will indicate that particular page (url) is part of SubCollection. |
Uses of Pluggable in org.apache.nutch.indexer |
---|
Subinterfaces of Pluggable in org.apache.nutch.indexer | |
---|---|
interface |
IndexingFilter
Extension point for indexing. |
Uses of Pluggable in org.apache.nutch.indexer.anchor |
---|
Classes in org.apache.nutch.indexer.anchor that implement Pluggable | |
---|---|
class |
AnchorIndexingFilter
Indexing filter that indexes all inbound anchor text for a document. |
Uses of Pluggable in org.apache.nutch.indexer.basic |
---|
Classes in org.apache.nutch.indexer.basic that implement Pluggable | |
---|---|
class |
BasicIndexingFilter
Adds basic searchable fields to a document. |
Uses of Pluggable in org.apache.nutch.indexer.feed |
---|
Classes in org.apache.nutch.indexer.feed that implement Pluggable | |
---|---|
class |
FeedIndexingFilter
|
Uses of Pluggable in org.apache.nutch.indexer.more |
---|
Classes in org.apache.nutch.indexer.more that implement Pluggable | |
---|---|
class |
MoreIndexingFilter
Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be displayed by more.jsp (called by search.jsp). |
Uses of Pluggable in org.apache.nutch.indexer.subcollection |
---|
Classes in org.apache.nutch.indexer.subcollection that implement Pluggable | |
---|---|
class |
SubcollectionIndexingFilter
|
Uses of Pluggable in org.apache.nutch.indexer.tld |
---|
Classes in org.apache.nutch.indexer.tld that implement Pluggable | |
---|---|
class |
TLDIndexingFilter
Adds the Top level domain extensions to the index |
Uses of Pluggable in org.apache.nutch.microformats.reltag |
---|
Classes in org.apache.nutch.microformats.reltag that implement Pluggable | |
---|---|
class |
RelTagIndexingFilter
An IndexingFilter that add tag
field(s) to the document. |
class |
RelTagParser
Adds microformat rel-tags of document if found. |
Uses of Pluggable in org.apache.nutch.net |
---|
Subinterfaces of Pluggable in org.apache.nutch.net | |
---|---|
interface |
URLFilter
Interface used to limit which URLs enter Nutch. |
Uses of Pluggable in org.apache.nutch.parse |
---|
Subinterfaces of Pluggable in org.apache.nutch.parse | |
---|---|
interface |
ParseFilter
Extension point for DOM-based parsers. |
interface |
Parser
A parser for content generated by a Protocol
implementation. |
Uses of Pluggable in org.apache.nutch.parse.ext |
---|
Classes in org.apache.nutch.parse.ext that implement Pluggable | |
---|---|
class |
ExtParser
A wrapper that invokes external command to do real parsing job. |
Uses of Pluggable in org.apache.nutch.parse.feed |
---|
Classes in org.apache.nutch.parse.feed that implement Pluggable | |
---|---|
class |
FeedParser
|
Uses of Pluggable in org.apache.nutch.parse.html |
---|
Classes in org.apache.nutch.parse.html that implement Pluggable | |
---|---|
class |
HtmlParser
|
Uses of Pluggable in org.apache.nutch.parse.js |
---|
Classes in org.apache.nutch.parse.js that implement Pluggable | |
---|---|
class |
JSParseFilter
This class is a heuristic link extractor for JavaScript files and code snippets. |
Uses of Pluggable in org.apache.nutch.parse.swf |
---|
Classes in org.apache.nutch.parse.swf that implement Pluggable | |
---|---|
class |
SWFParser
Parser for Flash SWF files. |
Uses of Pluggable in org.apache.nutch.parse.tika |
---|
Classes in org.apache.nutch.parse.tika that implement Pluggable | |
---|---|
class |
TikaParser
Wrapper for Tika parsers. |
Uses of Pluggable in org.apache.nutch.parse.zip |
---|
Classes in org.apache.nutch.parse.zip that implement Pluggable | |
---|---|
class |
ZipParser
ZipParser class based on MSPowerPointParser class by Stephan Strittmatter. |
Uses of Pluggable in org.apache.nutch.plugin |
---|
Subinterfaces of Pluggable in org.apache.nutch.plugin | |
---|---|
interface |
FieldPluggable
|
Uses of Pluggable in org.apache.nutch.protocol |
---|
Subinterfaces of Pluggable in org.apache.nutch.protocol | |
---|---|
interface |
Protocol
A retriever of url content. |
Uses of Pluggable in org.apache.nutch.protocol.file |
---|
Classes in org.apache.nutch.protocol.file that implement Pluggable | |
---|---|
class |
File
File.java deals with file: scheme. |
Uses of Pluggable in org.apache.nutch.protocol.ftp |
---|
Classes in org.apache.nutch.protocol.ftp that implement Pluggable | |
---|---|
class |
Ftp
Ftp.java deals with ftp: scheme. |
Uses of Pluggable in org.apache.nutch.protocol.http |
---|
Classes in org.apache.nutch.protocol.http that implement Pluggable | |
---|---|
class |
Http
|
Uses of Pluggable in org.apache.nutch.protocol.http.api |
---|
Classes in org.apache.nutch.protocol.http.api that implement Pluggable | |
---|---|
class |
HttpBase
|
Uses of Pluggable in org.apache.nutch.protocol.sftp |
---|
Classes in org.apache.nutch.protocol.sftp that implement Pluggable | |
---|---|
class |
Sftp
This class uses the Jsch package to fetch content using the Sftp protocol. |
Uses of Pluggable in org.apache.nutch.scoring |
---|
Subinterfaces of Pluggable in org.apache.nutch.scoring | |
---|---|
interface |
ScoringFilter
A contract defining behavior of scoring plugins. |
Classes in org.apache.nutch.scoring that implement Pluggable | |
---|---|
class |
ScoringFilters
Creates and caches ScoringFilter implementing plugins. |
Uses of Pluggable in org.apache.nutch.scoring.link |
---|
Classes in org.apache.nutch.scoring.link that implement Pluggable | |
---|---|
class |
LinkAnalysisScoringFilter
|
Uses of Pluggable in org.apache.nutch.scoring.opic |
---|
Classes in org.apache.nutch.scoring.opic that implement Pluggable | |
---|---|
class |
OPICScoringFilter
This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation . |
Uses of Pluggable in org.apache.nutch.scoring.tld |
---|
Classes in org.apache.nutch.scoring.tld that implement Pluggable | |
---|---|
class |
TLDScoringFilter
Scoring filter to boost tlds. |
Uses of Pluggable in org.apache.nutch.urlfilter.api |
---|
Classes in org.apache.nutch.urlfilter.api that implement Pluggable | |
---|---|
class |
RegexURLFilterBase
Generic URL filter based on
regular expressions. |
Uses of Pluggable in org.apache.nutch.urlfilter.automaton |
---|
Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable | |
---|---|
class |
AutomatonURLFilter
RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for JavaTM. |
Uses of Pluggable in org.apache.nutch.urlfilter.domain |
---|
Classes in org.apache.nutch.urlfilter.domain that implement Pluggable | |
---|---|
class |
DomainURLFilter
Filters URLs based on a file containing domain suffixes, domain names, and hostnames. |
Uses of Pluggable in org.apache.nutch.urlfilter.prefix |
---|
Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable | |
---|---|
class |
PrefixURLFilter
Filters URLs based on a file of URL prefixes. |
Uses of Pluggable in org.apache.nutch.urlfilter.regex |
---|
Classes in org.apache.nutch.urlfilter.regex that implement Pluggable | |
---|---|
class |
RegexURLFilter
Filters URLs based on a file of regular expressions using the Java Regex implementation . |
Uses of Pluggable in org.apache.nutch.urlfilter.suffix |
---|
Classes in org.apache.nutch.urlfilter.suffix that implement Pluggable | |
---|---|
class |
SuffixURLFilter
Filters URLs based on a file of URL suffixes. |
Uses of Pluggable in org.apache.nutch.urlfilter.validator |
---|
Classes in org.apache.nutch.urlfilter.validator that implement Pluggable | |
---|---|
class |
UrlValidator
Validates URLs. |
Uses of Pluggable in org.creativecommons.nutch |
---|
Classes in org.creativecommons.nutch that implement Pluggable | |
---|---|
class |
CCIndexingFilter
Adds basic searchable fields to a document. |
class |
CCParseFilter
Adds metadata identifying the Creative Commons license used, if any. |
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |