|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||
| Uses of Pluggable in org.apache.nutch.analysis.lang |
|---|
| Classes in org.apache.nutch.analysis.lang that implement Pluggable | |
|---|---|
class |
HTMLLanguageParser
Adds metadata identifying language of document if found We could also run statistical analysis here but we'd miss all other formats |
class |
LanguageIndexingFilter
An IndexingFilter that adds a
lang (language) field to the document. |
| Uses of Pluggable in org.apache.nutch.collection |
|---|
| Classes in org.apache.nutch.collection that implement Pluggable | |
|---|---|
class |
Subcollection
SubCollection represents a subset of index, you can define url patterns that will indicate that particular page (url) is part of SubCollection. |
| Uses of Pluggable in org.apache.nutch.indexer |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.indexer | |
|---|---|
interface |
IndexingFilter
Extension point for indexing. |
| Uses of Pluggable in org.apache.nutch.indexer.anchor |
|---|
| Classes in org.apache.nutch.indexer.anchor that implement Pluggable | |
|---|---|
class |
AnchorIndexingFilter
Indexing filter that indexes all inbound anchor text for a document. |
| Uses of Pluggable in org.apache.nutch.indexer.basic |
|---|
| Classes in org.apache.nutch.indexer.basic that implement Pluggable | |
|---|---|
class |
BasicIndexingFilter
Adds basic searchable fields to a document. |
| Uses of Pluggable in org.apache.nutch.indexer.feed |
|---|
| Classes in org.apache.nutch.indexer.feed that implement Pluggable | |
|---|---|
class |
FeedIndexingFilter
|
| Uses of Pluggable in org.apache.nutch.indexer.more |
|---|
| Classes in org.apache.nutch.indexer.more that implement Pluggable | |
|---|---|
class |
MoreIndexingFilter
Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be displayed by more.jsp (called by search.jsp). |
| Uses of Pluggable in org.apache.nutch.indexer.subcollection |
|---|
| Classes in org.apache.nutch.indexer.subcollection that implement Pluggable | |
|---|---|
class |
SubcollectionIndexingFilter
|
| Uses of Pluggable in org.apache.nutch.indexer.tld |
|---|
| Classes in org.apache.nutch.indexer.tld that implement Pluggable | |
|---|---|
class |
TLDIndexingFilter
Adds the Top level domain extensions to the index |
| Uses of Pluggable in org.apache.nutch.microformats.reltag |
|---|
| Classes in org.apache.nutch.microformats.reltag that implement Pluggable | |
|---|---|
class |
RelTagIndexingFilter
An IndexingFilter that add tag
field(s) to the document. |
class |
RelTagParser
Adds microformat rel-tags of document if found. |
| Uses of Pluggable in org.apache.nutch.net |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.net | |
|---|---|
interface |
URLFilter
Interface used to limit which URLs enter Nutch. |
| Uses of Pluggable in org.apache.nutch.parse |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.parse | |
|---|---|
interface |
ParseFilter
Extension point for DOM-based parsers. |
interface |
Parser
A parser for content generated by a Protocol
implementation. |
| Uses of Pluggable in org.apache.nutch.parse.ext |
|---|
| Classes in org.apache.nutch.parse.ext that implement Pluggable | |
|---|---|
class |
ExtParser
A wrapper that invokes external command to do real parsing job. |
| Uses of Pluggable in org.apache.nutch.parse.feed |
|---|
| Classes in org.apache.nutch.parse.feed that implement Pluggable | |
|---|---|
class |
FeedParser
|
| Uses of Pluggable in org.apache.nutch.parse.html |
|---|
| Classes in org.apache.nutch.parse.html that implement Pluggable | |
|---|---|
class |
HtmlParser
|
| Uses of Pluggable in org.apache.nutch.parse.js |
|---|
| Classes in org.apache.nutch.parse.js that implement Pluggable | |
|---|---|
class |
JSParseFilter
This class is a heuristic link extractor for JavaScript files and code snippets. |
| Uses of Pluggable in org.apache.nutch.parse.swf |
|---|
| Classes in org.apache.nutch.parse.swf that implement Pluggable | |
|---|---|
class |
SWFParser
Parser for Flash SWF files. |
| Uses of Pluggable in org.apache.nutch.parse.tika |
|---|
| Classes in org.apache.nutch.parse.tika that implement Pluggable | |
|---|---|
class |
TikaParser
Wrapper for Tika parsers. |
| Uses of Pluggable in org.apache.nutch.parse.zip |
|---|
| Classes in org.apache.nutch.parse.zip that implement Pluggable | |
|---|---|
class |
ZipParser
ZipParser class based on MSPowerPointParser class by Stephan Strittmatter. |
| Uses of Pluggable in org.apache.nutch.plugin |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.plugin | |
|---|---|
interface |
FieldPluggable
|
| Uses of Pluggable in org.apache.nutch.protocol |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.protocol | |
|---|---|
interface |
Protocol
A retriever of url content. |
| Uses of Pluggable in org.apache.nutch.protocol.file |
|---|
| Classes in org.apache.nutch.protocol.file that implement Pluggable | |
|---|---|
class |
File
File.java deals with file: scheme. |
| Uses of Pluggable in org.apache.nutch.protocol.ftp |
|---|
| Classes in org.apache.nutch.protocol.ftp that implement Pluggable | |
|---|---|
class |
Ftp
Ftp.java deals with ftp: scheme. |
| Uses of Pluggable in org.apache.nutch.protocol.http |
|---|
| Classes in org.apache.nutch.protocol.http that implement Pluggable | |
|---|---|
class |
Http
|
| Uses of Pluggable in org.apache.nutch.protocol.http.api |
|---|
| Classes in org.apache.nutch.protocol.http.api that implement Pluggable | |
|---|---|
class |
HttpBase
|
| Uses of Pluggable in org.apache.nutch.protocol.sftp |
|---|
| Classes in org.apache.nutch.protocol.sftp that implement Pluggable | |
|---|---|
class |
Sftp
This class uses the Jsch package to fetch content using the Sftp protocol. |
| Uses of Pluggable in org.apache.nutch.scoring |
|---|
| Subinterfaces of Pluggable in org.apache.nutch.scoring | |
|---|---|
interface |
ScoringFilter
A contract defining behavior of scoring plugins. |
| Classes in org.apache.nutch.scoring that implement Pluggable | |
|---|---|
class |
ScoringFilters
Creates and caches ScoringFilter implementing plugins. |
| Uses of Pluggable in org.apache.nutch.scoring.link |
|---|
| Classes in org.apache.nutch.scoring.link that implement Pluggable | |
|---|---|
class |
LinkAnalysisScoringFilter
|
| Uses of Pluggable in org.apache.nutch.scoring.opic |
|---|
| Classes in org.apache.nutch.scoring.opic that implement Pluggable | |
|---|---|
class |
OPICScoringFilter
This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation . |
| Uses of Pluggable in org.apache.nutch.scoring.tld |
|---|
| Classes in org.apache.nutch.scoring.tld that implement Pluggable | |
|---|---|
class |
TLDScoringFilter
Scoring filter to boost tlds. |
| Uses of Pluggable in org.apache.nutch.urlfilter.api |
|---|
| Classes in org.apache.nutch.urlfilter.api that implement Pluggable | |
|---|---|
class |
RegexURLFilterBase
Generic URL filter based on
regular expressions. |
| Uses of Pluggable in org.apache.nutch.urlfilter.automaton |
|---|
| Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable | |
|---|---|
class |
AutomatonURLFilter
RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for JavaTM. |
| Uses of Pluggable in org.apache.nutch.urlfilter.domain |
|---|
| Classes in org.apache.nutch.urlfilter.domain that implement Pluggable | |
|---|---|
class |
DomainURLFilter
Filters URLs based on a file containing domain suffixes, domain names, and hostnames. |
| Uses of Pluggable in org.apache.nutch.urlfilter.prefix |
|---|
| Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable | |
|---|---|
class |
PrefixURLFilter
Filters URLs based on a file of URL prefixes. |
| Uses of Pluggable in org.apache.nutch.urlfilter.regex |
|---|
| Classes in org.apache.nutch.urlfilter.regex that implement Pluggable | |
|---|---|
class |
RegexURLFilter
Filters URLs based on a file of regular expressions using the Java Regex implementation. |
| Uses of Pluggable in org.apache.nutch.urlfilter.suffix |
|---|
| Classes in org.apache.nutch.urlfilter.suffix that implement Pluggable | |
|---|---|
class |
SuffixURLFilter
Filters URLs based on a file of URL suffixes. |
| Uses of Pluggable in org.apache.nutch.urlfilter.validator |
|---|
| Classes in org.apache.nutch.urlfilter.validator that implement Pluggable | |
|---|---|
class |
UrlValidator
Validates URLs. |
| Uses of Pluggable in org.creativecommons.nutch |
|---|
| Classes in org.creativecommons.nutch that implement Pluggable | |
|---|---|
class |
CCIndexingFilter
Adds basic searchable fields to a document. |
class |
CCParseFilter
Adds metadata identifying the Creative Commons license used, if any. |
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||