Uses of Interface
org.apache.nutch.plugin.Pluggable

Packages that use Pluggable
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.collection Subcollection is a subset of an index. 
org.apache.nutch.indexer Maintain Lucene full-text indexes. 
org.apache.nutch.indexer.anchor An indexing plugin for inbound anchor text. 
org.apache.nutch.indexer.basic A basic indexing plugin. 
org.apache.nutch.indexer.feed   
org.apache.nutch.indexer.more A more indexing plugin. 
org.apache.nutch.indexer.subcollection   
org.apache.nutch.indexer.tld Top Level Domain Indexing plugin. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin. 
org.apache.nutch.net   
org.apache.nutch.parse   
org.apache.nutch.parse.ext   
org.apache.nutch.parse.feed   
org.apache.nutch.parse.html An HTML document parsing plugin. 
org.apache.nutch.parse.js   
org.apache.nutch.parse.swf   
org.apache.nutch.parse.tika   
org.apache.nutch.parse.zip   
org.apache.nutch.plugin The Nutch Plugin System. 
org.apache.nutch.protocol   
org.apache.nutch.protocol.file Protocol plugin which supports retrieving local file resources. 
org.apache.nutch.protocol.ftp Protocol plugin which supports retrieving documents via the ftp protocol. 
org.apache.nutch.protocol.http Protocol plugin which supports retrieving documents via the http protocol. 
org.apache.nutch.protocol.http.api Common API used by HTTP plugins (http, httpclient
org.apache.nutch.protocol.sftp Protocol plugin which supports retrieving documents via the sftp protocol. 
org.apache.nutch.scoring   
org.apache.nutch.scoring.link   
org.apache.nutch.scoring.opic   
org.apache.nutch.scoring.tld Top Level Domain Scoring plugin. 
org.apache.nutch.urlfilter.api   
org.apache.nutch.urlfilter.automaton A url filter plugin based on dk.brics.automaton Finite-State Automata for JavaTM
org.apache.nutch.urlfilter.domain A url filter plugin that filters by domain. 
org.apache.nutch.urlfilter.prefix A url filter plugin. 
org.apache.nutch.urlfilter.regex A url filter plugin. 
org.apache.nutch.urlfilter.suffix   
org.apache.nutch.urlfilter.validator A url filter plugin that validates given urls. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons medadata. 
 

Uses of Pluggable in org.apache.nutch.analysis.lang
 

Classes in org.apache.nutch.analysis.lang that implement Pluggable
 class HTMLLanguageParser
          Adds metadata identifying language of document if found We could also run statistical analysis here but we'd miss all other formats
 class LanguageIndexingFilter
          An IndexingFilter that adds a lang (language) field to the document.
 

Uses of Pluggable in org.apache.nutch.collection
 

Classes in org.apache.nutch.collection that implement Pluggable
 class Subcollection
          SubCollection represents a subset of index, you can define url patterns that will indicate that particular page (url) is part of SubCollection.
 

Uses of Pluggable in org.apache.nutch.indexer
 

Subinterfaces of Pluggable in org.apache.nutch.indexer
 interface IndexingFilter
          Extension point for indexing.
 

Uses of Pluggable in org.apache.nutch.indexer.anchor
 

Classes in org.apache.nutch.indexer.anchor that implement Pluggable
 class AnchorIndexingFilter
          Indexing filter that indexes all inbound anchor text for a document.
 

Uses of Pluggable in org.apache.nutch.indexer.basic
 

Classes in org.apache.nutch.indexer.basic that implement Pluggable
 class BasicIndexingFilter
          Adds basic searchable fields to a document.
 

Uses of Pluggable in org.apache.nutch.indexer.feed
 

Classes in org.apache.nutch.indexer.feed that implement Pluggable
 class FeedIndexingFilter
           
 

Uses of Pluggable in org.apache.nutch.indexer.more
 

Classes in org.apache.nutch.indexer.more that implement Pluggable
 class MoreIndexingFilter
          Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be displayed by more.jsp (called by search.jsp).
 

Uses of Pluggable in org.apache.nutch.indexer.subcollection
 

Classes in org.apache.nutch.indexer.subcollection that implement Pluggable
 class SubcollectionIndexingFilter
           
 

Uses of Pluggable in org.apache.nutch.indexer.tld
 

Classes in org.apache.nutch.indexer.tld that implement Pluggable
 class TLDIndexingFilter
          Adds the Top level domain extensions to the index
 

Uses of Pluggable in org.apache.nutch.microformats.reltag
 

Classes in org.apache.nutch.microformats.reltag that implement Pluggable
 class RelTagIndexingFilter
          An IndexingFilter that add tag field(s) to the document.
 class RelTagParser
          Adds microformat rel-tags of document if found.
 

Uses of Pluggable in org.apache.nutch.net
 

Subinterfaces of Pluggable in org.apache.nutch.net
 interface URLFilter
          Interface used to limit which URLs enter Nutch.
 

Uses of Pluggable in org.apache.nutch.parse
 

Subinterfaces of Pluggable in org.apache.nutch.parse
 interface ParseFilter
          Extension point for DOM-based parsers.
 interface Parser
          A parser for content generated by a Protocol implementation.
 

Uses of Pluggable in org.apache.nutch.parse.ext
 

Classes in org.apache.nutch.parse.ext that implement Pluggable
 class ExtParser
          A wrapper that invokes external command to do real parsing job.
 

Uses of Pluggable in org.apache.nutch.parse.feed
 

Classes in org.apache.nutch.parse.feed that implement Pluggable
 class FeedParser
           
 

Uses of Pluggable in org.apache.nutch.parse.html
 

Classes in org.apache.nutch.parse.html that implement Pluggable
 class HtmlParser
           
 

Uses of Pluggable in org.apache.nutch.parse.js
 

Classes in org.apache.nutch.parse.js that implement Pluggable
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.
 

Uses of Pluggable in org.apache.nutch.parse.swf
 

Classes in org.apache.nutch.parse.swf that implement Pluggable
 class SWFParser
          Parser for Flash SWF files.
 

Uses of Pluggable in org.apache.nutch.parse.tika
 

Classes in org.apache.nutch.parse.tika that implement Pluggable
 class TikaParser
          Wrapper for Tika parsers.
 

Uses of Pluggable in org.apache.nutch.parse.zip
 

Classes in org.apache.nutch.parse.zip that implement Pluggable
 class ZipParser
          ZipParser class based on MSPowerPointParser class by Stephan Strittmatter.
 

Uses of Pluggable in org.apache.nutch.plugin
 

Subinterfaces of Pluggable in org.apache.nutch.plugin
 interface FieldPluggable
           
 

Uses of Pluggable in org.apache.nutch.protocol
 

Subinterfaces of Pluggable in org.apache.nutch.protocol
 interface Protocol
          A retriever of url content.
 

Uses of Pluggable in org.apache.nutch.protocol.file
 

Classes in org.apache.nutch.protocol.file that implement Pluggable
 class File
          File.java deals with file: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.ftp
 

Classes in org.apache.nutch.protocol.ftp that implement Pluggable
 class Ftp
          Ftp.java deals with ftp: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.http
 

Classes in org.apache.nutch.protocol.http that implement Pluggable
 class Http
           
 

Uses of Pluggable in org.apache.nutch.protocol.http.api
 

Classes in org.apache.nutch.protocol.http.api that implement Pluggable
 class HttpBase
           
 

Uses of Pluggable in org.apache.nutch.protocol.sftp
 

Classes in org.apache.nutch.protocol.sftp that implement Pluggable
 class Sftp
          This class uses the Jsch package to fetch content using the Sftp protocol.
 

Uses of Pluggable in org.apache.nutch.scoring
 

Subinterfaces of Pluggable in org.apache.nutch.scoring
 interface ScoringFilter
          A contract defining behavior of scoring plugins.
 

Classes in org.apache.nutch.scoring that implement Pluggable
 class ScoringFilters
          Creates and caches ScoringFilter implementing plugins.
 

Uses of Pluggable in org.apache.nutch.scoring.link
 

Classes in org.apache.nutch.scoring.link that implement Pluggable
 class LinkAnalysisScoringFilter
           
 

Uses of Pluggable in org.apache.nutch.scoring.opic
 

Classes in org.apache.nutch.scoring.opic that implement Pluggable
 class OPICScoringFilter
          This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation .
 

Uses of Pluggable in org.apache.nutch.scoring.tld
 

Classes in org.apache.nutch.scoring.tld that implement Pluggable
 class TLDScoringFilter
          Scoring filter to boost tlds.
 

Uses of Pluggable in org.apache.nutch.urlfilter.api
 

Classes in org.apache.nutch.urlfilter.api that implement Pluggable
 class RegexURLFilterBase
          Generic URL filter based on regular expressions.
 

Uses of Pluggable in org.apache.nutch.urlfilter.automaton
 

Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable
 class AutomatonURLFilter
          RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for JavaTM.
 

Uses of Pluggable in org.apache.nutch.urlfilter.domain
 

Classes in org.apache.nutch.urlfilter.domain that implement Pluggable
 class DomainURLFilter
          Filters URLs based on a file containing domain suffixes, domain names, and hostnames.
 

Uses of Pluggable in org.apache.nutch.urlfilter.prefix
 

Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable
 class PrefixURLFilter
          Filters URLs based on a file of URL prefixes.
 

Uses of Pluggable in org.apache.nutch.urlfilter.regex
 

Classes in org.apache.nutch.urlfilter.regex that implement Pluggable
 class RegexURLFilter
          Filters URLs based on a file of regular expressions using the Java Regex implementation.
 

Uses of Pluggable in org.apache.nutch.urlfilter.suffix
 

Classes in org.apache.nutch.urlfilter.suffix that implement Pluggable
 class SuffixURLFilter
          Filters URLs based on a file of URL suffixes.
 

Uses of Pluggable in org.apache.nutch.urlfilter.validator
 

Classes in org.apache.nutch.urlfilter.validator that implement Pluggable
 class UrlValidator
          Validates URLs.
 

Uses of Pluggable in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement Pluggable
 class CCIndexingFilter
          Adds basic searchable fields to a document.
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 



Copyright © 2012 The Apache Software Foundation