Uses of Interface
org.apache.nutch.parse.ParseFilter

Packages that use ParseFilter
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin. 
org.apache.nutch.parse.js   
org.creativecommons.nutch Sample plugins that parse and index Creative Commons metadata. 
 

Uses of ParseFilter in org.apache.nutch.analysis.lang
 

Classes in org.apache.nutch.analysis.lang that implement ParseFilter
 class HTMLLanguageParser
          Adds metadata identifying the language of the document, if found. We could also run statistical analysis here, but we'd miss all other formats.
 

Uses of ParseFilter in org.apache.nutch.microformats.reltag
 

Classes in org.apache.nutch.microformats.reltag that implement ParseFilter
 class RelTagParser
          Adds microformat rel-tags of document if found.
 

Uses of ParseFilter in org.apache.nutch.parse.js
 

Classes in org.apache.nutch.parse.js that implement ParseFilter
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.
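
          To illustrate what "heuristic link extraction" from JavaScript can mean, here is a minimal, self-contained sketch. It is not JSParseFilter's actual algorithm and does not use the Nutch API; the class name JsLinkSketch and the method extractLinks are hypothetical. It assumes that a quoted string literal beginning with http:// or https:// is a link candidate.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of Nutch.
class JsLinkSketch {
    // A single- or double-quoted string literal containing no backslash escapes.
    private static final Pattern STRING_LITERAL =
        Pattern.compile("\"([^\"\\\\]*)\"|'([^'\\\\]*)'");

    // Assumption: only absolute http(s) URLs without whitespace count as links.
    private static final Pattern ABSOLUTE_URL =
        Pattern.compile("https?://\\S+");

    // Returns candidate links in order of first appearance, without duplicates.
    static List<String> extractLinks(String js) {
        List<String> links = new ArrayList<>();
        Matcher m = STRING_LITERAL.matcher(js);
        while (m.find()) {
            // Exactly one of the two groups is non-null for each match.
            String literal = m.group(1) != null ? m.group(1) : m.group(2);
            if (ABSOLUTE_URL.matcher(literal).matches() && !links.contains(literal)) {
                links.add(literal);
            }
        }
        return links;
    }
}
```

          A heuristic of this kind can miss URLs that are assembled at runtime by string concatenation, and can report strings that are never used as links, which is why such an extractor is described as heuristic rather than exact.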
 

Uses of ParseFilter in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement ParseFilter
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 



Copyright © 2012 The Apache Software Foundation