org.apache.nutch.parse
Class ParseFilters
java.lang.Object
org.apache.nutch.parse.ParseFilters
public class ParseFilters
- extends Object
Creates and caches ParseFilter
implementing plugins.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
HTMLPARSEFILTER_ORDER
public static final String HTMLPARSEFILTER_ORDER
- See Also:
- Constant Field Values
ParseFilters
public ParseFilters(Configuration conf)
filter
public Parse filter(String url,
WebPage page,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
- Run all defined filters.
getFields
public Collection<WebPage.Field> getFields()
Copyright © 2012 The Apache Software Foundation