org.apache.nutch.microformats.reltag
Class RelTagParser
java.lang.Object
org.apache.nutch.microformats.reltag.RelTagParser
- All Implemented Interfaces:
- Configurable, ParseFilter, FieldPluggable, Pluggable
public class RelTagParser
- extends Object
- implements ParseFilter
Adds microformat rel-tags of document if found.
- Author:
- Jérôme Charron
- See Also:
-
http://www.microformats.org/wiki/rel-tag
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
LOG
public static final org.slf4j.Logger LOG
REL_TAG
public static final String REL_TAG
- See Also:
- Constant Field Values
RelTagParser
public RelTagParser()
setConf
public void setConf(Configuration conf)
- Specified by:
setConf
in interface Configurable
getConf
public Configuration getConf()
- Specified by:
getConf
in interface Configurable
getFields
public Collection<WebPage.Field> getFields()
- Specified by:
getFields
in interface FieldPluggable
filter
public Parse filter(String url,
WebPage page,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
- Description copied from interface:
ParseFilter
- Adds metadata or otherwise modifies a parse, given
the DOM tree of a page.
- Specified by:
filter
in interface ParseFilter
Copyright © 2012 The Apache Software Foundation