org.apache.nutch.parse.html
Class HTMLMetaProcessor
java.lang.Object
org.apache.nutch.parse.html.HTMLMetaProcessor
public class HTMLMetaProcessor
extends Object
Class for parsing META directives from DOM trees. This class
specifically handles robots META directives (all, none, nofollow,
noindex), BASE HREF tags, and HTTP-EQUIV no-cache
instructions. All meta directives are stored in an HTMLMetaTags instance.
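The robots directives listed above can be illustrated with a small stand-alone sketch. It is not Nutch's implementation: the class RobotsMetaSketch, the Flags holder (a hypothetical stand-in for HTMLMetaTags), and the methods collect, apply, and parse are all assumptions made for illustration. It uses only the JDK's DOM parser and assumes well-formed XHTML input.

```java
import java.io.StringReader;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

public class RobotsMetaSketch {

    /** Hypothetical stand-in for HTMLMetaTags; holds only two robots flags. */
    static final class Flags {
        boolean noIndex;
        boolean noFollow;
    }

    /** Recursively visits node and its descendants, looking for <meta name="robots">. */
    static void collect(Flags flags, Node node) {
        if (node.getNodeType() == Node.ELEMENT_NODE
                && "meta".equalsIgnoreCase(node.getNodeName())) {
            Element meta = (Element) node;
            if ("robots".equalsIgnoreCase(meta.getAttribute("name"))) {
                apply(flags, meta.getAttribute("content"));
            }
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collect(flags, child);
        }
    }

    /** Applies a comma-separated robots content value to the flags. */
    static void apply(Flags flags, String content) {
        for (String token : content.toLowerCase(Locale.ROOT).split(",")) {
            switch (token.trim()) {
                case "none":
                    flags.noIndex = true;
                    flags.noFollow = true;
                    break;
                case "noindex":
                    flags.noIndex = true;
                    break;
                case "nofollow":
                    flags.noFollow = true;
                    break;
                default:
                    // "all" and unrecognized tokens leave the flags unchanged
                    break;
            }
        }
    }

    /** Parses well-formed XHTML and returns the robots flags found in it. */
    static Flags parse(String xhtml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)));
            Flags flags = new Flags();
            collect(flags, doc.getDocumentElement());
            return flags;
        } catch (Exception e) {
            throw new IllegalStateException("could not parse sample markup", e);
        }
    }

    public static void main(String[] args) {
        Flags flags = parse("<html><head>"
                + "<meta name=\"robots\" content=\"NOINDEX, follow\"/>"
                + "</head><body/></html>");
        System.out.println("noIndex=" + flags.noIndex + " noFollow=" + flags.noFollow);
    }
}
```

The sketch treats "none" as shorthand for noindex plus nofollow, and "all" as the default in which neither flag is set.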
Method Summary
static void
getMetaTags(HTMLMetaTags metaTags,
Node node,
URL currURL)
Sets the indicators in metaTags to appropriate
values, based on any META tags found under the given
node.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
HTMLMetaProcessor
public HTMLMetaProcessor()
getMetaTags
public static final void getMetaTags(HTMLMetaTags metaTags,
Node node,
URL currURL)
Sets the indicators in metaTags to appropriate
values, based on any META tags found under the given node.
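The page does not say how currURL is used. One plausible reading is that a BASE HREF value found in the document is resolved against the URL of the current page. The sketch below illustrates that idea under this assumption only. The class BaseHrefSketch and the method resolveBase are hypothetical, it works on strings and java.net.URI rather than java.net.URL, and the fallback to the current URL for a missing or malformed value is a choice made here, not documented Nutch behavior.

```java
import java.net.URI;

public class BaseHrefSketch {

    /**
     * Resolves a BASE HREF value against the URL of the current page.
     * Falls back to the current URL when the value is missing or malformed
     * (an assumption of this sketch).
     */
    static String resolveBase(String currUrl, String baseHref) {
        if (baseHref == null || baseHref.trim().isEmpty()) {
            return currUrl;
        }
        try {
            // URI.resolve handles absolute, root-relative, and relative references
            return URI.create(currUrl).resolve(baseHref.trim()).toString();
        } catch (IllegalArgumentException e) {
            // thrown when baseHref is not a syntactically valid URI reference
            return currUrl;
        }
    }

    public static void main(String[] args) {
        String page = "http://example.com/a/b.html";
        System.out.println(resolveBase(page, "/docs/"));
        System.out.println(resolveBase(page, "img/"));
    }
}
```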
Copyright © 2012 The Apache Software Foundation