org.apache.nutch.parse.tika
Class HTMLMetaProcessor

java.lang.Object
  extended by org.apache.nutch.parse.tika.HTMLMetaProcessor

public class HTMLMetaProcessor
extends Object

Parses META directives from DOM trees. Specifically, this class handles Robots META directives (all, none, nofollow, noindex), BASE HREF tags, and HTTP-EQUIV no-cache instructions. All directives found are stored in an HTMLMetaTags instance.


Constructor Summary
HTMLMetaProcessor()
           
 
Method Summary
static void getMetaTags(HTMLMetaTags metaTags, Node node, URL currURL)
          Sets the indicators in metaTags to appropriate values, based on any META tags found under the given node.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLMetaProcessor

public HTMLMetaProcessor()
Method Detail

getMetaTags

public static final void getMetaTags(HTMLMetaTags metaTags,
                                     Node node,
                                     URL currURL)
Sets the indicators in metaTags to appropriate values, based on any META tags found under the given node.
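
The traversal described above can be illustrated with a minimal, JDK-only sketch. This is not the Nutch implementation: the class RobotsMetaSketch, its Flags holder, and the parse helper are hypothetical names introduced for illustration, and treating http-equiv="pragma" with a no-cache content value as the no-cache instruction is an assumption. Unlike getMetaTags, the sketch does not take a current URL and leaves the BASE HREF value unresolved.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical sketch; it only mirrors the idea of walking a DOM for META directives.
final class RobotsMetaSketch {

    // Hypothetical stand-in for a holder of the directives found.
    static final class Flags {
        boolean noIndex;
        boolean noFollow;
        boolean noCache;
        String baseHref; // raw attribute value, not resolved against any URL
    }

    // Recursively visits the node and its descendants, setting flags as tags are found.
    static void collect(Flags flags, Node node) {
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            Element el = (Element) node;
            String tag = el.getTagName().toLowerCase(Locale.ROOT);
            if (tag.equals("meta")) {
                // getAttribute returns "" when the attribute is absent.
                String name = el.getAttribute("name").toLowerCase(Locale.ROOT);
                String equiv = el.getAttribute("http-equiv").toLowerCase(Locale.ROOT);
                String content = el.getAttribute("content").toLowerCase(Locale.ROOT);
                if (name.equals("robots")) {
                    for (String directive : content.split(",")) {
                        switch (directive.trim()) {
                            case "none":
                                flags.noIndex = true;
                                flags.noFollow = true;
                                break;
                            case "noindex":
                                flags.noIndex = true;
                                break;
                            case "nofollow":
                                flags.noFollow = true;
                                break;
                            default:
                                // "all" and unknown directives leave the flags unchanged.
                                break;
                        }
                    }
                } else if (equiv.equals("pragma") && content.contains("no-cache")) {
                    // Assumption: this is the HTTP-EQUIV no-cache form of interest.
                    flags.noCache = true;
                }
            } else if (tag.equals("base") && el.hasAttribute("href")) {
                flags.baseHref = el.getAttribute("href");
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            collect(flags, children.item(i));
        }
    }

    // Convenience helper: parses well-formed XHTML text and collects its directives.
    static Flags parse(String xhtml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)));
            Flags flags = new Flags();
            collect(flags, doc);
            return flags;
        } catch (Exception e) {
            throw new IllegalArgumentException("Input is not well-formed XML", e);
        }
    }
}
```

Calling RobotsMetaSketch.parse on a page whose head contains a robots META tag with content "noindex, nofollow" yields a Flags object with both noIndex and noFollow set. The sketch uses the JDK XML parser, so it only accepts well-formed XHTML; real-world HTML would need a tolerant parser to produce the DOM first.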



Copyright © 2012 The Apache Software Foundation