org.creativecommons.nutch
Class CCParseFilter.Walker

java.lang.Object
  extended by org.creativecommons.nutch.CCParseFilter.Walker
Enclosing class:
CCParseFilter

public static class CCParseFilter.Walker
extends Object

Walks the DOM tree, looking for RDF in comments and licenses in anchors.


Method Summary
static void walk(Node doc, URL base, WebPage page, Configuration conf)
          Scans the document, adding attributes to metadata.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Method Detail

walk

public static void walk(Node doc,
                        URL base,
                        WebPage page,
                        Configuration conf)
                 throws ParseException
Scans the document, adding attributes to metadata.

Throws:
ParseException
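
The traversal described for this class (RDF in comments, licenses in anchors) can be illustrated with a minimal sketch that uses only the JDK's org.w3c.dom API. This is not the Nutch implementation: the class LicenseWalkSketch, the Findings holder, the scan helper, the "rdf:RDF" substring check, and the rel="license" test are all assumptions made for illustration, and the sketch collects results into plain lists instead of writing to a WebPage. The URI base parameter is likewise an assumption about why walk takes a base URL, namely to resolve relative links.

```java
import java.io.StringReader;
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Comment;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Illustrative sketch only; none of these names belong to Nutch. */
final class LicenseWalkSketch {

    /** Hypothetical result holder standing in for page metadata. */
    static final class Findings {
        final List<String> rdfComments = new ArrayList<>();
        final List<String> licenseLinks = new ArrayList<>();
    }

    /** Depth-first walk: inspect this node, then recurse into its children. */
    static void walk(Node node, URI base, Findings out) {
        if (node.getNodeType() == Node.COMMENT_NODE) {
            // Assumption: a comment counts as RDF if it mentions rdf:RDF.
            String text = ((Comment) node).getData();
            if (text.contains("rdf:RDF")) {
                out.rdfComments.add(text.trim());
            }
        } else if (node.getNodeType() == Node.ELEMENT_NODE) {
            Element el = (Element) node;
            // Assumption: a license anchor is <a rel="license" href="...">.
            if ("a".equalsIgnoreCase(el.getTagName())
                    && "license".equalsIgnoreCase(el.getAttribute("rel"))
                    && el.hasAttribute("href")) {
                // Resolve relative links against the page's base URI.
                out.licenseLinks.add(base.resolve(el.getAttribute("href")).toString());
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            walk(children.item(i), base, out);
        }
    }

    /** Parses well-formed XHTML and walks it; wraps parser errors unchecked. */
    static Findings scan(URI base, String xhtml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)));
            Findings findings = new Findings();
            walk(doc, base, findings);
            return findings;
        } catch (Exception e) {
            throw new IllegalStateException("could not parse document", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body>"
                + "<!-- <rdf:RDF><License/></rdf:RDF> -->"
                + "<p><a rel=\"license\" href=\"/licenses/by/3.0/\">CC BY</a></p>"
                + "</body></html>";
        Findings f = scan(URI.create("http://example.org/page.html"), page);
        System.out.println("RDF comments: " + f.rdfComments);
        System.out.println("License links: " + f.licenseLinks);
    }
}
```

The sketch parses strict XHTML so that it stays within the JDK. Real-world HTML would need a tolerant parser to produce the DOM Node that walk receives.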


Copyright © 2012 The Apache Software Foundation