org.creativecommons.nutch
Class CCParseFilter.Walker
java.lang.Object
org.creativecommons.nutch.CCParseFilter.Walker
- Enclosing class:
- CCParseFilter
public static class CCParseFilter.Walker
- extends Object
Walks the DOM tree, looking for RDF in comments and licenses in anchors.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
walk
public static void walk(Node doc,
                        URL base,
                        WebPage page,
                        Configuration conf)
                 throws ParseException
- Scans the document, adding attributes to the metadata.
- Throws:
ParseException
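The traversal described above can be illustrated with a minimal sketch that uses only the JDK's org.w3c.dom classes. It does not reproduce the actual CCParseFilter.Walker code. The class name WalkerSketch, the scan helper, the metadata keys, and the matching rules (a comment containing "<rdf:RDF", an anchor whose href contains "creativecommons.org/licenses/") are all assumptions made for illustration, and a plain Map stands in for the WebPage metadata.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Comment;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

class WalkerSketch {

    // Hypothetical metadata keys, chosen for this sketch only.
    static final String KEY_RDF = "sketch.rdf.found";
    static final String KEY_LICENSE = "sketch.license.url";

    /** Recursively visits a node and its descendants, recording findings in metadata. */
    static void walk(Node node, Map<String, String> metadata) {
        if (node instanceof Comment) {
            // Assumed rule: a comment that embeds an rdf:RDF element counts as RDF.
            String text = ((Comment) node).getData();
            if (text.contains("<rdf:RDF")) {
                metadata.put(KEY_RDF, "true");
            }
        } else if (node instanceof Element) {
            Element el = (Element) node;
            if ("a".equalsIgnoreCase(el.getTagName())) {
                // getAttribute returns an empty string when the attribute is absent.
                String href = el.getAttribute("href");
                // Assumed rule: the first anchor pointing at a license URL wins.
                if (href.contains("creativecommons.org/licenses/")
                        && !metadata.containsKey(KEY_LICENSE)) {
                    metadata.put(KEY_LICENSE, href);
                }
            }
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            walk(children.item(i), metadata);
        }
    }

    /** Parses a well-formed XHTML string and walks the resulting DOM tree. */
    static Map<String, String> scan(String xhtml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(
                            xhtml.getBytes(StandardCharsets.UTF_8)));
            Map<String, String> metadata = new LinkedHashMap<>();
            walk(doc, metadata);
            return metadata;
        } catch (Exception e) {
            throw new IllegalStateException("could not parse document", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body>"
                + "<!-- <rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'/> -->"
                + "<a href='http://creativecommons.org/licenses/by/3.0/'>CC BY</a>"
                + "</body></html>";
        System.out.println(scan(page));
    }
}
```

The sketch parses well-formed XHTML only. Real-world HTML would need a tolerant parser before a walk like this could run over it.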
Copyright © 2012 The Apache Software Foundation