org.apache.nutch.indexer
Class IndexUtil
java.lang.Object
org.apache.nutch.indexer.IndexUtil
public class IndexUtil
- extends Object
Utility to create an indexed document from a webpage.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
IndexUtil
public IndexUtil(Configuration conf)
index
public NutchDocument index(String key,
WebPage page)
- Index a webpage.
- Parameters:
key
- The key of the page (reversed url).page
- The webpage.
- Returns:
- The indexed document, or null if skipped by index filters.
Copyright © 2012 The Apache Software Foundation