org.apache.nutch.indexer
Class IndexUtil

java.lang.Object
  extended by org.apache.nutch.indexer.IndexUtil

public class IndexUtil
extends Object

Utility to create an indexed document from a webpage.


Constructor Summary
IndexUtil(Configuration conf)
           
 
Method Summary
 NutchDocument index(String key, WebPage page)
          Builds the indexed document for a webpage, or returns null if index filters skip it.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

IndexUtil

public IndexUtil(Configuration conf)

Method Detail

index

public NutchDocument index(String key,
                           WebPage page)
Builds the indexed document for a webpage by running it through the index filters.

Parameters:
key - The key of the page (its reversed URL).
page - The webpage.
Returns:
The indexed document, or null if skipped by index filters.
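
Because index can return null, callers need a null check before using the result. The following is a minimal sketch of that pattern, not code from Nutch: the class IndexAllSketch and the helper indexAll are hypothetical names, the indexer is passed in as a generic function so the block compiles without Nutch on the classpath, and the keys in main are placeholder strings rather than a statement of the exact reversed-URL format.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiFunction;

public class IndexAllSketch {

    /**
     * Hypothetical helper (not part of Nutch): applies an index function to
     * every (key, page) pair and keeps only the non-null results, mirroring
     * the documented contract that a skipped page yields null.
     */
    public static <P, D> List<D> indexAll(Map<String, P> pages,
                                          BiFunction<String, P, D> indexer) {
        List<D> docs = new ArrayList<>();
        for (Map.Entry<String, P> entry : pages.entrySet()) {
            D doc = indexer.apply(entry.getKey(), entry.getValue());
            if (doc != null) {   // null means the page was skipped
                docs.add(doc);
            }
        }
        return docs;
    }

    public static void main(String[] args) {
        // Stand-in data: plain strings play the role of WebPage objects.
        Map<String, String> pages = new LinkedHashMap<>();
        pages.put("com.example:http/a", "some text");
        pages.put("com.example:http/b", "");

        // Stand-in indexer: skips empty pages by returning null.
        List<String> docs = indexAll(pages,
                (key, page) -> page.isEmpty() ? null : key + " -> " + page);

        System.out.println(docs.size() + " of " + pages.size() + " pages indexed");
    }
}
```

With Nutch on the classpath, the same helper could presumably be called as indexAll(pages, indexUtil::index), where pages maps keys to WebPage objects and indexUtil is an IndexUtil instance, since index(String, WebPage) matches the BiFunction shape.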


Copyright © 2012 The Apache Software Foundation