org.apache.nutch.crawl
Class WebTableReader
java.lang.Object
org.apache.hadoop.conf.Configured
org.apache.nutch.util.NutchTool
org.apache.nutch.crawl.WebTableReader
- All Implemented Interfaces:
- Configurable, Tool
public class WebTableReader
- extends NutchTool
- implements Tool
Displays information about the entries of the webtable.
Field Summary
static org.slf4j.Logger LOG
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
LOG
public static final org.slf4j.Logger LOG
WebTableReader
public WebTableReader()
processStatJob
public void processStatJob(boolean sort)
throws Exception
- Throws:
Exception
processDumpJob
public void processDumpJob(String output,
Configuration config,
String regex,
boolean content,
boolean headers,
boolean links,
boolean text)
throws IOException,
ClassNotFoundException,
InterruptedException
- Throws:
IOException
ClassNotFoundException
InterruptedException
main
public static void main(String[] args)
throws Exception
- Throws:
Exception
run
public int run(String[] args)
throws Exception
- Specified by:
run
in interface Tool
- Throws:
Exception
run
public Map<String,Object> run(Map<String,Object> args)
throws Exception
- Description copied from class:
NutchTool
- Runs the tool, using a map of arguments.
May return results, or null.
- Specified by:
run
in class NutchTool
- Throws:
Exception
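Because the map-based run may return results or null, callers need to handle both cases. The following is a minimal sketch of that calling pattern only, not the Nutch implementation: MapTool is a hypothetical stand-in for the NutchTool contract so the example compiles without Nutch on the classpath, and the "regex" argument key is an assumption made for illustration.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the map-based contract described above.
// It is NOT the real org.apache.nutch.util.NutchTool.
interface MapTool {
    // Runs the tool using a map of arguments; may return results, or null.
    Map<String, Object> run(Map<String, Object> args) throws Exception;
}

class MapToolSketch {
    // Normalizes a null result to an empty map so callers can read it safely.
    static Map<String, Object> runSafely(MapTool tool, Map<String, Object> args)
            throws Exception {
        Map<String, Object> results = tool.run(args);
        return results == null
                ? Collections.<String, Object>emptyMap()
                : results;
    }

    public static void main(String[] argv) throws Exception {
        Map<String, Object> args = new HashMap<>();
        args.put("regex", ".*"); // hypothetical key, for illustration only

        // A tool that echoes its arguments back as results.
        MapTool echo = a -> new HashMap<>(a);
        // A tool that returns no results at all.
        MapTool silent = a -> null;

        System.out.println(runSafely(echo, args));
        System.out.println(runSafely(silent, args));
    }
}
```

Normalizing null to an empty map at the call site keeps the null check in one place; a caller that must distinguish "no results" from "empty results" would check for null directly instead.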
Copyright © 2012 The Apache Software Foundation