org.apache.nutch.crawl
Class GeneratorJob
java.lang.Object
org.apache.hadoop.conf.Configured
org.apache.nutch.util.NutchTool
org.apache.nutch.crawl.GeneratorJob
- All Implemented Interfaces:
- Configurable, Tool
public class GeneratorJob
- extends NutchTool
- implements Tool
Methods inherited from class java.lang.Object:
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
GENERATE_UPDATE_CRAWLDB
public static final String GENERATE_UPDATE_CRAWLDB
- See Also:
- Constant Field Values
GENERATOR_MIN_SCORE
public static final String GENERATOR_MIN_SCORE
- See Also:
- Constant Field Values
GENERATOR_FILTER
public static final String GENERATOR_FILTER
- See Also:
- Constant Field Values
GENERATOR_NORMALISE
public static final String GENERATOR_NORMALISE
- See Also:
- Constant Field Values
GENERATOR_MAX_COUNT
public static final String GENERATOR_MAX_COUNT
- See Also:
- Constant Field Values
GENERATOR_COUNT_MODE
public static final String GENERATOR_COUNT_MODE
- See Also:
- Constant Field Values
GENERATOR_COUNT_VALUE_DOMAIN
public static final String GENERATOR_COUNT_VALUE_DOMAIN
- See Also:
- Constant Field Values
GENERATOR_COUNT_VALUE_HOST
public static final String GENERATOR_COUNT_VALUE_HOST
- See Also:
- Constant Field Values
GENERATOR_COUNT_VALUE_IP
public static final String GENERATOR_COUNT_VALUE_IP
- See Also:
- Constant Field Values
GENERATOR_TOP_N
public static final String GENERATOR_TOP_N
- See Also:
- Constant Field Values
GENERATOR_CUR_TIME
public static final String GENERATOR_CUR_TIME
- See Also:
- Constant Field Values
GENERATOR_DELAY
public static final String GENERATOR_DELAY
- See Also:
- Constant Field Values
GENERATOR_RANDOM_SEED
public static final String GENERATOR_RANDOM_SEED
- See Also:
- Constant Field Values
BATCH_ID
public static final String BATCH_ID
- See Also:
- Constant Field Values
LOG
public static final org.slf4j.Logger LOG
GeneratorJob
public GeneratorJob()
GeneratorJob
public GeneratorJob(Configuration conf)
run
public Map<String,Object> run(Map<String,Object> args)
throws Exception
- Description copied from class:
NutchTool
- Runs the tool, using a map of arguments.
May return results, or null.
- Specified by:
run
in class NutchTool
- Throws:
Exception
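Because the map-based run may return results or null, callers may want to normalise the result before reading from it. The following is a minimal sketch of that pattern, not GeneratorJob itself: `MapTool` is a hypothetical stand-in for the `run(Map<String,Object>)` contract, and the `"topN"` key is an illustrative assumption, since this page does not list the argument keys the job accepts.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

public class MapRunSketch {
    /** Hypothetical stand-in for the map-based run contract described above. */
    interface MapTool {
        Map<String, Object> run(Map<String, Object> args) throws Exception;
    }

    /** Runs the tool and replaces a null result with an empty map. */
    static Map<String, Object> runSafely(MapTool tool, Map<String, Object> args) throws Exception {
        Map<String, Object> results = tool.run(args);
        return results == null ? Collections.<String, Object>emptyMap() : results;
    }

    public static void main(String[] unused) throws Exception {
        Map<String, Object> args = new HashMap<>();
        args.put("topN", 1000L); // hypothetical key, not taken from this page

        MapTool returnsNull = in -> null;           // a tool that reports nothing
        MapTool echoes = in -> new HashMap<>(in);   // a tool that returns its input

        System.out.println(runSafely(returnsNull, args).isEmpty());
        System.out.println(runSafely(echoes, args).get("topN"));
    }
}
```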
generate
public String generate(long topN,
long curTime,
boolean filter,
boolean norm)
throws Exception
- Marks URLs as ready for fetching.
- Throws:
ClassNotFoundException
InterruptedException
Exception
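A call to generate might be prepared as in the sketch below. `UrlGenerator` is a hypothetical interface that only mirrors the signature listed above, so the example compiles without Nutch on the classpath. The parameter meanings in the comments (topN as an upper bound on marked URLs, curTime as epoch milliseconds, filter and norm as switches for URL filtering and normalisation) are assumptions inferred from the parameter names; this page does not document them, nor what the returned String contains.

```java
public class GenerateCallSketch {
    /** Hypothetical stand-in mirroring the generate(...) signature listed above. */
    interface UrlGenerator {
        String generate(long topN, long curTime, boolean filter, boolean norm) throws Exception;
    }

    /** Calls generate with no cap, the current time, and both switches enabled. */
    static String generateEverythingNow(UrlGenerator generator) throws Exception {
        long topN = Long.MAX_VALUE;                 // assumption: topN caps how many URLs are marked
        long curTime = System.currentTimeMillis();  // assumption: curTime is epoch milliseconds
        return generator.generate(topN, curTime, true, true);
    }

    public static void main(String[] args) throws Exception {
        // Stub that only echoes its arguments; it does not touch any crawl data.
        UrlGenerator stub = (topN, curTime, filter, norm) ->
                "topN=" + topN + " filter=" + filter + " norm=" + norm;
        System.out.println(generateEverythingNow(stub));
    }
}
```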
run
public int run(String[] args)
throws Exception
- Specified by:
run
in interface Tool
- Throws:
Exception
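The Tool-style run takes raw command-line arguments and returns an int status. The sketch below illustrates that general shape with a small, self-contained parser. It is not GeneratorJob's actual argument handling: the flag names `-topN`, `-noFilter`, and `-noNorm`, the defaults, and the choice of 0 for success and -1 for failure are all assumptions made for illustration.

```java
public class CliSketch {
    /** Parsed options; the fields mirror three of the generate(...) parameters. */
    static final class Options {
        long topN = Long.MAX_VALUE; // assumed default: no cap
        boolean filter = true;      // assumed default: filtering on
        boolean norm = true;        // assumed default: normalisation on
    }

    /** Parses hypothetical flags; throws IllegalArgumentException on bad input. */
    static Options parse(String[] args) {
        Options o = new Options();
        for (int i = 0; i < args.length; i++) {
            switch (args[i]) {
                case "-topN":
                    if (i + 1 >= args.length) {
                        throw new IllegalArgumentException("-topN needs a value");
                    }
                    o.topN = Long.parseLong(args[++i]);
                    break;
                case "-noFilter":
                    o.filter = false;
                    break;
                case "-noNorm":
                    o.norm = false;
                    break;
                default:
                    throw new IllegalArgumentException("Unknown option: " + args[i]);
            }
        }
        return o;
    }

    /** Returns 0 when the arguments parse, -1 otherwise. */
    static int run(String[] args) {
        try {
            Options o = parse(args);
            System.out.println("topN=" + o.topN + " filter=" + o.filter + " norm=" + o.norm);
            return 0;
        } catch (IllegalArgumentException e) {
            System.err.println(e.getMessage());
            return -1;
        }
    }

    public static void main(String[] args) {
        System.exit(run(args));
    }
}
```

Keeping parsing in its own method and converting failures to a status code in run means main can stay a one-line wrapper around the exit status.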
main
public static void main(String[] args)
throws Exception
- Throws:
Exception
Copyright © 2012 The Apache Software Foundation