org.apache.nutch.host
Class HostInjectorJob
java.lang.Object
org.apache.nutch.host.HostInjectorJob
- All Implemented Interfaces:
- Configurable, Tool
public class HostInjectorJob
- extends Object
- implements Tool
Creates or updates an existing host table from a text file.
The files contain one host name per line, optionally followed by custom
metadata separated by tabs with the metadata key is separated from the
corresponding value by '='.
The URLs must contain the protocol as well as the host name
e.g. http://www.nutch.org \t nutch.score=10 \t nutch.fetchInterval=2592000 \t
userType=open_source
Field Summary |
static org.slf4j.Logger |
LOG
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
LOG
public static final org.slf4j.Logger LOG
HostInjectorJob
public HostInjectorJob()
HostInjectorJob
public HostInjectorJob(Configuration conf)
getConf
public Configuration getConf()
- Specified by:
getConf
in interface Configurable
setConf
public void setConf(Configuration conf)
- Specified by:
setConf
in interface Configurable
inject
public boolean inject(Path hostDir)
throws Exception
- Throws:
Exception
run
public int run(String[] args)
throws Exception
- Specified by:
run
in interface Tool
- Throws:
Exception
main
public static void main(String[] args)
throws Exception
- Throws:
Exception
Copyright © 2012 The Apache Software Foundation