org.apache.nutch.host
Class HostInjectorJob

java.lang.Object
  extended by org.apache.nutch.host.HostInjectorJob
All Implemented Interfaces:
Configurable, Tool

public class HostInjectorJob
extends Object
implements Tool

Creates a host table, or updates an existing one, from a text file.
The files contain one host per line, optionally followed by custom metadata. Metadata entries are separated by tabs, and each metadata key is separated from its value by '='.
Each host must be given as a URL that contains the protocol as well as the host name,
e.g. http://www.nutch.org \t nutch.score=10 \t nutch.fetchInterval=2592000 \t userType=open_source
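
The line format described above can be illustrated with a minimal, standalone sketch. It is not the parsing code used by HostInjectorJob.UrlMapper; the class HostLineSketch, the nested HostLine type, and the parse method are hypothetical names introduced only for this example, and the handling of malformed fields (they are skipped) is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration of the input line format; not part of Nutch.
public class HostLineSketch {

    /** Parsed form of one input line: the URL plus its metadata pairs. */
    static final class HostLine {
        final String url;
        final Map<String, String> metadata;

        HostLine(String url, Map<String, String> metadata) {
            this.url = url;
            this.metadata = metadata;
        }
    }

    /** Splits a line on tabs, then splits each metadata field on the first '='. */
    static HostLine parse(String line) {
        String[] fields = line.split("\t");
        String url = fields[0].trim();
        // The format requires the protocol to be present, e.g. "http://".
        if (!url.contains("://")) {
            throw new IllegalArgumentException("URL must include a protocol: " + url);
        }
        Map<String, String> metadata = new LinkedHashMap<>();
        for (int i = 1; i < fields.length; i++) {
            String field = fields[i].trim();
            int eq = field.indexOf('=');
            if (eq <= 0) {
                continue; // Assumption: fields without a key or '=' are ignored.
            }
            metadata.put(field.substring(0, eq), field.substring(eq + 1));
        }
        return new HostLine(url, metadata);
    }

    public static void main(String[] args) {
        HostLine host = parse(
            "http://www.nutch.org\tnutch.score=10\tnutch.fetchInterval=2592000\tuserType=open_source");
        System.out.println(host.url + " " + host.metadata);
    }
}
```

Splitting on the first '=' only means a metadata value may itself contain '=' characters, which seems the safer reading of the format.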


Nested Class Summary
static class HostInjectorJob.UrlMapper
           
 
Field Summary
static org.slf4j.Logger LOG
           
 
Constructor Summary
HostInjectorJob()
           
HostInjectorJob(Configuration conf)
           
 
Method Summary
 Configuration getConf()
           
 boolean inject(Path hostDir)
           
static void main(String[] args)
           
 int run(String[] args)
           
 void setConf(Configuration conf)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.slf4j.Logger LOG
Constructor Detail

HostInjectorJob

public HostInjectorJob()

HostInjectorJob

public HostInjectorJob(Configuration conf)
Method Detail

getConf

public Configuration getConf()
Specified by:
getConf in interface Configurable

setConf

public void setConf(Configuration conf)
Specified by:
setConf in interface Configurable

inject

public boolean inject(Path hostDir)
               throws Exception
Throws:
Exception

run

public int run(String[] args)
        throws Exception
Specified by:
run in interface Tool
Throws:
Exception

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


Copyright © 2012 The Apache Software Foundation