org.apache.nutch.host
Class HostDbUpdateReducer

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Reducer<K1,V1,K2,V2>
      extended by org.apache.gora.mapreduce.GoraReducer<Text,WebPage,String,Host>
          extended by org.apache.nutch.host.HostDbUpdateReducer

public class HostDbUpdateReducer
extends org.apache.gora.mapreduce.GoraReducer<Text,WebPage,String,Host>

Combines all WebPage records that share the same host key into a single Host object carrying aggregate statistics for that host.
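
The combining step can be illustrated with a minimal plain-Java sketch. It does not use the Hadoop, Gora, or Nutch classes and is not the actual implementation: Page, HostSummary, the fetched flag, and the two counters are hypothetical stand-ins, since this page does not say which statistics the reducer records. It only mirrors the shape of reduce, which receives one host key together with every value grouped under it.

```java
import java.util.ArrayList;
import java.util.List;

public class HostStatsSketch {

    /** Hypothetical stand-in for a WebPage row; only the field used below. */
    static final class Page {
        final boolean fetched;

        Page(boolean fetched) {
            this.fetched = fetched;
        }
    }

    /** Hypothetical stand-in for the Host record produced per key. */
    static final class HostSummary {
        final String host;
        final int pageCount;
        final int fetchedCount;

        HostSummary(String host, int pageCount, int fetchedCount) {
            this.host = host;
            this.pageCount = pageCount;
            this.fetchedCount = fetchedCount;
        }
    }

    /** Same shape as reduce(key, values, context): one key, all of its values. */
    static HostSummary reduce(String hostKey, Iterable<Page> pages) {
        int total = 0;
        int fetched = 0;
        // Single pass over the values, as a reducer would iterate them.
        for (Page page : pages) {
            total++;
            if (page.fetched) {
                fetched++;
            }
        }
        return new HostSummary(hostKey, total, fetched);
    }

    public static void main(String[] args) {
        List<Page> pages = new ArrayList<>();
        pages.add(new Page(true));
        pages.add(new Page(true));
        pages.add(new Page(false));

        // "example.org" is an illustrative key, not a format taken from Nutch.
        HostSummary summary = reduce("example.org", pages);
        System.out.println(summary.host
                + " pages=" + summary.pageCount
                + " fetched=" + summary.fetchedCount);
        // prints: example.org pages=3 fetched=2
    }
}
```

In the real class the result would be emitted through the context as a (String, Host) pair, matching the GoraReducer<Text,WebPage,String,Host> type parameters; the sketch returns the summary instead so that it stays self-contained.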


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Reducer
Reducer.Context
 
Constructor Summary
HostDbUpdateReducer()
           
 
Method Summary
protected void reduce(Text key, Iterable<WebPage> values, Reducer.Context context)
           
 
Methods inherited from class org.apache.gora.mapreduce.GoraReducer
initReducerJob, initReducerJob, initReducerJob
 
Methods inherited from class org.apache.hadoop.mapreduce.Reducer
cleanup, run, setup
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HostDbUpdateReducer

public HostDbUpdateReducer()

Method Detail

reduce

protected void reduce(Text key,
                      Iterable<WebPage> values,
                      Reducer.Context context)
               throws IOException,
                      InterruptedException
Overrides:
reduce in class Reducer<Text,WebPage,String,Host>
Throws:
IOException
InterruptedException


Copyright © 2012 The Apache Software Foundation