org.apache.nutch.crawl
Class WebTableReader.WebTableRegexMapper

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Mapper<K1,V1,K2,V2>
      extended by org.apache.gora.mapreduce.GoraMapper<String,WebPage,Text,Text>
          extended by org.apache.nutch.crawl.WebTableReader.WebTableRegexMapper
Enclosing class:
WebTableReader

public static class WebTableReader.WebTableRegexMapper
extends org.apache.gora.mapreduce.GoraMapper<String,WebPage,Text,Text>

Filters the entries from the web table based on a regular expression.
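
The class description does not spell out how the regular expression is applied, so the following is only a minimal plain-Java sketch of the general technique, not the Nutch implementation. It assumes that entries are keyed by a string (as in GoraMapper<String,WebPage,Text,Text>), that the pattern is compiled once before any entry is processed, and that an entry is kept only when its whole key matches. The names RegexFilterSketch and filter are hypothetical, and a plain Map stands in for the web table.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

/** Plain-Java stand-in for a regex-filtering mapper; not the Nutch implementation. */
public class RegexFilterSketch {

    // Compiled once up front, the way a mapper's setup() could prepare state.
    private final Pattern pattern;

    public RegexFilterSketch(String regex) {
        this.pattern = Pattern.compile(regex);
    }

    /** Returns the keys that match the pattern in full, in input order. */
    public List<String> filter(Map<String, String> rows) {
        List<String> kept = new ArrayList<>();
        for (String key : rows.keySet()) {
            // matches() requires the whole key to match; find() would accept substrings.
            if (pattern.matcher(key).matches()) {
                kept.add(key);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Hypothetical sample data standing in for table rows.
        Map<String, String> rows = new LinkedHashMap<>();
        rows.put("http://nutch.apache.org/", "page A");
        rows.put("http://example.com/", "page B");

        RegexFilterSketch sketch = new RegexFilterSketch(".*apache\\.org.*");
        System.out.println(sketch.filter(rows)); // prints [http://nutch.apache.org/]
    }
}
```

Whether the real mapper uses a full match or a substring search is an assumption here; swapping matches() for find() in the sketch shows the difference.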


Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
Mapper.Context
 
Constructor Summary
WebTableReader.WebTableRegexMapper()
 
Method Summary
protected void map(String key, WebPage value, Mapper.Context context)
protected void setup(Mapper.Context context)
 
Methods inherited from class org.apache.gora.mapreduce.GoraMapper
initMapperJob, initMapperJob, initMapperJob, initMapperJob, initMapperJob
 
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, run
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

WebTableReader.WebTableRegexMapper

public WebTableReader.WebTableRegexMapper()

Method Detail

map

protected void map(String key,
                   WebPage value,
                   Mapper.Context context)
            throws IOException,
                   InterruptedException
Overrides:
map in class Mapper<String,WebPage,Text,Text>
Throws:
IOException
InterruptedException

setup

protected void setup(Mapper.Context context)
              throws IOException,
                     InterruptedException
Overrides:
setup in class Mapper<String,WebPage,Text,Text>
Throws:
IOException
InterruptedException
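
Both setup and map override hooks of org.apache.hadoop.mapreduce.Mapper, whose documentation describes the call order as setup once, then map for each key/value pair, then cleanup. The sketch below models only that ordering in plain Java. TinyMapper and trace are hypothetical names, and the real Mapper.run works against a Context rather than a list of keys.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Miniature model of the setup/map/cleanup call order; not Hadoop code. */
public class MapperLifecycleSketch {

    /** Hypothetical stand-in for a mapper that records each hook as it is called. */
    static class TinyMapper {
        final List<String> calls = new ArrayList<>();

        protected void setup() { calls.add("setup"); }

        protected void map(String key) { calls.add("map:" + key); }

        protected void cleanup() { calls.add("cleanup"); }

        /** Calls setup once, map once per key, then cleanup once. */
        public void run(List<String> keys) {
            setup();
            for (String key : keys) {
                map(key);
            }
            cleanup();
        }
    }

    /** Runs a TinyMapper over the keys and returns the recorded call order. */
    public static List<String> trace(List<String> keys) {
        TinyMapper mapper = new TinyMapper();
        mapper.run(keys);
        return mapper.calls;
    }

    public static void main(String[] args) {
        System.out.println(trace(Arrays.asList("a", "b"))); // prints [setup, map:a, map:b, cleanup]
    }
}
```

Because setup runs before the first map call, it is the natural place for per-task state such as a compiled pattern, which is the assumption this sketch is meant to illustrate.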


Copyright © 2012 The Apache Software Foundation