org.apache.nutch.net
Interface URLNormalizer

All Superinterfaces:
Configurable
All Known Implementing Classes:
BasicURLNormalizer, PassURLNormalizer, RegexURLNormalizer

public interface URLNormalizer
extends Configurable

Interface used to convert URLs to normal form and optionally perform substitutions


Field Summary
static String X_POINT_ID
           
 
Method Summary
 String normalize(String urlString, String scope)
           
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Field Detail

X_POINT_ID

static final String X_POINT_ID
Method Detail

normalize

String normalize(String urlString,
                 String scope)
                 throws MalformedURLException
Throws:
MalformedURLException


Copyright © 2012 The Apache Software Foundation