java.lang.Object
  org.apache.lucene.analysis.Analyzer
    org.apache.lucene.analysis.util.StopwordAnalyzerBase
      org.apache.lucene.analysis.standard.UAX29URLEmailAnalyzer

public final class UAX29URLEmailAnalyzer
extends StopwordAnalyzerBase
Filters UAX29URLEmailTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words.

You must specify the required Version compatibility when creating UAX29URLEmailAnalyzer.
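The chain described above can be pictured with a small plain-Java sketch. It does not use Lucene: the whitespace tokenizer is only a stand-in for UAX29URLEmailTokenizer, and the class name `AnalyzerChainSketch` and the `DEMO_STOP_WORDS` list are hypothetical names introduced for illustration (the real stop list is STOP_WORDS_SET). It shows only the order of the lowercase and stop-word stages.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Plain-Java illustration of the lowercase and stop-word stages; not Lucene code. */
final class AnalyzerChainSketch {

    // Hypothetical demo stop list; the real STOP_WORDS_SET is defined by Lucene.
    static final Set<String> DEMO_STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "and", "at", "the", "to"));

    /** Stand-in tokenizer: splits on whitespace, so URLs and e-mail addresses stay whole. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    /** Lowercases every token, then drops the ones found in the stop list. */
    static List<String> analyze(String text, Set<String> stopWords) {
        List<String> out = new ArrayList<>();
        for (String token : tokenize(text)) {
            String lower = token.toLowerCase(Locale.ROOT); // lowercase stage
            if (!stopWords.contains(lower)) {              // stop-word stage
                out.add(lower);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(analyze(
                "Mail Bob@Example.com AND visit http://Lucene.apache.org",
                DEMO_STOP_WORDS));
    }
}
```

Lowercasing runs before the stop-word check so that "AND" is matched against the lowercase stop list. The real tokenizer applies Unicode text segmentation rules rather than whitespace splitting, so its token boundaries will differ from this sketch.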
 
| Nested Class Summary | 
|---|
| Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer | 
|---|
| Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents | 
| Field Summary | |
|---|---|
| static int | DEFAULT_MAX_TOKEN_LENGTH: Default maximum allowed token length. |
| static CharArraySet | STOP_WORDS_SET: An unmodifiable set containing some common English words that are usually not useful for searching. |
| Fields inherited from class org.apache.lucene.analysis.util.StopwordAnalyzerBase | 
|---|
| matchVersion, stopwords | 
| Constructor Summary | |
|---|---|
| UAX29URLEmailAnalyzer(Version matchVersion) | Builds an analyzer with the default stop words (STOP_WORDS_SET). |
| UAX29URLEmailAnalyzer(Version matchVersion, CharArraySet stopWords) | Builds an analyzer with the given stop words. |
| UAX29URLEmailAnalyzer(Version matchVersion, Reader stopwords) | Builds an analyzer with the stop words from the given reader. |
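As a rough picture of what "stop words from the given reader" involves, the sketch below reads a word list from a Reader in plain Java. It assumes a one-word-per-line format with surrounding whitespace trimmed; the class name `StopWordReaderSketch` and the method `readStopWords` are hypothetical, and skipping blank lines is a choice made for this sketch, not a documented behaviour of Lucene's loader.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.LinkedHashSet;
import java.util.Set;

/** Plain-Java illustration of loading a stop-word list from a Reader; not Lucene code. */
final class StopWordReaderSketch {

    /** Assumes one stop word per line; trims each line and skips blank ones. */
    static Set<String> readStopWords(Reader reader) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader in = new BufferedReader(reader)) {
            String line;
            while ((line = in.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word);
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // A StringReader stands in for a file or classpath resource.
        System.out.println(readStopWords(new StringReader("the\n and \n\nof\n")));
    }
}
```

The resulting set plays the role of the `stopWords` argument of the CharArraySet constructor; with Lucene itself, the Reader-based constructor takes care of loading, so no such helper is needed.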
| Method Summary | |
|---|---|
| protected Analyzer.TokenStreamComponents | createComponents(String fieldName, Reader reader) |
| int | getMaxTokenLength() |
| void | setMaxTokenLength(int length): Set maximum allowed token length. |
| Methods inherited from class org.apache.lucene.analysis.util.StopwordAnalyzerBase | 
|---|
| getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet | 
| Methods inherited from class org.apache.lucene.analysis.Analyzer | 
|---|
| close, getOffsetGap, getPositionIncrementGap, initReader, tokenStream | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait | 
| Field Detail | 
|---|
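public static final int DEFAULT_MAX_TOKEN_LENGTH
Default maximum allowed token length.

public static final CharArraySet STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not useful for searching.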
| Constructor Detail | 
|---|
public UAX29URLEmailAnalyzer(Version matchVersion,
                             CharArraySet stopWords)
Builds an analyzer with the given stop words.
Parameters:
matchVersion - Lucene version to match (see above)
stopWords - stop words

public UAX29URLEmailAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (STOP_WORDS_SET).
Parameters:
matchVersion - Lucene version to match (see above)

public UAX29URLEmailAnalyzer(Version matchVersion,
                             Reader stopwords)
                      throws IOException
Builds an analyzer with the stop words from the given reader.
Parameters:
matchVersion - Lucene version to match (see above)
stopwords - Reader to read stop words from
Throws:
IOException
See Also:
WordlistLoader.getWordSet(java.io.Reader, org.apache.lucene.util.Version)

| Method Detail | 
|---|
public void setMaxTokenLength(int length)
Set maximum allowed token length.

public int getMaxTokenLength()
See Also:
setMaxTokenLength(int)

protected Analyzer.TokenStreamComponents createComponents(String fieldName,
                                                          Reader reader)
Specified by:
createComponents in class Analyzer