org.apache.lucene.analysis.cn.smart
Class SentenceTokenizer
java.lang.Object
  
org.apache.lucene.util.AttributeSource
      
org.apache.lucene.analysis.TokenStream
          
org.apache.lucene.analysis.Tokenizer
              
org.apache.lucene.analysis.cn.smart.SentenceTokenizer
- All Implemented Interfaces: 
 - Closeable
 
public final class SentenceTokenizer
- extends Tokenizer
 
Tokenizes input text into sentences.
 
 The output tokens can then be broken into words with WordTokenFilter
 
- WARNING: This API is experimental and might change in incompatible ways in the next release.
 
  
 
 
 
| Fields inherited from class org.apache.lucene.analysis.Tokenizer | 
input | 
 
 
 
 
| Methods inherited from class org.apache.lucene.util.AttributeSource | 
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState | 
 
 
SentenceTokenizer
public SentenceTokenizer(Reader reader)
SentenceTokenizer
public SentenceTokenizer(AttributeSource source,
                         Reader reader)
SentenceTokenizer
public SentenceTokenizer(AttributeSource.AttributeFactory factory,
                         Reader reader)
incrementToken
public boolean incrementToken()
                       throws IOException
- Specified by:
 incrementToken in class TokenStream
 
- Throws:
 IOException
 
reset
public void reset()
           throws IOException
- Overrides:
 reset in class TokenStream
 
- Throws:
 IOException
 
end
public void end()
- Overrides:
 end in class TokenStream
 
 
          Copyright © 2000-2012 Apache Software Foundation.  All Rights Reserved.