org.apache.lucene.benchmark.byTask.feeds
Class TrecFTParser
java.lang.Object
org.apache.lucene.benchmark.byTask.feeds.TrecDocParser
org.apache.lucene.benchmark.byTask.feeds.TrecFTParser
public class TrecFTParser
- extends TrecDocParser
Parser for the FT docs in trec disks 4+5 collection format
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TrecFTParser
public TrecFTParser()
parse
public DocData parse(DocData docData,
String name,
TrecContentSource trecSrc,
StringBuilder docBuf,
TrecDocParser.ParsePathType pathType)
throws IOException,
InterruptedException
- Description copied from class:
TrecDocParser
- parse the text prepared in docBuf into a result DocData,
no synchronization is required.
- Specified by:
parse in class TrecDocParser
- Parameters:
docData - reusable resultname - name that should be set to the resulttrecSrc - calling trec content sourcedocBuf - text to parsepathType - type of parsed file, or null if unknown - may be used by
parsers to alter their behavior according to the file path type.
- Throws:
IOException
InterruptedException