org.apache.lucene.benchmark.byTask.feeds
Class TrecLATimesParser
java.lang.Object
org.apache.lucene.benchmark.byTask.feeds.TrecDocParser
org.apache.lucene.benchmark.byTask.feeds.TrecLATimesParser
public class TrecLATimesParser
- extends TrecDocParser
Parser for the FT docs in trec disks 4+5 collection format
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TrecLATimesParser
public TrecLATimesParser()
parse
public DocData parse(DocData docData,
String name,
TrecContentSource trecSrc,
StringBuilder docBuf,
TrecDocParser.ParsePathType pathType)
throws IOException,
InterruptedException
- Description copied from class:
TrecDocParser
- parse the text prepared in docBuf into a result DocData,
no synchronization is required.
- Specified by:
parse
in class TrecDocParser
- Parameters:
docData
- reusable resultname
- name that should be set to the resulttrecSrc
- calling trec content sourcedocBuf
- text to parsepathType
- type of parsed file, or null if unknown - may be used by
parsers to alter their behavior according to the file path type.
- Throws:
IOException
InterruptedException