org.apache.lucene.benchmark.byTask.feeds
Class TrecFR94Parser
java.lang.Object
org.apache.lucene.benchmark.byTask.feeds.TrecDocParser
org.apache.lucene.benchmark.byTask.feeds.TrecFR94Parser
public class TrecFR94Parser
- extends TrecDocParser
Parser for the FR94 docs in trec disks 4+5 collection format
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TrecFR94Parser
public TrecFR94Parser()
parse
public DocData parse(DocData docData,
String name,
TrecContentSource trecSrc,
StringBuilder docBuf,
TrecDocParser.ParsePathType pathType)
throws IOException,
InterruptedException
- Description copied from class:
TrecDocParser
- parse the text prepared in docBuf into a result DocData,
no synchronization is required.
- Specified by:
parse
in class TrecDocParser
- Parameters:
docData
- reusable resultname
- name that should be set to the resulttrecSrc
- calling trec content sourcedocBuf
- text to parsepathType
- type of parsed file, or null if unknown - may be used by
parsers to alter their behavior according to the file path type.
- Throws:
IOException
InterruptedException