org.apache.lucene.benchmark.byTask.feeds
Class TrecParserByPath
java.lang.Object
org.apache.lucene.benchmark.byTask.feeds.TrecDocParser
org.apache.lucene.benchmark.byTask.feeds.TrecParserByPath
public class TrecParserByPath
extends TrecDocParser
Parser for TREC documents that selects the parser to apply according
to the source file's path, defaulting to TrecGov2Parser.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
TrecParserByPath
public TrecParserByPath()
parse
public DocData parse(DocData docData,
String name,
TrecContentSource trecSrc,
StringBuilder docBuf,
TrecDocParser.ParsePathType pathType)
throws IOException,
InterruptedException
Description copied from class: TrecDocParser
Parses the text prepared in docBuf into a result DocData.
No synchronization is required.
Specified by:
parse in class TrecDocParser
Parameters:
docData - reusable result
name - name that should be set to the result
trecSrc - calling trec content source
docBuf - text to parse
pathType - type of parsed file, or null if unknown; may be used by
parsers to alter their behavior according to the file path type.
Throws:
IOException
InterruptedException
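
Example (illustrative sketch only):
The following self-contained sketch shows one way that selecting a parser by path type, with a default when the type is unknown, could look. It does not use the Lucene classes: PathDispatchSketch, PathType, DocParser, typeOf and the label-producing parsers are all hypothetical stand-ins introduced for illustration, and the rule of matching parent directory names is an assumption, not a description of how TrecParserByPath is implemented.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.EnumMap;
import java.util.Locale;
import java.util.Map;

public class PathDispatchSketch {

    // Hypothetical stand-in for a path type enum; not the Lucene type.
    public enum PathType { GOV2, FBIS, FT, FR94, LATIMES }

    // Hypothetical stand-in for a parser: it only labels the text it is given.
    public interface DocParser {
        String parse(String text);
    }

    // Assumed default, mirroring the documented fallback to the GOV2 parser.
    private static final PathType DEFAULT_TYPE = PathType.GOV2;

    // One parser per path type, looked up at parse time.
    private static final Map<PathType, DocParser> PARSERS = new EnumMap<>(PathType.class);

    static {
        for (PathType t : PathType.values()) {
            PARSERS.put(t, text -> t + ":" + text);
        }
    }

    // Walks up the parent directories and returns the first type whose name
    // matches a directory name (case-insensitive); falls back to the default.
    public static PathType typeOf(Path file) {
        for (Path dir = file.getParent(); dir != null; dir = dir.getParent()) {
            Path leaf = dir.getFileName();
            if (leaf == null) {
                continue; // a root directory has no name
            }
            String name = leaf.toString().toUpperCase(Locale.ROOT);
            for (PathType t : PathType.values()) {
                if (t.name().equals(name)) {
                    return t;
                }
            }
        }
        return DEFAULT_TYPE;
    }

    // Dispatches to the parser registered for the type; a null type means
    // "unknown" and is treated as the default.
    public static String parse(String text, PathType type) {
        PathType effective = (type == null) ? DEFAULT_TYPE : type;
        return PARSERS.get(effective).parse(text);
    }

    public static void main(String[] args) {
        Path file = Paths.get("trec", "latimes", "la010189.gz");
        System.out.println(parse("<DOC>...</DOC>", typeOf(file)));
        System.out.println(parse("<DOC>...</DOC>", null));
    }
}
```

Keeping the lookup table in an EnumMap means adding a collection needs only a new enum constant and a registered parser; the dispatch method itself does not change.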