Zend Framework
LICENSE
This source file is subject to the new BSD license that is bundled with this package in the file LICENSE.txt. It is also available through the world-wide-web at this URL: http://framework.zend.com/license/new-bsd If you did not receive a copy of the license and are unable to obtain it through the world-wide-web, please send an email to license@zend.com so we can send you a copy immediately.
FORMAT_PRE_2_1 = '0'
FORMAT_2_1 = '1'
FORMAT_2_3 = '2'
GENERATION_RETRIEVE_COUNT = '10'
Number of attempts made to retrieve the current generation
GENERATION_RETRIEVE_PAUSE = '50'
Pause between generation retrieval attempts, in milliseconds
boolean $_closeDirOnExit = 'true'
File system adapter closing option
boolean $_closed = 'false'
Signals that the index is already closed, changes have been committed, and resources have been cleaned up
string $_defaultSearchField = 'null'
Default field name for search
Null means search through all fields
Zend_Search_Lucene_Storage_Directory $_directory = 'null'
File system adapter.
integer $_docCount = '0'
Number of documents in this index.
integer $_formatVersion = ''
Index format version
integer $_generation = 'FORMAT_PRE_2_1'
Current segment generation
boolean $_hasChanges = 'false'
Flag for index changes
integer $_refCount = '0'
Number of references to the index object
integer $_resultSetLimit = '0'
Result set limit
0 means no limit
array $_segmentInfos = 'array'
Array of Zend_Search_Lucene_Index_SegmentInfo objects for current version of index.
integer $_termsPerQueryLimit = '1024'
Terms per query limit
0 means no limit
Zend_Search_Lucene_TermStreamsPriorityQueue $_termsStream = 'null'
Terms stream priority queue object
Zend_Search_Lucene_Index_Writer $_writer = 'null'
Writer for this index, not instantiated unless required.
__construct(
Zend_Search_Lucene_Storage_Directory_Filesystem|string $directory
=
null, $create
=
false
)
:
Opens the index.
The constructor takes the index directory as a parameter. It should be either a string containing the path to the index folder or a Directory object.
__destruct(
)
:
Object destructor
_close(
)
:
Close current index and free resources
_getIndexWriter(
)
:
Zend_Search_Lucene_Index_Writer
Returns an instance of Zend_Search_Lucene_Index_Writer for the index
_readPre21SegmentsFile(
)
:
Read segments file for pre-2.1 Lucene index format
_readSegmentsFile(
)
:
Read segments file
_updateDocCount(
)
:
Update document counter
addDocument(
Zend_Search_Lucene_Document $document
)
:
Adds a document to this index.
addReference(
)
:
Add reference to the index object
closeTermsStream(
)
:
Close terms stream
Should be used for resource cleanup if the stream is not read to the end
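A minimal sketch of reading only part of the terms stream and then closing it. The helper name collectFirstTerms is hypothetical, $index is assumed to be any object exposing the terms stream methods listed in this reference, and the sketch assumes the stream is positioned on its first term after resetTermsStream().

```php
<?php
// Hypothetical helper: collect at most $limit terms from the terms stream.
function collectFirstTerms($index, $limit)
{
    $terms = [];
    $index->resetTermsStream();

    // currentTerm() returns null once the stream is exhausted.
    while (count($terms) < $limit && ($term = $index->currentTerm()) !== null) {
        $terms[] = $term;
        $index->nextTerm();
    }

    // The loop may stop before the end of the stream, so release it explicitly.
    $index->closeTermsStream();

    return $terms;
}
```

Closing the stream unconditionally keeps the helper simple: it does not need to track whether the loop ended because of the limit or because the stream ran out.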
commit(
)
:
Commit changes resulting from delete() or undeleteAll() operations.
count(
)
:
integer
Returns the total number of documents in this index (including deleted documents).
create(
mixed $directory
)
:
Zend_Search_Lucene_Interface
Create index
currentTerm(
)
:
Zend_Search_Lucene_Index_Term|null
Returns term in current position
delete(
integer|Zend_Search_Lucene_Search_QueryHit $id
)
:
Deletes a document from the index.
$id is an internal document id
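As an illustration, deleting every document matched by a query could look roughly like the sketch below. The helper name deleteMatching is hypothetical; $index is assumed to be any object exposing find(), delete() and commit() as listed in this reference.

```php
<?php
// Hypothetical helper: delete all documents matched by $query.
// Returns the number of hits that were passed to delete().
function deleteMatching($index, $query)
{
    $hits = $index->find($query);

    foreach ($hits as $hit) {
        // delete() accepts either an internal document id or a query hit object.
        $index->delete($hit);
    }

    // Commit the changes resulting from the delete() calls.
    $index->commit();

    return count($hits);
}
```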
docFreq(
Zend_Search_Lucene_Index_Term $term
)
:
integer
Returns the number of documents in this index containing the $term.
find(
Zend_Search_Lucene_Search_Query|string $query
)
:
array
Performs a query against the index and returns an array of Zend_Search_Lucene_Search_QueryHit objects.
Input is a string or Zend_Search_Lucene_Search_Query.
getActualGeneration(
Zend_Search_Lucene_Storage_Directory $directory
)
:
integer
Get current generation number
Returns the generation number. 0 means pre-2.1 index format; -1 means there are no segments files.
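The special return values can be interpreted as in the following sketch. The helper name describeGeneration and the wording of the returned strings are illustrative only.

```php
<?php
// Hypothetical helper: turn a generation number into a readable description.
function describeGeneration($generation)
{
    $generation = (int) $generation;

    if ($generation === -1) {
        return 'no segments files';
    }
    if ($generation === 0) {
        return 'pre-2.1 index format';
    }

    return 'generation ' . $generation;
}
```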
getDefaultSearchField(
)
:
string
Get default search field.
Null means that the search is performed across all fields by default
getDirectory(
)
:
Zend_Search_Lucene_Storage_Directory
Returns the Zend_Search_Lucene_Storage_Directory instance for this index.
getDocument(
integer|Zend_Search_Lucene_Search_QueryHit $id
)
:
Zend_Search_Lucene_Document
Returns a Zend_Search_Lucene_Document object for the document number $id in this index.
getFieldNames(
boolean $indexed
=
false
)
:
array
Returns a list of all unique field names that exist in this index.
getFormatVersion(
)
:
integer
Get index format version
getGeneration(
)
:
integer
Get generation number associated with this index instance
A given generation number, paired with a document number or a query string, is guaranteed to produce the same result when reading from the index, so it may be used as part of a key for caching search results.
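One way to use the generation number for caching is to fold it into the cache key, so that entries are naturally bypassed once the index moves to a new generation. This is only a sketch: the helper name searchCacheKey and the key layout are assumptions, and $index is any object exposing getGeneration().

```php
<?php
// Hypothetical helper: build a cache key from the index generation and a query string.
function searchCacheKey($index, $queryString)
{
    // md5() keeps the key short and free of characters a cache backend might reject.
    return 'lucene_' . $index->getGeneration() . '_' . md5($queryString);
}
```

The same query produces a different key after the generation changes, so stale results are simply never looked up again.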
getMaxBufferedDocs(
)
:
integer
Retrieve index maxBufferedDocs option
maxBufferedDocs is the minimum number of documents required before the buffered in-memory documents are written into a new segment
Default value is 10
getMaxMergeDocs(
)
:
integer
Retrieve index maxMergeDocs option
maxMergeDocs is the largest number of documents ever merged by addDocument(). Small values (e.g., less than 10,000) are best for interactive indexing, as this limits the length of pauses while indexing to a few seconds. Larger values are best for batched indexing and speedier searches.
Default value is PHP_INT_MAX
getMergeFactor(
)
:
integer
Retrieve index mergeFactor option
mergeFactor determines how often segment indices are merged by addDocument(). With smaller values, less RAM is used while indexing, and searches on unoptimized indices are faster, but indexing speed is slower. With larger values, more RAM is used during indexing, and while searches on unoptimized indices are slower, indexing is faster. Thus larger values (> 10) are best for batch index creation, and smaller values (< 10) for indices that are interactively maintained.
Default value is 10
getResultSetLimit(
)
:
integer
Get result set limit.
0 means no limit
getSegmentFileName(
integer $generation
)
:
string
Get segments file name
getSimilarity(
)
:
Zend_Search_Lucene_Search_Similarity
Retrieve the similarity used by the index reader
getTermsPerQueryLimit(
)
:
integer
Get terms per query limit.
0 means no limit
hasDeletions(
)
:
boolean
Returns true if any documents have been deleted from this index.
hasTerm(
Zend_Search_Lucene_Index_Term $term
)
:
boolean
Returns true if the index contains documents with the specified term.
Used for query optimization.
isDeleted(
integer $id
)
:
boolean
Checks whether a document is deleted
maxDoc(
)
:
integer
Returns one greater than the largest possible document number.
This may be used to, e.g., determine how big to allocate a structure which will have an element for every document number in an index.
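Because maxDoc() is an upper bound on document numbers and deleted documents still occupy their numbers, a loop over all live documents can be sketched as follows. The helper name liveDocumentIds is hypothetical; $index is any object exposing maxDoc() and isDeleted().

```php
<?php
// Hypothetical helper: list the ids of all documents not marked as deleted.
function liveDocumentIds($index)
{
    $ids = [];
    $max = $index->maxDoc(); // one greater than the largest possible document number

    for ($id = 0; $id < $max; $id++) {
        if (!$index->isDeleted($id)) {
            $ids[] = $id;
        }
    }

    return $ids;
}
```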
nextTerm(
)
:
Zend_Search_Lucene_Index_Term|null
Scans terms dictionary and returns next term
norm(
integer $id, string $fieldName
)
:
float
Returns a normalization factor for "field, document" pair.
numDocs(
)
:
integer
Returns the total number of non-deleted documents in this index.
open(
mixed $directory
)
:
Zend_Search_Lucene_Interface
Open index
optimize(
)
:
Optimize index.
Merges all segments into one
removeReference(
)
:
Remove reference from the index object
When reference count becomes zero, index is closed and resources are cleaned up
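The reference counting behaviour described here can be modelled with a small stand-alone class. This is an illustrative model only, not the library's implementation; the class name RefCountedHandle and the isClosed() method are hypothetical.

```php
<?php
// Illustrative model of reference counting: the resource is closed
// as soon as the last reference is removed.
class RefCountedHandle
{
    private $refCount = 0;
    private $closed = false;

    public function addReference()
    {
        $this->refCount++;
    }

    public function removeReference()
    {
        $this->refCount--;

        if ($this->refCount === 0) {
            $this->close();
        }
    }

    public function isClosed()
    {
        return $this->closed;
    }

    private function close()
    {
        // A real index would commit pending changes and release files here.
        $this->closed = true;
    }
}
```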
resetTermsStream(
)
:
Reset terms stream.
setDefaultSearchField(
string $fieldName
)
:
Set default search field.
Null means that the search is performed across all fields by default
Default value is null
setFormatVersion(
int $formatVersion
)
:
Set index format version.
The index is converted to this format at the next update
setMaxBufferedDocs(
integer $maxBufferedDocs
)
:
Set index maxBufferedDocs option
maxBufferedDocs is the minimum number of documents required before the buffered in-memory documents are written into a new segment
Default value is 10
setMaxMergeDocs(
integer $maxMergeDocs
)
:
Set index maxMergeDocs option
maxMergeDocs is the largest number of documents ever merged by addDocument(). Small values (e.g., less than 10,000) are best for interactive indexing, as this limits the length of pauses while indexing to a few seconds. Larger values are best for batched indexing and speedier searches.
Default value is PHP_INT_MAX
setMergeFactor(
$mergeFactor
)
:
Set index mergeFactor option
mergeFactor determines how often segment indices are merged by addDocument(). With smaller values, less RAM is used while indexing, and searches on unoptimized indices are faster, but indexing speed is slower. With larger values, more RAM is used during indexing, and while searches on unoptimized indices are slower, indexing is faster. Thus larger values (> 10) are best for batch index creation, and smaller values (< 10) for indices that are interactively maintained.
Default value is 10
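The guidance on mergeFactor and maxMergeDocs could be applied along the lines of the sketch below. The helper name applyIndexingProfile and the concrete values (50, 5, 5000) are illustrative assumptions chosen to fall on the recommended side of the thresholds mentioned in this reference; they are not library defaults.

```php
<?php
// Hypothetical helper: tune an index for batch creation or interactive maintenance.
function applyIndexingProfile($index, $batch)
{
    if ($batch) {
        // Larger mergeFactor (> 10): faster indexing, more RAM, slower unoptimized searches.
        $index->setMergeFactor(50);
        $index->setMaxMergeDocs(PHP_INT_MAX);
    } else {
        // Smaller mergeFactor (< 10) and maxMergeDocs (< 10,000): shorter pauses while indexing.
        $index->setMergeFactor(5);
        $index->setMaxMergeDocs(5000);
    }
}
```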
setResultSetLimit(
integer $limit
)
:
Set result set limit.
0 (default) means no limit
setTermsPerQueryLimit(
integer $limit
)
:
Set terms per query limit.
0 means no limit
skipTo(
Zend_Search_Lucene_Index_Term $prefix
)
:
Skip terms stream up to the specified term prefix.
The prefix contains the fully specified field info and a portion of the searched term
termDocs(
Zend_Search_Lucene_Index_Term $term, Zend_Search_Lucene_Index_DocsFilter|null $docsFilter
=
null
)
:
array
Returns IDs of all documents containing term.
termDocsFilter(
Zend_Search_Lucene_Index_Term $term, Zend_Search_Lucene_Index_DocsFilter|null $docsFilter
=
null
)
:
Zend_Search_Lucene_Index_DocsFilter
Returns documents filter for all documents containing term.
It performs the same operation as termDocs, but returns the result as a Zend_Search_Lucene_Index_DocsFilter object
termFreqs(
Zend_Search_Lucene_Index_Term $term, Zend_Search_Lucene_Index_DocsFilter|null $docsFilter
=
null
)
:
array
Returns an array of all term frequencies.
Result array structure: array(docId => freq, ...)
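Given the array(docId => freq, ...) structure, simple statistics for a term can be derived as in this sketch. The helper name summarizeTermFreqs and the keys of the returned array are hypothetical; $index is any object exposing termFreqs().

```php
<?php
// Hypothetical helper: summarize how often a term occurs across the index.
function summarizeTermFreqs($index, $term)
{
    // Keys are document ids, values are occurrences of the term in that document.
    $freqs = $index->termFreqs($term);

    return [
        'documents'   => count($freqs),
        'occurrences' => array_sum($freqs),
    ];
}
```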
termPositions(
Zend_Search_Lucene_Index_Term $term, Zend_Search_Lucene_Index_DocsFilter|null $docsFilter
=
null
)
:
array
Returns an array of all term positions in the documents.
Result array structure: array(docId => array(pos1, pos2, ...), ...)
terms(
)
:
array
Returns an array of all terms in this index.
undeleteAll(
)
:
Undeletes all documents currently marked as deleted in this index.