public class TokenStreamOffsetStrategy extends AnalysisOffsetStrategy
OffsetsEnum wrapping the TokenStream filtered to terms
in the query, including wildcards. It can't handle position-sensitive queries (phrases). Passage accuracy suffers
because the freq() is unknown -- it's always Integer.MAX_VALUE instead.| Modifier and Type | Class and Description |
|---|---|
private static class |
TokenStreamOffsetStrategy.TokenStreamPostingsEnum |
| Modifier and Type | Field and Description |
|---|---|
private static BytesRef[] |
ZERO_LEN_BYTES_REF_ARRAY |
analyzerautomata, field, phraseHelper, terms| Constructor and Description |
|---|
TokenStreamOffsetStrategy(java.lang.String field,
BytesRef[] terms,
PhraseHelper phraseHelper,
CharacterRunAutomaton[] automata,
Analyzer indexAnalyzer) |
| Modifier and Type | Method and Description |
|---|---|
private static CharacterRunAutomaton[] |
convertTermsToAutomata(BytesRef[] terms,
CharacterRunAutomaton[] automata) |
java.util.List<OffsetsEnum> |
getOffsetsEnums(IndexReader reader,
int docId,
java.lang.String content)
The primary method -- return offsets for highlightable words in the specified document.
|
getOffsetSource, tokenStreamcreateAutomataOffsetsFromTerms, createOffsetsEnumsFromReader, getFieldprivate static final BytesRef[] ZERO_LEN_BYTES_REF_ARRAY
public TokenStreamOffsetStrategy(java.lang.String field,
BytesRef[] terms,
PhraseHelper phraseHelper,
CharacterRunAutomaton[] automata,
Analyzer indexAnalyzer)
private static CharacterRunAutomaton[] convertTermsToAutomata(BytesRef[] terms, CharacterRunAutomaton[] automata)
public java.util.List<OffsetsEnum> getOffsetsEnums(IndexReader reader, int docId, java.lang.String content) throws java.io.IOException
FieldOffsetStrategygetOffsetsEnums in class FieldOffsetStrategyjava.io.IOException