public class MorfologikAnalyzer extends Analyzer
Analyzer using the Morfologik library.

Nested classes/interfaces inherited from class Analyzer: Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents

Fields inherited from class Analyzer: GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY

| Constructor and Description |
|---|
| MorfologikAnalyzer()<br>Builds an analyzer with the default Morfologik Polish dictionary. |
| MorfologikAnalyzer(morfologik.stemming.Dictionary dictionary)<br>Builds an analyzer with an explicit dictionary resource. |

| Modifier and Type | Method and Description |
|---|---|
| protected Analyzer.TokenStreamComponents | createComponents(String field, Reader reader)<br>Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |
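
To make the shape of the chain behind createComponents concrete (a StandardTokenizer filtered with StandardFilter and MorfologikFilter), here is a dependency-free sketch of "tokenize, then replace each token with its dictionary lemmas". Everything in it is a hypothetical stand-in and not Lucene or Morfologik API: the class name `AnalyzerChainSketch`, the regex-based tokenizer, and the `Map` used as a toy dictionary are assumptions made for illustration, and passing unknown words through unchanged is likewise an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model of a tokenizer -> dictionary-filter chain (not Lucene API).
public class AnalyzerChainSketch {

    // Stand-in for the tokenizer stage: split on anything that is
    // not a Unicode letter or decimal digit, dropping empty pieces.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Stand-in for the dictionary filter stage: each token is replaced by
    // all of its lemmas; tokens missing from the toy dictionary pass
    // through unchanged (an assumption of this sketch).
    static List<String> lemmatize(List<String> tokens, Map<String, List<String>> dictionary) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            List<String> lemmas = dictionary.get(token);
            if (lemmas == null || lemmas.isEmpty()) {
                out.add(token);
            } else {
                out.addAll(lemmas);
            }
        }
        return out;
    }

    // The whole chain: tokenizer output feeds the filter.
    static List<String> analyze(String text, Map<String, List<String>> dictionary) {
        return lemmatize(tokenize(text), dictionary);
    }

    public static void main(String[] args) {
        // Toy dictionary mapping inflected Polish forms to lemmas.
        Map<String, List<String>> toy = new HashMap<>();
        toy.put("koty", Arrays.asList("kot"));
        toy.put("domy", Arrays.asList("dom"));
        System.out.println(analyze("koty, domy, mleko", toy)); // prints [kot, dom, mleko]
    }
}
```

The sketch keeps only one idea: the filter stage sees tokens, never raw text, so the dictionary lookup is independent of how the text was split.
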

Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStream

public MorfologikAnalyzer(morfologik.stemming.Dictionary dictionary)

Parameters:
dictionaryResource - A constant specifying which dictionary to choose. The dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict and have an associated .info metadata file. See the Morfologik project for details.

public MorfologikAnalyzer()

protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)

Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.

Specified by: createComponents in class Analyzer

Parameters:
field - ignored field name
reader - source of tokens

Returns: Analyzer.TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter and MorfologikFilter.

Copyright © 2000–2021 The Apache Software Foundation. All rights reserved.