Package org.apache.lucene.analysis.ko
Class KoreanAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.ko.KoreanAnalyzer
- All Implemented Interfaces:
Closeable,AutoCloseable
Analyzer for Korean that uses morphological analysis.
- Since:
- 7.4.0
- See Also:
- WARNING: This API is experimental and might change in incompatible ways in the next release.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents -
Field Summary
Fields inherited from class org.apache.lucene.analysis.Analyzer
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY -
Constructor Summary
ConstructorsConstructorDescriptionCreates a new KoreanAnalyzer.KoreanAnalyzer(UserDictionary userDict, KoreanTokenizer.DecompoundMode mode, Set<POS.Tag> stopTags, boolean outputUnknownUnigrams) Creates a new KoreanAnalyzer. -
Method Summary
Modifier and TypeMethodDescriptionprotected Analyzer.TokenStreamComponentscreateComponents(String fieldName) protected TokenStreamnormalize(String fieldName, TokenStream in) Methods inherited from class org.apache.lucene.analysis.Analyzer
attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, initReaderForNormalization, normalize, tokenStream, tokenStream
-
Constructor Details
-
KoreanAnalyzer
public KoreanAnalyzer()Creates a new KoreanAnalyzer. -
KoreanAnalyzer
public KoreanAnalyzer(UserDictionary userDict, KoreanTokenizer.DecompoundMode mode, Set<POS.Tag> stopTags, boolean outputUnknownUnigrams) Creates a new KoreanAnalyzer.- Parameters:
userDict- Optional: if non-null, user dictionary.mode- Decompound mode.stopTags- The set of part of speech that should be filtered.outputUnknownUnigrams- If true outputs unigrams for unknown words.
-
-
Method Details
-
createComponents
- Specified by:
createComponentsin classAnalyzer
-
normalize
-