All Classes
Class: Description

DefaultICUTokenizerConfig: Default ICUTokenizerConfig that is generally applicable to many languages.
ICUCollatedTermAttributeImpl: Extension of CharTermAttributeImpl that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes.
ICUCollationAttributeFactory: Converts each token into its CollationKey, and then encodes bytes as an index term.
ICUCollationDocValuesField: Indexes collation keys as a single-valued SortedDocValuesField.
ICUCollationKeyAnalyzer: Configures KeywordTokenizer with ICUCollationAttributeFactory.
ICUFoldingFilter: A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.
ICUFoldingFilterFactory: Factory for ICUFoldingFilter.
ICUNormalizer2CharFilter: Normalize token text with ICU's Normalizer2.
ICUNormalizer2CharFilterFactory: Factory for ICUNormalizer2CharFilter.
ICUNormalizer2Filter: Normalize token text with ICU's Normalizer2.
ICUNormalizer2FilterFactory: Factory for ICUNormalizer2Filter.
ICUTokenizer: Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/).
ICUTokenizerConfig: Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.
ICUTokenizerFactory: Factory for ICUTokenizer.
ICUTransformFilter: A TokenFilter that transforms text with ICU.
ICUTransformFilterFactory: Factory for ICUTransformFilter.
ScriptAttribute: This attribute stores the UTR #24 script value for a token of text.
ScriptAttributeImpl: Implementation of ScriptAttribute that stores the script as an integer.
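
The collation classes listed above encode each term as a binary collation key rather than as UTF-8 bytes, so that plain byte order matches the collator's order. The idea can be sketched with the JDK alone. This is a minimal illustration, not the Lucene or ICU API: it uses java.text.Collator as a stand-in for ICU's collator, and the class CollationKeySketch and its methods keyBytes and sortByKeyBytes are hypothetical names introduced here.

```java
import java.text.Collator;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch class: not part of Lucene or ICU.
final class CollationKeySketch {

    // Encode a term as its binary collation key. In this sketch the bytes
    // play the role of the index term that would otherwise be UTF-8 text.
    static byte[] keyBytes(Collator collator, String term) {
        return collator.getCollationKey(term).toByteArray();
    }

    // Sort terms by comparing only their key bytes (unsigned, lexicographic).
    // No collator comparison happens here beyond producing the keys.
    static List<String> sortByKeyBytes(List<String> terms, Collator collator) {
        List<String> sorted = new ArrayList<>(terms);
        sorted.sort((a, b) ->
                Arrays.compareUnsigned(keyBytes(collator, a), keyBytes(collator, b)));
        return sorted;
    }

    public static void main(String[] args) {
        // Assumption: the JDK's English collator stands in for an ICU collator.
        Collator collator = Collator.getInstance(Locale.ENGLISH);
        List<String> terms = List.of("banana", "Apple", "cherry", "apple");
        System.out.println(sortByKeyBytes(terms, collator));
    }
}
```

Sorting by key bytes should give the same order as sorting with the collator directly, which is what makes a byte-ordered index usable for locale-sensitive sorting and range queries.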