Package org.apache.lucene.classification
Uses already-seen data (the indexed documents) to classify an input, which can be simple text or a
structured document.
Currently contains a (simplistic) Naive Bayes classifier, a k-Nearest Neighbor classifier, and a perceptron-based classifier.
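
To make the idea of classifying from already-seen data concrete, here is a minimal sketch of a multinomial Naive Bayes classifier in plain Java. It is an illustration only and does not use the Lucene API: the class name ToyNaiveBayes, the whitespace tokenizer, the add-one smoothing, and the in-memory maps (standing in for an index) are all assumptions made for this example, not details of the package's implementation.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical toy classifier; NOT the Lucene SimpleNaiveBayesClassifier. */
public class ToyNaiveBayes {
    // Number of training documents seen per class label.
    private final Map<String, Integer> docsPerClass = new HashMap<>();
    // Per class: how often each term occurred in that class's documents.
    private final Map<String, Map<String, Integer>> termCounts = new HashMap<>();
    // Per class: total number of term occurrences.
    private final Map<String, Integer> tokensPerClass = new HashMap<>();
    // All distinct terms seen so far (used for add-one smoothing).
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    // Assumption: lowercase + whitespace splitting stands in for a real analyzer.
    private static String[] tokenize(String text) {
        return text.toLowerCase().trim().split("\\s+");
    }

    /** Records one labeled document (the "already seen" data). */
    public void add(String text, String label) {
        totalDocs++;
        docsPerClass.merge(label, 1, Integer::sum);
        Map<String, Integer> counts = termCounts.computeIfAbsent(label, k -> new HashMap<>());
        for (String term : tokenize(text)) {
            if (term.isEmpty()) {
                continue;
            }
            counts.merge(term, 1, Integer::sum);
            tokensPerClass.merge(label, 1, Integer::sum);
            vocabulary.add(term);
        }
    }

    /** Returns the label with the highest log-probability, or null if nothing was added. */
    public String classify(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String label : docsPerClass.keySet()) {
            // log P(class): fraction of training documents with this label.
            double score = Math.log(docsPerClass.get(label) / (double) totalDocs);
            Map<String, Integer> counts = termCounts.get(label);
            int classTokens = tokensPerClass.getOrDefault(label, 0);
            for (String term : tokenize(text)) {
                if (term.isEmpty()) {
                    continue;
                }
                // log P(term | class) with add-one (Laplace) smoothing.
                int count = counts.getOrDefault(term, 0);
                score += Math.log((count + 1.0) / (classTokens + vocabulary.size()));
            }
            if (score > bestScore) {
                bestScore = score;
                best = label;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        ToyNaiveBayes nb = new ToyNaiveBayes();
        nb.add("the striker scored a late goal", "sports");
        nb.add("the team won the match", "sports");
        nb.add("the parliament passed a new law", "politics");
        nb.add("the minister won the election", "politics");
        System.out.println(nb.classify("a goal in the match")); // prints "sports"
    }
}
```

In the package itself the training data is an index of documents rather than in-memory maps. The sketch only shows the scoring idea: a class prior multiplied by smoothed per-term likelihoods, computed in log space.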
Interface Summary

Interface
    Description
Classifier<T>
    A classifier (see http://en.wikipedia.org/wiki/Classifier_(mathematics)) which assigns classes of type T.
Class Summary

Class
    Description
BM25NBClassifier
    A classifier approximating a naive Bayes classifier by using pure queries on BM25.
BooleanPerceptronClassifier
CachingNaiveBayesClassifier
    A simplistic Lucene-based NaiveBayes classifier with a caching feature, see http://en.wikipedia.org/wiki/Naive_Bayes_classifier
ClassificationResult<T>
    The result of a call to Classifier.assignClass(String), holding an assigned class of type T and a score.
KNearestFuzzyClassifier
    A k-Nearest Neighbor classifier based on NearestFuzzyQuery.
KNearestNeighborClassifier
    A k-Nearest Neighbor classifier (see http://en.wikipedia.org/wiki/K-nearest_neighbors) based on MoreLikeThis.
SimpleNaiveBayesClassifier
    A simplistic Lucene-based NaiveBayes classifier, see http://en.wikipedia.org/wiki/Naive_Bayes_classifier
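
The k-Nearest Neighbor entries above share one idea: find the k most similar already-seen documents and let their labels vote. The following is a minimal plain-Java sketch of that voting step. It does not use the Lucene API: the class name ToyKnn is hypothetical, and Jaccard overlap of token sets is an assumed stand-in for the similarity that MoreLikeThis or NearestFuzzyQuery would provide over an index.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical toy k-NN classifier; NOT the Lucene KNearestNeighborClassifier. */
public class ToyKnn {
    // Each example is a {text, label} pair; a real index would hold documents instead.
    private final List<String[]> examples = new ArrayList<>();
    private final int k;

    public ToyKnn(int k) {
        this.k = k;
    }

    public void add(String text, String label) {
        examples.add(new String[] {text, label});
    }

    // Assumption: lowercase + whitespace splitting stands in for a real analyzer.
    private static Set<String> tokens(String text) {
        return new HashSet<>(Arrays.asList(text.toLowerCase().trim().split("\\s+")));
    }

    /** Jaccard similarity: |intersection| / |union| of the two token sets. */
    static double jaccard(Set<String> a, Set<String> b) {
        Set<String> inter = new HashSet<>(a);
        inter.retainAll(b);
        Set<String> union = new HashSet<>(a);
        union.addAll(b);
        return union.isEmpty() ? 0.0 : inter.size() / (double) union.size();
    }

    /** Majority vote among the k most similar examples; null if there are none. */
    public String classify(String text) {
        Set<String> query = tokens(text);
        List<String[]> sorted = new ArrayList<>(examples);
        // Most similar first; the sort is stable, so ties keep insertion order.
        sorted.sort((x, y) -> Double.compare(
                jaccard(tokens(y[0]), query), jaccard(tokens(x[0]), query)));
        Map<String, Integer> votes = new HashMap<>();
        String best = null;
        for (String[] ex : sorted.subList(0, Math.min(k, sorted.size()))) {
            int v = votes.merge(ex[1], 1, Integer::sum);
            // On a tie in votes, the label that reached the count first is kept.
            if (best == null || v > votes.get(best)) {
                best = ex[1];
            }
        }
        return best;
    }

    public static void main(String[] args) {
        ToyKnn knn = new ToyKnn(3);
        knn.add("the striker scored a late goal", "sports");
        knn.add("the team won the match", "sports");
        knn.add("the parliament passed a new law", "politics");
        knn.add("the minister won the election", "politics");
        System.out.println(knn.classify("the team scored a goal")); // prints "sports"
    }
}
```

With k = 3, the three nearest examples in this run are two "sports" documents and one "politics" document, so the vote goes to "sports". The sketch scans and sorts every example on each call, which is fine for a handful of documents but would not scale; a search index exists precisely to retrieve the nearest documents without a full scan.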