com.solidstategroup.search
Class StopWordsAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
com.solidstategroup.search.StopWordsAnalyzer
public class StopWordsAnalyzer
- extends org.apache.lucene.analysis.Analyzer
StopWordsAnalyzer processes input
text by stemming English words to their roots.
It also converts the input to lower case
and removes stop words. A small default set of stop
words is defined in the STOP_WORDS
array, but a caller can supply an alternative set
of stop words by calling the non-default constructor.
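The three steps described above (lower-casing, stop-word removal, stemming) can be illustrated with a minimal plain-Java sketch. It is not the StopWordsAnalyzer implementation and does not use Lucene: the class name StopWordPipelineSketch, the SAMPLE_STOP_WORDS list, the toyStem helper, and the order of the steps are all assumptions made for illustration. In particular, toyStem strips only two suffixes and merely stands in for whatever stemmer the real class uses.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not part of com.solidstategroup.search.
final class StopWordPipelineSketch {

    // Assumed sample list; the actual contents of STOP_WORDS are not shown on this page.
    static final String[] SAMPLE_STOP_WORDS = {"a", "an", "and", "the", "of", "to", "is", "in"};

    // Toy stand-in for a real stemmer: strips a trailing "ing" or a plural "s".
    static String toyStem(String word) {
        if (word.length() > 5 && word.endsWith("ing")) {
            return word.substring(0, word.length() - 3);
        }
        if (word.length() > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Splits on non-alphanumeric characters, lower-cases each token,
    // drops tokens found in stopWords, then stems what remains.
    static List<String> analyze(String text, String[] stopWords) {
        Set<String> stops = new HashSet<>(Arrays.asList(stopWords));
        List<String> terms = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (raw.isEmpty()) {
                continue; // split() yields an empty first element when text starts with a delimiter
            }
            String lower = raw.toLowerCase(Locale.ROOT);
            if (stops.contains(lower)) {
                continue;
            }
            terms.add(toyStem(lower));
        }
        return terms;
    }

    public static void main(String[] args) {
        // Default-style list: "the" and "of" are removed.
        System.out.println(analyze("The Indexing of Documents", SAMPLE_STOP_WORDS)); // prints [index, document]
        // Alternative list supplied by the caller, mirroring the non-default constructor idea.
        System.out.println(analyze("The Cats", new String[] {"cats"})); // prints [the]
    }
}
```

Passing the stop-word array as a parameter mirrors the choice the class description offers between the default STOP_WORDS array and a caller-supplied set.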
Field Summary
static java.lang.String[] STOP_WORDS
- An array containing some common English words
that are usually not useful for searching.
Method Summary
org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String field,
java.io.Reader reader)
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, tokenStream
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
STOP_WORDS
public static final java.lang.String[] STOP_WORDS
- An array containing some common English words
that are usually not useful for searching.
StopWordsAnalyzer
public StopWordsAnalyzer()
tokenStream
public org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String field,
java.io.Reader reader)
- Overrides:
tokenStream in class org.apache.lucene.analysis.Analyzer