com.solidstategroup.search
Class StopWordsAnalyzer

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by com.solidstategroup.search.StopWordsAnalyzer

public class StopWordsAnalyzer
extends org.apache.lucene.analysis.Analyzer

StopWordsAnalyzer processes input text by stemming English words to their roots. This Analyzer also converts the input to lower case and removes stop words. A small set of default stop words is defined in the STOP_WORDS array, but a caller can specify an alternative set of stop words by calling non-default constructor.


Field Summary
static java.lang.String[] STOP_WORDS
          An array containing some common English words that are usually not useful for searching.
 
Constructor Summary
StopWordsAnalyzer()
           
 
Method Summary
 org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String field, java.io.Reader reader)
           
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
getPositionIncrementGap, tokenStream
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

STOP_WORDS

public static final java.lang.String[] STOP_WORDS
An array containing some common English words that are usually not useful for searching.

Constructor Detail

StopWordsAnalyzer

public StopWordsAnalyzer()
Method Detail

tokenStream

public org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String field,
                                                          java.io.Reader reader)
Overrides:
tokenStream in class org.apache.lucene.analysis.Analyzer