java.lang.Object
org.apache.lucene.analysis.Analyzer
fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
fr.gouv.culture.sdx.search.lucene.analysis.Analyzer_br
Analyzer for Brazilian Portuguese. Supports an external list of stop words (words that are not indexed at all) and an external list of exclusions (words that are indexed but not stemmed).
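The difference between the two lists can be illustrated with a minimal, self-contained sketch. It does not use the SDX or Lucene classes: `StopAndExclusionSketch`, `analyze`, and `toyStem` are hypothetical names, and the stemmer is a deliberately naive stand-in (it only strips a trailing "s"), not the stemming this analyzer performs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Illustration only: a stand-in for the stop word / exclusion behavior described above.
final class StopAndExclusionSketch {

    // Hypothetical toy stemmer: strips one trailing "s" from words longer than three letters.
    static String toyStem(String word) {
        if (word.length() > 3 && word.endsWith("s")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Returns the terms that would be indexed:
    // stop words are dropped, excluded words are kept unstemmed, all other words are stemmed.
    static List<String> analyze(String text, Set<String> stopWords, Set<String> exclusions) {
        List<String> indexed = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (token.isEmpty() || stopWords.contains(token)) {
                continue; // empty split artifact or stop word: not indexed at all
            }
            indexed.add(exclusions.contains(token) ? token : toyStem(token));
        }
        return indexed;
    }

    public static void main(String[] args) {
        Set<String> stopWords = new HashSet<>(Arrays.asList("as", "de"));
        Set<String> exclusions = new HashSet<>(Arrays.asList("santos"));
        System.out.println(analyze("As casas de Santos", stopWords, exclusions));
        // prints [casa, santos]
    }
}
```

Here "as" and "de" never reach the index, "casas" is stemmed, and "santos" is indexed unchanged because it is on the exclusion list.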
Field Summary
protected static java.lang.String ANALYZER_TYPE
static java.lang.String[] BRAZILIAN_STOP_WORDS
List of typical Brazilian stop words.
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
ATTRIBUTE_EXCLUDE_STEMS, ATTRIBUTE_USE_STOP_WORDS, DEFAULT_STOP_WORDS, EXCLUDE_STEM_ELEMENT, EXCLUDE_STEMS_ELEMENT, excludeTable, stopTable
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
logger
Constructor Summary
Analyzer_br(java.io.File stopwords)
Builds an analyzer with the stop words read from the given file.
Analyzer_br(java.util.Hashtable stopwords)
Builds an analyzer with the stop words held in the given Hashtable.
Analyzer_br(java.lang.String[] stopwords)
Builds an analyzer with the stop words in the given array.
Method Summary
protected java.lang.String getAnalyzerType()
void setStemExclusionTable(java.io.File exclusionlist)
Builds an exclusion list from the words contained in the given file.
void setStemExclusionTable(java.util.Hashtable exclusionlist)
Builds an exclusion list from a Hashtable.
void setStemExclusionTable(java.lang.String[] exclusionlist)
Builds an exclusion list from an array of Strings.
org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
buildExcludeTable, buildStopTable, configure, getDefaultStopWords
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
enableLogging, toSAX
Methods inherited from class org.apache.lucene.analysis.Analyzer
tokenStream
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface fr.gouv.culture.sdx.search.lucene.analysis.Analyzer
tokenStream
Field Detail
protected static final java.lang.String ANALYZER_TYPE
public static final java.lang.String[] BRAZILIAN_STOP_WORDS
Constructor Detail
public Analyzer_br(java.lang.String[] stopwords)
public Analyzer_br(java.util.Hashtable stopwords)
public Analyzer_br(java.io.File stopwords) throws java.io.IOException
Method Detail
public void setStemExclusionTable(java.lang.String[] exclusionlist)
public void setStemExclusionTable(java.util.Hashtable exclusionlist)
public void setStemExclusionTable(java.io.File exclusionlist) throws java.io.IOException
Throws: java.io.IOException
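The three setStemExclusionTable overloads accept the same information in different forms. The following is a minimal sketch of how an array or a file of words could be normalized into a Hashtable. It is not the SDX implementation: `ExclusionTableSketch`, `fromArray`, and `fromFile` are hypothetical names, and the file format (one word per line, UTF-8, blank lines ignored) and the word-to-word mapping are assumptions, since this page does not describe them.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.Arrays;
import java.util.Hashtable;

// Illustration only: normalizes word lists into a Hashtable keyed by word.
final class ExclusionTableSketch {

    // Builds a table from an array of words; each word maps to itself (assumed layout).
    static Hashtable<String, String> fromArray(String[] words) {
        Hashtable<String, String> table = new Hashtable<>();
        for (String word : words) {
            table.put(word, word);
        }
        return table;
    }

    // Builds a table from a file, assuming one word per line in UTF-8.
    static Hashtable<String, String> fromFile(File wordList) throws IOException {
        Hashtable<String, String> table = new Hashtable<>();
        try (BufferedReader in =
                Files.newBufferedReader(wordList.toPath(), StandardCharsets.UTF_8)) {
            String line;
            while ((line = in.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    table.put(word, word); // skip blank lines, keep everything else
                }
            }
        }
        return table;
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("exclusions", ".txt");
        tmp.deleteOnExit();
        Files.write(tmp.toPath(), Arrays.asList("santos", "", "campinas"), StandardCharsets.UTF_8);
        System.out.println(fromArray(new String[] {"santos"}).containsKey("santos"));
        System.out.println(fromFile(tmp).keySet());
    }
}
```

Reading the file can fail, which is consistent with only the java.io.File overload declaring java.io.IOException.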
public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Specified by: tokenStream in interface Analyzer
Overrides: tokenStream in class DefaultAnalyzer
protected java.lang.String getAnalyzerType()
Overrides: getAnalyzerType in class DefaultAnalyzer