Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.cn.smart | Analyzer for Simplified Chinese, which indexes words. |
org.apache.lucene.analysis.compound | A filter that decomposes the compound words found in many Germanic languages into their word parts. |
org.apache.lucene.analysis.ga | Analysis for Irish. |
org.apache.lucene.analysis.ja | Analyzer for Japanese. |
org.apache.lucene.analysis.pt | Analyzer for Portuguese. |

Modifier and Type | Field | Description |
---|---|---|
static CharArraySet | CharArraySet.EMPTY_SET | |
protected CharArraySet | StopwordAnalyzerBase.stopwords | An immutable stopword set. |

Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | CharArraySet.copy(Set<?> set) | Deprecated. Use copy(Version, Set) instead. |
static CharArraySet | CharArraySet.copy(Version matchVersion, Set<?> set) | Returns a copy of the given set as a CharArraySet. |
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, CharArraySet result) | Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, Version matchVersion) | Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, CharArraySet result) | Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, Version matchVersion) | Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, CharArraySet result) | Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, Version matchVersion) | Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
CharArraySet | CharArrayMap.keySet() | Returns a CharArraySet view on the map's keys. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(boolean ignoreCase, Class<? extends ReusableAnalyzerBase> aClass, String resource, String comment) | Creates a CharArraySet from a file resource associated with a class. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(File stopwords, Version matchVersion) | Creates a CharArraySet from a file. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(Reader stopwords, Version matchVersion) | Creates a CharArraySet from a Reader. |
static CharArraySet | CharArraySet.unmodifiableSet(CharArraySet set) | Returns an unmodifiable CharArraySet. |

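
As a rough illustration of the word-list loading described for WordlistLoader.getWordSet(Reader, String, ...), the sketch below reproduces the documented behavior using only the standard library. It is not Lucene's implementation: a java.util.Set of String stands in for CharArraySet, the names WordListSketch and readWords are hypothetical, and skipping blank lines is an assumption of this sketch.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

// Illustration only: a plain Set<String> stands in for CharArraySet.
final class WordListSketch {

    // Adds every line that does not start with the comment prefix to the
    // result, with leading and trailing whitespace removed.
    static Set<String> readWords(Reader reader, String comment) {
        Set<String> result = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                if (comment != null && line.startsWith(comment)) {
                    continue; // comment line
                }
                String word = line.trim();
                if (!word.isEmpty()) { // assumption: blank lines are ignored
                    result.add(word);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    public static void main(String[] args) {
        String list = "# English stopwords\n  the \nand\n";
        System.out.println(readWords(new StringReader(list), "#")); // prints [the, and]
    }
}
```

Passing null as the comment prefix makes the sketch behave like the overloads without a comment parameter, where every line becomes an entry.
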
Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, CharArraySet result) | Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, CharArraySet result) | Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, CharArraySet result) | Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | CharArraySet.unmodifiableSet(CharArraySet set) | Returns an unmodifiable CharArraySet. |

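
The overloads above that accept a result set fill the set passed in and return it. The sketch below shows that pattern for a Snowball-style stopword list, assuming the usual Snowball conventions: a vertical bar starts a comment that runs to the end of the line, and a line may hold several whitespace-separated words. The names SnowballListSketch and readSnowball are hypothetical, and a java.util.Set of String again stands in for CharArraySet.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

// Illustration only: not Lucene's implementation.
final class SnowballListSketch {

    // Adds every word found in the reader to `result` and returns that same set.
    static Set<String> readSnowball(Reader reader, Set<String> result) {
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                int bar = line.indexOf('|');
                if (bar >= 0) {
                    line = line.substring(0, bar); // drop the trailing comment
                }
                for (String word : line.trim().split("\\s+")) {
                    if (!word.isEmpty()) {
                        result.add(word);
                    }
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    public static void main(String[] args) {
        String list = " | A Snowball-style list\nder die | articles\ndas\n";
        Set<String> stopwords = readSnowball(new StringReader(list), new LinkedHashSet<>());
        System.out.println(stopwords); // prints [der, die, das]
    }
}
```

Because the caller supplies the set, properties chosen at construction time (for a real CharArraySet, for example, whether it ignores case) are decided by the caller rather than by the loader.
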
Constructor | Description |
---|---|
KeywordMarkerFilter(TokenStream in, CharArraySet keywordSet) | Creates a new KeywordMarkerFilter that marks the current token as a keyword, via the KeywordAttribute, if the token's term buffer is contained in the given set. |
MockAnalyzer(Random random, int pattern, boolean lowerCase, CharArraySet filter, boolean enablePositionIncrements) | Creates a new MockAnalyzer. |

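
The keyword-marking idea behind KeywordMarkerFilter can be sketched without Lucene as follows. This is a simplified model, not the filter itself: the Token class, the KeywordMarkerSketch name, and the markKeywords method are hypothetical, a boolean field plays the role of the KeywordAttribute, and a java.util.Set of String stands in for CharArraySet.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustration only: models tokens as plain objects instead of a TokenStream.
final class KeywordMarkerSketch {

    // Hypothetical token: term text plus a flag standing in for KeywordAttribute.
    static final class Token {
        final String term;
        final boolean keyword;

        Token(String term, boolean keyword) {
            this.term = term;
            this.keyword = keyword;
        }
    }

    // Marks a token as a keyword when its term is contained in the given set.
    static List<Token> markKeywords(List<String> terms, Set<String> keywordSet) {
        List<Token> out = new ArrayList<>();
        for (String term : terms) {
            out.add(new Token(term, keywordSet.contains(term)));
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> keywords = new HashSet<>(Arrays.asList("dublin"));
        for (Token t : markKeywords(Arrays.asList("running", "dublin"), keywords)) {
            System.out.println(t.term + " keyword=" + t.keyword);
        }
    }
}
```

A later stage, such as a stemmer, can then check the flag and leave marked tokens unchanged.
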
Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | SmartChineseAnalyzer.getDefaultStopSet() | Returns an unmodifiable instance of the default stop-words set. |

Modifier and Type | Field | Description |
---|---|---|
protected CharArraySet | CompoundWordTokenFilterBase.dictionary | |

Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | CompoundWordTokenFilterBase.makeDictionary(Version matchVersion, String[] dictionary) | Deprecated. Only available for backwards compatibility. |

Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | IrishAnalyzer.getDefaultStopSet() | Returns an unmodifiable instance of the default stop words set. |

Constructor | Description |
---|---|
IrishAnalyzer(Version matchVersion, CharArraySet stopwords) | Builds an analyzer with the given stop words. |
IrishAnalyzer(Version matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet) | Builds an analyzer with the given stop words. If a non-empty stem exclusion set is provided, this analyzer adds a KeywordMarkerFilter before stemming. |

Modifier and Type | Method | Description |
---|---|---|
static CharArraySet | JapaneseAnalyzer.getDefaultStopSet() | |

Constructor | Description |
---|---|
JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags) | |

Modifier and Type | Field | Description |
---|---|---|
protected CharArraySet | RSLPStemmerBase.RuleWithSetExceptions.exceptions | |

Copyright © 2000-2018 Apache Software Foundation. All Rights Reserved.