public final class ThaiWordFilter
extends org.apache.lucene.analysis.TokenFilter
TokenFilter that uses a BreakIterator to split each
Token containing Thai text into separate Tokens, one for each Thai word.
Please note: As of matchVersion 3.1, this filter no longer lowercases non-Thai text.
ThaiAnalyzer inserts a LowerCaseFilter before this filter,
so the behaviour of the Analyzer does not change. As of version 3.1, the filter also handles
position increments correctly.
WARNING: this filter may not be supported by all JREs. It is known to work with Sun/Oracle and Harmony JREs. If your application needs to be fully portable, consider using ICUTokenizer instead, which uses an ICU Thai BreakIterator that will always be available.
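
The JRE dependency described above can be explored with the JDK alone. The following is a minimal sketch, not Lucene code: the class name `ThaiBreakSketch` and its methods are hypothetical, and the probe (checking for a word boundary inside the two-word string "ภาษาไทย") is an assumption about how a check like DBBI_AVAILABLE could work, not a statement of how the filter implements it. The result depends on the JRE, so no output is shown.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Illustrative JDK-only sketch; not part of Lucene. */
final class ThaiBreakSketch {

    /** Creates a word BreakIterator for the Thai locale. */
    static BreakIterator thaiWordIterator() {
        return BreakIterator.getWordInstance(Locale.forLanguageTag("th"));
    }

    /**
     * Assumed probe: "ภาษาไทย" consists of two words ("ภาษา" + "ไทย"), so a
     * dictionary-based iterator should report a boundary at offset 4.
     */
    static boolean thaiDictionaryLooksAvailable() {
        BreakIterator bi = thaiWordIterator();
        bi.setText("ภาษาไทย");
        return bi.isBoundary(4);
    }

    /** Splits text at word boundaries, keeping only pieces that start with a letter or digit. */
    static List<String> splitWords(String text) {
        BreakIterator bi = thaiWordIterator();
        bi.setText(text);
        List<String> words = new ArrayList<String>();
        int start = bi.first();
        for (int end = bi.next(); end != BreakIterator.DONE; start = end, end = bi.next()) {
            String piece = text.substring(start, end);
            if (Character.isLetterOrDigit(piece.codePointAt(0))) {
                words.add(piece);
            }
        }
        return words;
    }

    public static void main(String[] args) {
        System.out.println("Thai dictionary breaks available: " + thaiDictionaryLooksAvailable());
        System.out.println(splitWords("ภาษาไทย hello"));
    }
}
```

On a JRE without Thai dictionary data, the Thai run is likely to come back as a single piece, which is the situation the warning above describes.
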
| Modifier and Type | Field and Description |
|---|---|
| static boolean | DBBI_AVAILABLE: True if the JRE supports a working dictionary-based BreakIterator for Thai. |
| Constructor and Description |
|---|
| ThaiWordFilter(org.apache.lucene.analysis.TokenStream input): Deprecated. Use the constructor that takes a matchVersion instead. |
| ThaiWordFilter(org.apache.lucene.util.Version matchVersion, org.apache.lucene.analysis.TokenStream input): Creates a new ThaiWordFilter with the specified match version. |
| Modifier and Type | Method and Description |
|---|---|
| boolean | incrementToken() |
| void | reset() |
Methods inherited from class org.apache.lucene.util.AttributeSource: addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString

public static final boolean DBBI_AVAILABLE

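@Deprecated
public ThaiWordFilter(org.apache.lucene.analysis.TokenStream input)
Deprecated. Use the constructor that takes a matchVersion instead.

public ThaiWordFilter(org.apache.lucene.util.Version matchVersion,
                      org.apache.lucene.analysis.TokenStream input)
Creates a new ThaiWordFilter with the specified match version.
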
public boolean incrementToken()
                       throws IOException
Specified by: incrementToken in class org.apache.lucene.analysis.TokenStream
Throws: IOException

public void reset()
           throws IOException
Overrides: reset in class org.apache.lucene.analysis.TokenFilter
Throws: IOException

Copyright © 2000-2012 Apache Software Foundation. All Rights Reserved.