public class TwitterTokenizer extends EnglishTokenizer
stemExclusionsSet, stopWords| Constructor and Description |
|---|
TwitterTokenizer() |
TwitterTokenizer(int minNGram,
int maxNGram) |
| Modifier and Type | Method and Description |
|---|---|
protected String |
preprocess(String tweet) |
List<String> |
tokenize(String text) |
createTokenStream, getMaxNGram, getMinNGram, getStemExclusionsSet, getStopWords, isnGram, setMaxNGram, setMinNGram, setnGram, setNGram, setStemExclusionsSet, setStopWordspublic TwitterTokenizer()
public TwitterTokenizer(int minNGram,
int maxNGram)
public List<String> tokenize(String text)
tokenize in interface TextTokenizertokenize in class EnglishTokenizerCopyright © 2014. All rights reserved.