Package org.languagetool.tagging.ar
Class ArabicTagger
java.lang.Object
org.languagetool.tagging.BaseTagger
org.languagetool.tagging.ar.ArabicTagger
- All Implemented Interfaces:
Tagger
- Since:
- 4.9
-
Field Summary
Fields
Fields inherited from class org.languagetool.tagging.BaseTagger
locale, wordTagger
-
Constructor Summary
Constructors -
Method Summary
Modifier and Type, Method, Description
protected List<AnalyzedToken> additionalTags(String word, morfologik.stemming.IStemmer stemmer)
private void addTokens(List<AnalyzedToken> taggedTokens, List<AnalyzedToken> l)
void enableNewStylePronounTag()
getEnclitic(AnalyzedToken token)
getJarProclitic(AnalyzedToken token)
getLemmas(AnalyzedTokenReadings patternTokens, String type)
private String getPrefixIndexList(String possibleWord)
getProclitic(AnalyzedToken token)
private String getSuffixIndexList(String possibleWord)
private boolean isStopWord(List<AnalyzedToken> taggerTokens)
tag: Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag).
Methods inherited from class org.languagetool.tagging.BaseTagger
additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, getWordTagger, overwriteWithManualTagger
-
Field Details
-
tagmanager
-
newStylePronounTag
private boolean newStylePronounTag
-
-
Constructor Details
-
ArabicTagger
public ArabicTagger()
-
-
Method Details
-
tag
Description copied from interface: Tagger
Returns a list of AnalyzedTokens that assigns each term in the sentence some kind of part-of-speech information (not necessarily just one tag). Note that this method takes exactly one sentence. Its implementation may implement special cases for the first word of a sentence, which is usually written with an uppercase letter.
- Specified by:
tag in interface Tagger
- Overrides:
tag in class BaseTagger
- Parameters:
sentenceTokens - the text as returned by a WordTokenizer
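The contract described above (one sentence in, one entry per token out, possibly several readings per token) can be illustrated with a minimal, self-contained sketch. It does not use LanguageTool: the class TagSketch, the type Reading, the toy lexicon and the tag names are all hypothetical stand-ins, and the lower-casing of the first word is only one assumed way to handle the sentence-initial special case.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Illustrative stand-in only: none of these types are LanguageTool classes.
class TagSketch {

    /** Hypothetical reading of one token; posTag and lemma are null when unknown. */
    static final class Reading {
        final String token;
        final String posTag;
        final String lemma;

        Reading(String token, String posTag, String lemma) {
            this.token = token;
            this.posTag = posTag;
            this.lemma = lemma;
        }
    }

    // Toy lexicon with made-up tags; a word may have more than one reading.
    static final Map<String, List<Reading>> LEXICON = new HashMap<>();
    static {
        LEXICON.put("book", List.of(
            new Reading("book", "NOUN", "book"),
            new Reading("book", "VERB", "book")));
        LEXICON.put("the", List.of(new Reading("the", "DET", "the")));
    }

    /** Tags exactly one sentence, given as already-tokenized words. */
    static List<List<Reading>> tag(List<String> sentenceTokens) {
        List<List<Reading>> result = new ArrayList<>();
        for (int i = 0; i < sentenceTokens.size(); i++) {
            String token = sentenceTokens.get(i);
            List<Reading> readings = LEXICON.get(token);
            // Special case for the first word: retry the lookup in lower case.
            if (readings == null && i == 0) {
                readings = LEXICON.get(token.toLowerCase(Locale.ROOT));
            }
            // Unknown words still get one entry, with no tag attached.
            if (readings == null) {
                readings = List.of(new Reading(token, null, null));
            }
            result.add(readings);
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<Reading>> tagged = tag(List.of("Book", "the", "xyz"));
        for (List<Reading> readings : tagged) {
            StringBuilder line = new StringBuilder();
            for (Reading r : readings) {
                line.append(r.token).append('/').append(r.posTag).append(' ');
            }
            System.out.println(line.toString().trim());
        }
    }
}
```

The result keeps one list per input token, so callers can rely on positions lining up with the tokenizer output even when a word is ambiguous or unknown.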
-
additionalTags
@Nullable protected List<AnalyzedToken> additionalTags(String word, morfologik.stemming.IStemmer stemmer)
-
addTokens
-
getSuffixIndexList
-
getPrefixIndexList
-
getTags
-
isStopWord
- Returns:
- whether the word is tagged as a stop word
-
getPrefix
-
getSuffix
-
getStem
-
enableNewStylePronounTag
public void enableNewStylePronounTag()
-
getProclitic
- Returns:
- if the token's tag marks a noun or verb that has proclitics, the leading prefix letters (the proclitic) for that case
-
getEnclitic
- Returns:
- if the token's tag marks a noun that has an attached pronoun, the suffix letters (the enclitic) for that case
-
getJarProclitic
- Returns:
- if the token's tag marks a noun that has proclitics, the leading prefix letters (the proclitic) for that case
-
tag
-
getLemmas
-