Package org.languagetool.tagging.uk
Class UkrainianTagger
java.lang.Object
org.languagetool.tagging.BaseTagger
org.languagetool.tagging.uk.UkrainianTagger
- All Implemented Interfaces:
Tagger
Ukrainian part-of-speech tagger.
See README for details, the POS tagset is described in tagset.txt
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate final CompoundTaggerprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final org.slf4j.Loggerprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final PatternFields inherited from class org.languagetool.tagging.BaseTagger
locale, wordTagger -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionadditionalTags(String word, WordTagger wordTagger) Allows additional tagging in some language-dependent circumstancesprotected List<AnalyzedToken> (package private) List<AnalyzedToken> asAnalyzedTokenListForTaggedWordsInternal(String word, List<TaggedWord> taggedWords) private List<AnalyzedToken> convertTokens(List<AnalyzedToken> origTokens, String word, String str, String dictStr, String additionalTag) private List<AnalyzedToken> getAdjustedAnalyzedTokens(String word, String adjustedWord, Pattern posTagRegex, String additionalTag, UnaryOperator<String> lemmaFunction) protected List<AnalyzedToken> getAnalyzedTokens(String word) Methods inherited from class org.languagetool.tagging.BaseTagger
asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getDictionary, getDictionaryPath, getManualAdditionsFileNames, getManualRemovalsFileNames, overwriteWithManualTagger, tag
-
Field Details
-
logger
private static final org.slf4j.Logger logger -
NUMBER
-
LATIN_NUMBER
-
LATIN_NUMBER_CYR
-
HASHTAG
-
DATE
-
TIME
-
ALT_DASHES_IN_WORD
-
COMPOUND_WITH_QUOTES_REGEX
-
COMPOUND_WITH_QUOTES_REGEX2
-
MISSING_APO
-
MISSING_HYPHEN
-
CAPS_INSIDE_WORD
-
PATTERN_MD
-
QUOTES
-
YI_PATTERN
-
compoundTagger
-
-
Constructor Details
-
UkrainianTagger
public UkrainianTagger()
-
-
Method Details
-
additionalTags
Description copied from class:BaseTaggerAllows additional tagging in some language-dependent circumstances- Overrides:
additionalTagsin classBaseTagger- Parameters:
word- The word to tag- Returns:
- Returns list of analyzed tokens with additional tags, or
null
-
getAnalyzedTokens
- Overrides:
getAnalyzedTokensin classBaseTagger
-
analyzeAllCapitamizedAdj
-
convertTokens
private List<AnalyzedToken> convertTokens(List<AnalyzedToken> origTokens, String word, String str, String dictStr, String additionalTag) -
getAdjustedAnalyzedTokens
private List<AnalyzedToken> getAdjustedAnalyzedTokens(String word, String adjustedWord, Pattern posTagRegex, String additionalTag, UnaryOperator<String> lemmaFunction) -
asAnalyzedTokenListForTaggedWordsInternal
List<AnalyzedToken> asAnalyzedTokenListForTaggedWordsInternal(String word, List<TaggedWord> taggedWords) -
getWordTagger
- Overrides:
getWordTaggerin classBaseTagger
-