Class UkrainianHybridDisambiguator
java.lang.Object
org.languagetool.tagging.disambiguation.AbstractDisambiguator
org.languagetool.tagging.disambiguation.uk.UkrainianHybridDisambiguator
- All Implemented Interfaces:
Disambiguator
Hybrid chunker-disambiguator for Ukrainian.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate final Disambiguatorprivate static final Patternprivate final Disambiguatorprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Stringprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate static final Patternprivate final SimpleDisambiguatorprivate static final Stringprivate static final Pattern -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate booleancanRemoveVmis(List<AnalyzedToken> analyzedTokens) private static voidcheckForInitialRetag(AnalyzedTokenReadings lastName, List<Integer> initialsIdxs, AnalyzedTokenReadings[] tokens) final AnalyzedSentencedisambiguate(AnalyzedSentence input) Calls two disambiguator classes: (1) a chunker; (2) a rule-based disambiguator.private voidprivate voiddisambiguateSt(AnalyzedSentence input) private static AnalyzedTokenReadingsgetInitialReadings(AnalyzedTokenReadings initialsReadings, AnalyzedTokenReadings lnameTokens, String initialType) private static booleanisInitial(AnalyzedTokenReadings[] tokens, int pos) private booleanlikelyVklyContext(AnalyzedTokenReadings[] tokens, int i) preDisambiguate(AnalyzedSentence input) If possible, filters out the wrong POS tags.private voidremoveInanimVKly(AnalyzedSentence input) private voidprivate voidprivate voidprivate static voidremoveTokensWithout(AnalyzedTokenReadings readings, Pattern pattern) private voidremoveVerbImpr(AnalyzedSentence input) private voidremoveVmis(AnalyzedSentence input) private voidretagFemNames(AnalyzedSentence input) private voidretagInitials(AnalyzedSentence input) private voidretagPulralProp(AnalyzedSentence input) private voidMethods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.languagetool.tagging.disambiguation.Disambiguator
disambiguate
-
Field Details
-
LAST_NAME_TAG
- See Also:
-
INITIAL_REGEX
-
INANIM_VKLY
-
PLURAL_NAME
-
PLURAL_LNAME_PATTERN
-
ST_ABBR
- See Also:
-
LATIN_DIGITS_PATTERN
-
DIGITS_PATTERN
-
STATION_NAME_PATTERN
-
PATTERN_1
-
PATTERN_2
-
PATTERN_3
-
PATTERN_4
-
PATTERN_5
-
chunker
-
disambiguator
-
simpleDisambiguator
-
V_MIS_PREPS
-
V_NON_MIS_PREPS
-
IGNORE_IN_PRON_POS
-
PUNCT_AFTER_KLY_PATTERN
-
LIKELY_V_KLY
-
-
Constructor Details
-
UkrainianHybridDisambiguator
public UkrainianHybridDisambiguator()
-
-
Method Details
-
disambiguate
Calls two disambiguator classes: (1) a chunker; (2) a rule-based disambiguator.- Parameters:
input- The sentence with already tagged words. The words are expected to have multiple tags.- Returns:
- Analyzed sentence, where each word has only one (possibly the most correct) tag.
- Throws:
IOException
-
preDisambiguate
Description copied from interface:DisambiguatorIf possible, filters out the wrong POS tags. This code will run before disambiguation rules from xml are called. This allows to have some initial disambiguation logic in Java.- Specified by:
preDisambiguatein interfaceDisambiguator- Overrides:
preDisambiguatein classAbstractDisambiguator- Parameters:
input- The sentence with already tagged words. The words are expected to have multiple tags.- Returns:
- Analyzed sentence, where each word has only one (possibly the most correct) tag.
-
removeVerbImpr
-
retagFemNames
-
removeVmis
-
canRemoveVmis
-
disambiguatePronPos
-
removeLowerCaseBadForUpperCaseGood
-
removeLowerCaseHomonymsForAbbreviations
-
removeInanimVKly
-
likelyVklyContext
-
removePluralForNames
-
retagInitials
-
retagUnknownInitials
-
checkForInitialRetag
private static void checkForInitialRetag(AnalyzedTokenReadings lastName, List<Integer> initialsIdxs, AnalyzedTokenReadings[] tokens) -
disambiguateSt
-
retagPulralProp
-
removeTokensWithout
-
getInitialReadings
private static AnalyzedTokenReadings getInitialReadings(AnalyzedTokenReadings initialsReadings, AnalyzedTokenReadings lnameTokens, String initialType) -
isInitial
-