Package org.languagetool.rules.pl
Class MorfologikPolishSpellerRule
java.lang.Object
org.languagetool.rules.Rule
org.languagetool.rules.spelling.SpellingCheckRule
org.languagetool.rules.spelling.morfologik.MorfologikSpellerRule
org.languagetool.rules.pl.MorfologikPolishSpellerRule
-
Field Summary
Fields (Modifier and Type, Field, Description)
bannedSuffixes: non-word suffixes that should not be suggested (only morphological endings, never after a space)
private static final Pattern POLISH_TOKENIZING_CHARS
prefixes: The set of prefixes that are not allowed to be split in the suggestions.
private static final String RESOURCE_FILENAME
private final UserConfig userConfig
Fields inherited from class org.languagetool.rules.spelling.morfologik.MorfologikSpellerRule
conversionLocale, globalConfig, motherTongue, speller1, speller2, speller3
Fields inherited from class org.languagetool.rules.spelling.SpellingCheckRule
CUSTOM_SPELLING_FILE, GLOBAL_SPELLING_FILE, HIGH_CONFIDENCE, ignoreWordsWithLength, language, languageModel, LANGUAGETOOL, LANGUAGETOOLER, MAX_TOKEN_LENGTH, wordListLoader, wordsToBeIgnored
-
Constructor Summary
Constructors
MorfologikPolishSpellerRule(ResourceBundle messages, Language language, UserConfig userConfig, List<Language> altLanguages)
-
Method Summary
Methods (Modifier and Type, Method, Description)
getFileName(): Get the filename, e.g., /resource/pl/spelling.dict.
getId(): A string used to identify the rule in e.g. configuration files.
protected List<RuleMatch> getRuleMatches(String word, int startPos, AnalyzedSentence sentence, List<RuleMatch> ruleMatchesSoFar, int idx, AnalyzedTokenReadings[] tokens)
private boolean isNotCompound(String word): Check whether the word is a compound adjective or contains a non-splitting prefix.
private List<SuggestedReplacement> pruneSuggestions(List<SuggestedReplacement> suggestions): Remove suggestions that are not really run-on words, using a list of non-word suffixes.
tokenizingPattern(): Get the regular expression pattern used to tokenize the words as in the source dictionary.
Methods inherited from class org.languagetool.rules.spelling.morfologik.MorfologikSpellerRule
addHyphenSuggestions, getDescription, getSpellingSuggestions, getTranslator, ignoreWord, isMisspelled, isMisspelled, match, orderSuggestions, setCheckCompound, setCompoundRegex, setIgnoreTaggedWords, setLocale
Methods inherited from class org.languagetool.rules.spelling.SpellingCheckRule
acceptPhrases, addIgnoreTokens, addIgnoreWords, addProhibitedWords, addSuggestionsToRuleMatch, createWrongSplitMatch, expandLine, filterDupes, filterNoSuggestWords, filterSuggestions, getAdditionalProhibitFileNames, getAdditionalSpellingFileNames, getAdditionalSuggestions, getAdditionalTopSuggestions, getAntiPatterns, getIgnoreFileName, getLanguageVariantSpellingFileName, getOnlySuggestions, getProhibitFileName, getSpellingFileName, ignorePotentiallyMisspelledWord, ignoreToken, ignoreWord, init, isDictionaryBasedSpellingRule, isEMail, isIgnoredNoCase, isInIgnoredSet, isLatinScript, isProhibited, isUrl, setConsiderIgnoreWords, setConvertsCase, startsWithIgnoredWord, tokenizeNewWords
Methods inherited from class org.languagetool.rules.Rule
addExamplePair, addTags, addToneTags, cacheAntiPatterns, estimateContextForSureMatch, getCategory, getCorrectExamples, getDistanceTokens, getErrorTriggeringExamples, getFullId, getIncorrectExamples, getLocQualityIssueType, getMinPrevMatches, getPriority, getRuleOptions, getSentenceWithImmunization, getSourceFile, getSubId, getTags, getToneTags, getUrl, hasTag, hasToneTag, isDefaultOff, isDefaultTempOff, isGoalSpecific, isIncludedInHiddenMatches, isOfficeDefaultOff, isOfficeDefaultOn, isPremium, makeAntiPatterns, setCategory, setCorrectExamples, setDefaultOff, setDefaultOn, setDefaultTempOff, setDistanceTokens, setErrorTriggeringExamples, setExamplePair, setGoalSpecific, setIncludedInHiddenMatches, setIncorrectExamples, setLocQualityIssueType, setMinPrevMatches, setOfficeDefaultOff, setOfficeDefaultOn, setPremium, setPriority, setTags, setToneTags, setUrl, supportsLanguage, toRuleMatchArray, useInOffice
-
Field Details
-
RESOURCE_FILENAME
-
POLISH_TOKENIZING_CHARS
-
prefixes
The set of prefixes that are not allowed to be split in the suggestions. -
bannedSuffixes
non-word suffixes that should not be suggested (only morphological endings, never after a space) -
userConfig
-
-
Constructor Details
-
MorfologikPolishSpellerRule
public MorfologikPolishSpellerRule(ResourceBundle messages, Language language, UserConfig userConfig, List<Language> altLanguages) throws IOException
Throws:
IOException
-
-
Method Details
-
getFileName
Description copied from class: MorfologikSpellerRule
Get the filename, e.g., /resource/pl/spelling.dict.
Specified by:
getFileName in class MorfologikSpellerRule
-
getId
Description copied from class: Rule
A string used to identify the rule in e.g. configuration files. This string is supposed to be unique and to stay the same in all upcoming versions of LanguageTool. It's supposed to contain only the characters A-Z and the underscore.
Specified by:
getId in class MorfologikSpellerRule
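The character constraint stated for rule IDs can be checked mechanically. The following is a minimal sketch only: the class `RuleIdCheck`, the method `isValidRuleId`, and the sample IDs are hypothetical and not part of LanguageTool.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of LanguageTool.
final class RuleIdCheck {
    // Only the characters A-Z and the underscore, at least one character.
    private static final Pattern VALID_ID = Pattern.compile("[A-Z_]+");

    // Returns true if the ID is non-null and consists solely of A-Z and '_'.
    static boolean isValidRuleId(String id) {
        return id != null && VALID_ID.matcher(id).matches();
    }

    public static void main(String[] args) {
        System.out.println(isValidRuleId("EXAMPLE_SPELLER_RULE")); // true
        System.out.println(isValidRuleId("example-rule"));         // false
    }
}
```

A check like this could run in a unit test over all registered rules, so that an ID with lowercase letters or hyphens is caught before it reaches a configuration file.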
-
tokenizingPattern
Description copied from class: MorfologikSpellerRule
Get the regular expression pattern used to tokenize the words as in the source dictionary. For example, it may contain a hyphen, if the words with hyphens are not included in the dictionary.
Overrides:
tokenizingPattern in class MorfologikSpellerRule
Returns:
A compiled Pattern that is used to tokenize words or null.
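To illustrate how a caller might use such a pattern, here is a hedged sketch that splits a word with a compiled Pattern and treats null as "do not split". The class `TokenizingSketch`, the method `tokenize`, and the hyphen-only pattern are assumptions made for this example; the actual value of POLISH_TOKENIZING_CHARS is not shown on this page.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration; not the LanguageTool implementation.
final class TokenizingSketch {
    // Assumed pattern: split on hyphens only.
    static final Pattern HYPHEN = Pattern.compile("-");

    // Splits the word on the pattern; a null pattern means the word is checked as a whole.
    static List<String> tokenize(String word, Pattern tokenizingPattern) {
        if (tokenizingPattern == null) {
            return Collections.singletonList(word);
        }
        return Arrays.asList(tokenizingPattern.split(word));
    }

    public static void main(String[] args) {
        System.out.println(tokenize("polsko-niemiecki", HYPHEN)); // [polsko, niemiecki]
        System.out.println(tokenize("polsko-niemiecki", null));   // [polsko-niemiecki]
    }
}
```

Each resulting part can then be looked up in the dictionary separately, which is why a dictionary without hyphenated entries needs the hyphen in its tokenizing pattern.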
-
getRuleMatches
protected List<RuleMatch> getRuleMatches(String word, int startPos, AnalyzedSentence sentence, List<RuleMatch> ruleMatchesSoFar, int idx, AnalyzedTokenReadings[] tokens) throws IOException
Overrides:
getRuleMatches in class MorfologikSpellerRule
Throws:
IOException
-
isNotCompound
Check whether the word is a compound adjective or contains a non-splitting prefix. Used to suppress false positives.
Parameters:
word - Word to be checked.
Returns:
True if the word is not a compound.
Throws:
IOException
Since:
2.5
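The prefix half of this check can be sketched in isolation. The example below is an assumption-laden illustration: the class `PrefixSketch`, the method `hasNonSplittingPrefix`, and the three sample prefixes are hypothetical, and the sketch reproduces neither the compound-adjective test nor the exact return convention of isNotCompound.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration; not the LanguageTool implementation.
final class PrefixSketch {
    // Assumed sample prefixes; the rule's actual prefix set is not shown on this page.
    static final Set<String> PREFIXES = new HashSet<>(Arrays.asList("arcy", "neo", "pre"));

    // True if the word starts with a listed prefix and has a non-empty remainder.
    static boolean hasNonSplittingPrefix(String word) {
        String lower = word.toLowerCase(Locale.ROOT);
        for (String prefix : PREFIXES) {
            if (lower.length() > prefix.length() && lower.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(hasNonSplittingPrefix("neoklasyczny")); // true
        System.out.println(hasNonSplittingPrefix("klasyczny"));    // false
    }
}
```

A word flagged this way should not be offered a suggestion that separates the prefix from the rest of the word.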
-
pruneSuggestions
Remove suggestions that are not really run-on words, using a list of non-word suffixes.
Returns:
A list of pruned suggestions.
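The pruning idea can be sketched as a filter that drops any suggestion whose last space-separated part is a bare morphological ending. This is a simplified illustration under stated assumptions: the class `PruneSketch`, the method `prune`, and the sample suffixes are hypothetical, and plain strings stand in for SuggestedReplacement objects.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration; not the LanguageTool implementation.
final class PruneSketch {
    // Assumed sample endings; the rule's actual bannedSuffixes are not shown on this page.
    static final Set<String> BANNED_SUFFIXES = new HashSet<>(Arrays.asList("ami", "ach", "owie"));

    // Keeps a suggestion unless the text after its last space is a banned suffix.
    static List<String> prune(List<String> suggestions) {
        List<String> kept = new ArrayList<>(suggestions.size());
        for (String suggestion : suggestions) {
            int lastSpace = suggestion.lastIndexOf(' ');
            String tail = lastSpace >= 0 ? suggestion.substring(lastSpace + 1) : null;
            if (tail == null || !BANNED_SUFFIXES.contains(tail)) {
                kept.add(suggestion);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        System.out.println(prune(Arrays.asList("domami", "dom ami", "dom ma"))); // [domami, dom ma]
    }
}
```

Suggestions without a space pass through unchanged, which matches the field description: the endings are banned only when they would stand alone after a space.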
-