Package org.languagetool.language
Class English
java.lang.Object
org.languagetool.Language
org.languagetool.LanguageWithModel
org.languagetool.language.English
- All Implemented Interfaces:
AutoCloseable
- Direct Known Subclasses:
AmericanEnglish,AustralianEnglish,BritishEnglish,CanadianEnglish,NewZealandEnglish,SouthAfricanEnglish
Support for English - use the sub classes
BritishEnglish, AmericanEnglish,
etc. if you need spell checking.
Make sure to call LanguageWithModel.close() after using this (currently only relevant if you make
use of EnglishConfusionProbabilityRule).-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionCreates language specific chunker.Creates language specific disambiguator.Creates language specific sentence tokenizer.createDefaultSpellingRule(ResourceBundle messages) Creates language specific part-of-speech synthesizer.Creates language specific part-of-speech tagger.Creates language specific word tokenizer.filterRuleMatches(List<RuleMatch> ruleMatches, AnnotatedText text, Set<String> enabledRules) This function is called by JLanguageTool before CleanOverlappingFilter removes overlapping ruleMatchesString[]Get this language's country options , e.g.Languages that have country variants need to overwrite this to select their most common variant.protected intInformation about whether the support for this language in LanguageTool is actively maintained.Get the name(s) of the maintainer(s) for this language ornull.getName()Get this language's name in English, e.g.protected intReturns a priority for Rule or Category Id (default: 0).getRelevantLanguageModelCapableRules(ResourceBundle messages, LanguageModel lm, GlobalConfig globalConfig, UserConfig userConfig, Language motherTongue, List<Language> altLanguages) Get a list of rules that can optionally use aLanguageModel.getRelevantLanguageModelRules(ResourceBundle messages, LanguageModel languageModel, UserConfig userConfig) Get a list of rules that require aLanguageModel.getRelevantRules(ResourceBundle messages, UserConfig userConfig, Language motherTongue, List<Language> altLanguages) Get the rules classes that should run for texts in this language.getRemoteEnhancedRules(ResourceBundle messageBundle, List<RemoteRuleConfig> configs, UserConfig userConfig, Language motherTongue, List<Language> altLanguages, boolean inputLogging) For rules whose results are extended using some remote service, e.g.Get this language's character code, e.g.booleanbooleanhasNGramFalseFriendRule(Language motherTongue) Return true if language has ngram-based false friend rule returned byLanguage.getRelevantLanguageModelCapableRules(java.util.ResourceBundle, org.languagetool.languagemodel.LanguageModel, org.languagetool.GlobalConfig, org.languagetool.UserConfig, org.languagetool.Language, java.util.List<org.languagetool.Language>).booleanprepareLineForSpeller(String line) Methods inherited from class org.languagetool.LanguageWithModel
close, getLanguageModel, initLanguageModelMethods inherited from class org.languagetool.Language
adaptSuggestion, adjustMatch, createDefaultJLanguageTool, createDefaultPostDisambiguationChunker, equals, equalsConsiderVariantsIfSpecified, getChunker, getCommonWordsPath, getConsistencyRulePrefix, getDefaultDisabledRulesForVariant, getDefaultEnabledRulesForVariant, getDefaultSpellingRule, getDefaultSpellingRule, getDisambiguationUnifier, getDisambiguationUnifierConfiguration, getDisambiguator, getIgnoredCharactersRegex, getLocale, getLocaleWithCountryAndVariant, getPatternRules, getPostDisambiguationChunker, getRelevantRemoteRules, getRelevantRulesGlobalConfig, getRuleFileNames, getRulePriority, getSentenceTokenizer, getShortCodeWithCountryAndVariant, getSynthesizer, getTagger, getTranslatedName, getUnifier, getUnifierConfiguration, getVariant, getWordTokenizer, hashCode, hasVariant, isExternal, isHiddenFromGui, isSpellcheckOnlyLanguage, isVariant, setChunker, setDisambiguator, setPostDisambiguationChunker, setSentenceTokenizer, setSynthesizer, setTagger, setWordTokenizer, toAdvancedTypography, toString
-
Field Details
-
cache
-
FALSE_FRIENDS_PATTERN
-
id2prio
-
-
Constructor Details
-
English
Deprecated.useAmericanEnglishorBritishEnglishetc. instead - they have rules for spell checking, this class doesn't (deprecated since 3.2)
-
-
Method Details
-
getDefaultLanguageVariant
Description copied from class:LanguageLanguages that have country variants need to overwrite this to select their most common variant.- Overrides:
getDefaultLanguageVariantin classLanguage- Returns:
- default country variant
-
createDefaultSentenceTokenizer
Description copied from class:LanguageCreates language specific sentence tokenizer. This function will be called each time inLanguage.getSentenceTokenizer()if sentence tokenizer is not set.- Overrides:
createDefaultSentenceTokenizerin classLanguage
-
getName
Description copied from class:LanguageGet this language's name in English, e.g.EnglishorGerman (Germany). -
getShortCode
Description copied from class:LanguageGet this language's character code, e.g.enfor English. For most languages this is a two-letter code according to ISO 639-1, but for those languages that don't have a two-letter code, a three-letter code according to ISO 639-2 is returned. The country parameter (e.g. "US"), if any, is not returned.- Specified by:
getShortCodein classLanguage
-
getCountries
Description copied from class:LanguageGet this language's country options , e.g.US(as inen-US) orPL(as inpl-PL).- Specified by:
getCountriesin classLanguage- Returns:
- String[] - array of country options for the language.
-
createDefaultTagger
Description copied from class:LanguageCreates language specific part-of-speech tagger. The tagger must not benull, but it can be a trivial pseudo-tagger that only assignsnulltags. This function will be called each time inLanguage.getTagger()()} if tagger is not set.- Overrides:
createDefaultTaggerin classLanguage
-
createDefaultChunker
Description copied from class:LanguageCreates language specific chunker. This function will be called each time inLanguage.getChunker()if chunker is not set.- Overrides:
createDefaultChunkerin classLanguage
-
createDefaultSynthesizer
Description copied from class:LanguageCreates language specific part-of-speech synthesizer. This function will be called each time inLanguage.getSynthesizer()if synthesizer is not set.- Overrides:
createDefaultSynthesizerin classLanguage
-
createDefaultDisambiguator
Description copied from class:LanguageCreates language specific disambiguator. This function will be called each time inLanguage.getDisambiguator()if disambiguator is not set.- Overrides:
createDefaultDisambiguatorin classLanguage
-
createDefaultWordTokenizer
Description copied from class:LanguageCreates language specific word tokenizer. This function will be called each time inLanguage.getWordTokenizer()if word tokenizer is not set.- Overrides:
createDefaultWordTokenizerin classLanguage
-
getMaintainers
Description copied from class:LanguageGet the name(s) of the maintainer(s) for this language ornull.- Specified by:
getMaintainersin classLanguage
-
getMaintainedState
Description copied from class:LanguageInformation about whether the support for this language in LanguageTool is actively maintained. If not, the user interface might show a warning.- Overrides:
getMaintainedStatein classLanguage
-
getRelevantRules
public List<Rule> getRelevantRules(ResourceBundle messages, UserConfig userConfig, Language motherTongue, List<Language> altLanguages) throws IOException Description copied from class:LanguageGet the rules classes that should run for texts in this language.- Specified by:
getRelevantRulesin classLanguage- Throws:
IOException
-
getRelevantLanguageModelRules
public List<Rule> getRelevantLanguageModelRules(ResourceBundle messages, LanguageModel languageModel, UserConfig userConfig) throws IOException Description copied from class:LanguageGet a list of rules that require aLanguageModel. Returns an empty list for languages that don't have such rules.- Overrides:
getRelevantLanguageModelRulesin classLanguage- Throws:
IOException
-
getRelevantLanguageModelCapableRules
public List<Rule> getRelevantLanguageModelCapableRules(ResourceBundle messages, @Nullable LanguageModel lm, GlobalConfig globalConfig, UserConfig userConfig, Language motherTongue, List<Language> altLanguages) throws IOException Description copied from class:LanguageGet a list of rules that can optionally use aLanguageModel. Returns an empty list for languages that don't have such rules.- Overrides:
getRelevantLanguageModelCapableRulesin classLanguage- Parameters:
lm- null if no language model is available- Throws:
IOException
-
hasNGramFalseFriendRule
Description copied from class:LanguageReturn true if language has ngram-based false friend rule returned byLanguage.getRelevantLanguageModelCapableRules(java.util.ResourceBundle, org.languagetool.languagemodel.LanguageModel, org.languagetool.GlobalConfig, org.languagetool.UserConfig, org.languagetool.Language, java.util.List<org.languagetool.Language>).- Overrides:
hasNGramFalseFriendRulein classLanguage
-
getOpeningDoubleQuote
- Overrides:
getOpeningDoubleQuotein classLanguage- Since:
- 5.1
-
getClosingDoubleQuote
- Overrides:
getClosingDoubleQuotein classLanguage- Since:
- 5.1
-
getOpeningSingleQuote
- Overrides:
getOpeningSingleQuotein classLanguage- Since:
- 5.1
-
getClosingSingleQuote
- Overrides:
getClosingSingleQuotein classLanguage- Since:
- 5.1
-
isAdvancedTypographyEnabled
public boolean isAdvancedTypographyEnabled()- Overrides:
isAdvancedTypographyEnabledin classLanguage- Since:
- 5.1
-
getDefaultRulePriorityForStyle
protected int getDefaultRulePriorityForStyle()- Overrides:
getDefaultRulePriorityForStylein classLanguage
-
getPriorityMap
- Overrides:
getPriorityMapin classLanguage
-
getPriorityForId
Description copied from class:LanguageReturns a priority for Rule or Category Id (default: 0). Positive integers have higher priority. Negative integers have lower priority.- Overrides:
getPriorityForIdin classLanguage
-
getRemoteEnhancedRules
public Function<Rule,Rule> getRemoteEnhancedRules(ResourceBundle messageBundle, List<RemoteRuleConfig> configs, UserConfig userConfig, Language motherTongue, List<Language> altLanguages, boolean inputLogging) throws IOException Description copied from class:LanguageFor rules whose results are extended using some remote service, e.g.BERTSuggestionRanking- Overrides:
getRemoteEnhancedRulesin classLanguage- Returns:
- function that transforms old rule into remote-enhanced rule
- Throws:
IOException
-
hasMinMatchesRules
public boolean hasMinMatchesRules()- Overrides:
hasMinMatchesRulesin classLanguage
-
createDefaultSpellingRule
- Overrides:
createDefaultSpellingRulein classLanguage- Throws:
IOException
-
filterRuleMatches
public List<RuleMatch> filterRuleMatches(List<RuleMatch> ruleMatches, AnnotatedText text, Set<String> enabledRules) Description copied from class:LanguageThis function is called by JLanguageTool before CleanOverlappingFilter removes overlapping ruleMatches- Overrides:
filterRuleMatchesin classLanguage- Returns:
- filtered ruleMatches
-
prepareLineForSpeller
- Overrides:
prepareLineForSpellerin classLanguage
-
getMultitokenSpeller
- Overrides:
getMultitokenSpellerin classLanguage
-
AmericanEnglishorBritishEnglishetc.