Class MixedAlphabetsRule

java.lang.Object
org.languagetool.rules.Rule
org.languagetool.rules.uk.MixedAlphabetsRule

public class MixedAlphabetsRule extends Rule
A rule that matches words Latin and Cyrillic characters in them
  • Field Details

    • LIKELY_LATIN_NUMBER

      private static final Pattern LIKELY_LATIN_NUMBER
    • LATIN_NUMBER_WITH_CYRILLICS

      private static final Pattern LATIN_NUMBER_WITH_CYRILLICS
    • MIXED_ALPHABETS

      private static final Pattern MIXED_ALPHABETS
    • CYRILLIC_ONLY

      private static final Pattern CYRILLIC_ONLY
    • LATIN_ONLY

      private static final Pattern LATIN_ONLY
    • COMMON_CYR_LETTERS

      private static final Pattern COMMON_CYR_LETTERS
    • CYRILLIC_FIRST_LETTER

      private static final Pattern CYRILLIC_FIRST_LETTER
    • toLatMap

      private static final Map<Character,Character> toLatMap
    • toCyrMap

      private static final Map<Character,Character> toCyrMap
    • cyrChars

      private static final String cyrChars
      See Also:
    • latChars

      private static final String latChars
      See Also:
    • umlauts

      private static final String[] umlauts
    • umlautsReplace

      private static final String[] umlautsReplace
  • Constructor Details

  • Method Details

    • getId

      public final String getId()
      Description copied from class: Rule
      A string used to identify the rule in e.g. configuration files. This string is supposed to be unique and to stay the same in all upcoming versions of LanguageTool. It's supposed to contain only the characters A-Z and the underscore.
      Specified by:
      getId in class Rule
    • getDescription

      public String getDescription()
      Description copied from class: Rule
      A short description of the error this rule can detect, usually in the language of the text that is checked.
      Specified by:
      getDescription in class Rule
    • getShort

      private String getShort()
    • match

      public final RuleMatch[] match(AnalyzedSentence sentence)
      Description copied from class: Rule
      Check whether the given sentence matches this error rule, i.e. whether it contains the error detected by this rule. Note that the order in which this method is called is not always guaranteed, i.e. the sentence order in the text may be different from the order in which you get the sentences (this may be the case when LanguageTool is used as a LibreOffice/OpenOffice add-on, for example). In other words, implementations must be stateless, so that a previous call to this method has no influence on later calls.
      Specified by:
      match in class Rule
      Parameters:
      sentence - a pre-analyzed sentence
      Returns:
      an array of RuleMatch objects
    • likelyBadLatinI

      private boolean likelyBadLatinI(AnalyzedTokenReadings[] tokens, int i)
    • adjustForInvalidSuffix

      private String adjustForInvalidSuffix(String tokenString)
    • adjustForInvalidSuffix

      private String adjustForInvalidSuffix(String tokenString, String msg)
    • toLatinLeftOnly

      private String toLatinLeftOnly(String tokenString)
    • createRuleMatch

      private RuleMatch createRuleMatch(AnalyzedTokenReadings readings, List<String> replacements, String msg, AnalyzedSentence sentence)
    • toCyrillic

      private static String toCyrillic(String word)
    • toLatin

      private static String toLatin(String word)