Class PortugueseTagger

java.lang.Object
org.languagetool.tagging.BaseTagger
org.languagetool.tagging.pt.PortugueseTagger
All Implemented Interfaces:
Tagger

public class PortugueseTagger extends BaseTagger
Portuguese part-of-speech tagger, based on the English tagger.
  • Field Details

    • ADJ_PART_FS

      private static final Pattern ADJ_PART_FS
    • VERB

      private static final Pattern VERB
    • PREFIXES_FOR_VERBS

      private static final Pattern PREFIXES_FOR_VERBS
    • ORDINAL_SUFFIX_MASC

      private static final String ORDINAL_SUFFIX_MASC
    • ORDINAL_SUFFIX_FEM

      private static final String ORDINAL_SUFFIX_FEM
    • ORDINAL_SUFFIX_PL

      private static final String ORDINAL_SUFFIX_PL
    • ORDINAL_SUFFIXES

      private static final String ORDINAL_SUFFIXES
    • ORDINAL_PATTERN

      private static final Pattern ORDINAL_PATTERN
    • ORDINAL_MASC_SG

      private static final Pattern ORDINAL_MASC_SG
    • ORDINAL_FEM_SG

      private static final Pattern ORDINAL_FEM_SG
    • ORDINAL_MASC_PL

      private static final Pattern ORDINAL_MASC_PL
    • ORDINAL_FEM_PL

      private static final Pattern ORDINAL_FEM_PL
    • PERCENT_PATTERN

      private static final Pattern PERCENT_PATTERN
    • DEGREE_PATTERN

      private static final Pattern DEGREE_PATTERN
  • Constructor Details

    • PortugueseTagger

      public PortugueseTagger()
  • Method Details

    • tagNumberExpressions

      private List<AnalyzedToken> tagNumberExpressions(String word)
    • buildMascPlNoun

      private AnalyzedToken buildMascPlNoun(String word)
    • buildOrdinalTokens

      private List<AnalyzedToken> buildOrdinalTokens(String word)
    • isOrdinal

      private boolean isOrdinal(String word)
    • isPercent

      private boolean isPercent(String word)
    • isDegree

      private boolean isDegree(String word)
    • tag

      public List<AnalyzedTokenReadings> tag(List<String> sentenceTokens)
      Description copied from interface: Tagger
      Returns a list of AnalyzedTokenReadings, one per token in the sentence, each carrying part-of-speech information for that token (not necessarily just one tag).

      Note that this method takes exactly one sentence. An implementation may handle the first word of the sentence as a special case, since it is usually written with an uppercase letter.

      Specified by:
      tag in interface Tagger
      Overrides:
      tag in class BaseTagger
      Parameters:
      sentenceTokens - the text as returned by a WordTokenizer
    • tagMenteAdverbs

      private List<AnalyzedToken> tagMenteAdverbs(String word, String lowerWord, morfologik.stemming.IStemmer dictLookup)
    • tagPrefixedVerbs

      private List<AnalyzedToken> tagPrefixedVerbs(String word, morfologik.stemming.IStemmer dictLookup)
    • addTokens

      private void addTokens(List<AnalyzedToken> taggedTokens, List<AnalyzedToken> l)
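
The contract described for tag above (one sentence in, one result per token out, possibly several tags per token, optional special handling of the capitalized first word) can be illustrated with a small self-contained sketch. This is not LanguageTool's implementation: the class TagContractSketch, the toy lexicon, and the tag names are all hypothetical, and plain string lists stand in for AnalyzedTokenReadings. With the real class, the equivalent call would be new PortugueseTagger().tag(sentenceTokens).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class TagContractSketch {

  // Hypothetical toy lexicon (lowercase word -> possible tags).
  // The words and tag names are invented for this sketch only.
  private static final Map<String, List<String>> TOY_LEXICON = Map.of(
      "o", List.of("DET", "PRON"),
      "gato", List.of("NOUN"),
      "dorme", List.of("VERB"));

  // Takes the tokens of exactly one sentence and returns one tag list per token.
  public static List<List<String>> tag(List<String> sentenceTokens) {
    List<List<String>> result = new ArrayList<>();
    for (int i = 0; i < sentenceTokens.size(); i++) {
      String token = sentenceTokens.get(i);
      List<String> tags = TOY_LEXICON.get(token);
      // Special case for the first word: retry in lowercase, because a
      // sentence-initial word is usually capitalized.
      if (tags == null && i == 0) {
        tags = TOY_LEXICON.get(token.toLowerCase(Locale.ROOT));
      }
      // Unknown tokens still get an (empty) entry, so output and input align.
      result.add(tags == null ? List.of() : tags);
    }
    return result;
  }

  public static void main(String[] args) {
    // prints [[DET, PRON], [NOUN], [VERB], []]
    System.out.println(tag(List.of("O", "gato", "dorme", "bem")));
  }
}
```

The output list always has the same length as the input, which is why the method is documented as taking a single sentence: position 0 is the only place where the sketch applies the uppercase fallback.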