Class EsperantoWordTokenizer

java.lang.Object
org.languagetool.tokenizers.WordTokenizer
org.languagetool.tokenizers.eo.EsperantoWordTokenizer
All Implemented Interfaces:
Tokenizer

public class EsperantoWordTokenizer extends WordTokenizer
  • Field Details

    • PATTERN_1

      private static final Pattern PATTERN_1
    • PATTERN_2

      private static final Pattern PATTERN_2
  • Constructor Details

    • EsperantoWordTokenizer

      public EsperantoWordTokenizer()
  • Method Details

    • tokenize

      public List<String> tokenize(String text)
      Tokenizes just like WordTokenizer with the exception that words such as "dank'" contain an apostrophe within it.
      Specified by:
      tokenize in interface Tokenizer
      Overrides:
      tokenize in class WordTokenizer
      Parameters:
      text - - Text to tokenize
      Returns:
      List of tokens. Note: a special string EO@APOS is used to replace apostrophe during tokenizing.