Class MultiWordChunker

java.lang.Object
org.languagetool.tagging.disambiguation.AbstractDisambiguator
org.languagetool.tagging.disambiguation.MultiWordChunker
All Implemented Interfaces:
Disambiguator

public class MultiWordChunker extends AbstractDisambiguator
Multiword tagger-chunker.
Author:
Marcin MiƂkowski
  • Constructor Details

    • MultiWordChunker

      public MultiWordChunker(String filename)
      Parameters:
      filename - file text with multiwords and tags
    • MultiWordChunker

      public MultiWordChunker(String filename, boolean allowFirstCapitalized)
      Parameters:
      filename - file text with multiwords and tags
      allowFirstCapitalized - if set to true, first word of the multiword can be capitalized
  • Method Details

    • disambiguate

      public final AnalyzedSentence disambiguate(AnalyzedSentence input)
      Implements multiword POS tags, e.g., <ELLIPSIS> for ellipsis (...) start, and </ELLIPSIS> for ellipsis end.
      Parameters:
      input - The tokens to be chunked.
      Returns:
      AnalyzedSentence with additional markers.