All Classes and Interfaces
Class
Description
A dialog with version and copyright information.
A rule that checks case in phrases
Checks that compounds (if in the list) are not written as separate words.
Another use of the compounds file -- check for compounds written with
dashes instead of hyphens (for example, Rabka — Zdrój).
Accepts rule matches if a date doesn't match the accompanying weekday, e.g.
Accepts rule matches if a date doesn't match the accompanying weekday, e.g.
Abstract Disambiguator class to provide default (empty) implementation
for
Disambiguator.preDisambiguate(AnalyzedSentence).A rule that gives hints about the use of filler words.
Checks if a given date is in the future
Used for mismatch detection between verb tense and a given date
Accepts rule matches if we are in the first days of a new year and the user
may have entered a date with the old year (but not a date in December).
An Abstract Pattern Rule that describes a pattern of words or part-of-speech tags
used for PatternRule and DisambiguationPatternRule.
A rule that matches "..", "::", "-," but not "...", "!..", "?!!", ",-" etc.
Adds simple replacement using lemmas
A rule that matches words which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests correct ones instead.
An abstract rule that checks if there is a missing space before some conjunctions.
A rule that matches words which need a specific upper/lowercase spelling.
A rule that gives hints when a defined condition is fulfilled
and the percentage of hints in the text exceeds the given limit.
A rule that gives hints when a defined condition is fulfilled
and the percentage of hints in the text exceeds the given limit.
An abstract rule checks the appearance of same words in a sentence or in two consecutive sentences.
The method gives stylistic hints that a word is being used too often
when the set percentage has been exceeded.(default off).
A base class for
PatternToken-based rules.Represents possible values of a
PatternToken's lemma or text.Base class providing support for detecting, parsing and converting between measurements in different units
A rule that matches words for which two different spellings are used
throughout the document.
Rule for detecting same words in the sentence but not just in a row.
Filter that maps suggestion from adverb to adjective.
Simple agreement checker for German noun phrases.
Simple agreement checker for German noun phrases.
Create suggestions for German noun phrases that lack agreement.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
One reading of a German word.
A sentence that has been tokenized and analyzed.
A word (or punctuation, or whitespace) and its analysis (part-of-speech tag, lemma)
An array of
AnalyzedTokens used to store multiple POS tags and lemmas
for a given single token.A text with markup and with a mapping so error positions will refer to the original
position that includes the markup, even though only the plain text parts are checked.
Use this builder to create input of text with markup for LanguageTool, so that it
can check only the plain text parts and ignore the markup, yet still calculate the
positions of errors so that they refer to the complete text, including markup.
An internal marker for places in the source where the API would need to be cleaned up.
Handle requests to
/v2/ of the HTTP API.Support for Arabic.
Filter that maps suggestion from adverb to adjective.
A rule that matches words which should not be used and suggests correct ones instead.
Arabic localization of
AbstractDateCheckFilter.A rule that matches words which are complex and suggests easier to understand alternatives.
Date filter that expects a 'date' argument in the format 'dd-mm-yyyy'.
A rule that matches words which are homophones and suggests easier to understand alternatives.
Hybrid chunker for Arabic.
Filter that maps suggestion from adverb to adjective.
Filter that maps suggestion for numeric phrases.
A rule that matches redundant expression.
Tools for working with arabic strings.
Arabic word form synthesizer.
Filter that maps suggestion from adverb to adjective.
Arabic version of
AbstractWordCoherencyRuleA rule that matches wordy expressions.
Word repeat rule for َArabic, to avoid false alarms in the generic word repetition rule.
Takes correct sentences, introduces errors (e.g.
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
Asturian part-of-speech tagger.
Exception thrown if user is not authenticated.
Data for
AvsAnRule.Check if the determiner (if any) preceding a word is:
an if the next word starts with a vowel
a if the next word does not start with a vowel
This rule loads some exceptions from external files
det_a.txt and
det_an.txt (e.g.Exception thrown if the request is malformed.
The algorithm of a language model, independent of the way data
is stored (see sub classes for that).
Base tagger using Morfologik binary dictionaries.
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
A rule that matches words which need a specific upper/lowercase spelling.
Specific to Belarusian: apostrophes (', ’, ʼ) are part of the
word.
German / English translator.
A stub to allow clients to do synchronous rpc calls to service BertLm.
A stub to allow clients to do ListenableFuture-style rpc calls to service BertLm.
Base class for the server implementation of the service BertLm.
A stub to allow clients to do asynchronous rpc calls to service BertLm.
Protobuf type
bert.BatchBertLmResponseProtobuf type
bert.BatchBertLmResponseProtobuf type
bert.BatchScoreRequestProtobuf type
bert.BatchScoreRequestProtobuf type
bert.BertLmResponseProtobuf type
bert.BertLmResponseProtobuf type
bert.MaskProtobuf type
bert.MaskProtobuf type
bert.PredictionProtobuf type
bert.PredictionProtobuf type
bert.ScoreRequestProtobuf type
bert.ScoreRequestreorder suggestions from another rule using BERT as a LM
A bitext pattern rule class.
Loads
PatternRules from an XML file.Interface for classes that implement reading from bitext files,
such as translation memory files, glossary files, aligned text...
Abstract bitext rule class.
A rule that matches words which should not be used and suggests correct ones instead.
Checks that compounds (if in the list) are not written as separate words.
Breton Tagger.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
Helper to load text files from classpath.
Check that adjectives and verbs are not written with an uppercase
first letter (except at the start of a sentence) and cases
like this:
Das laufen fällt mir leicht. (laufen needs
to be uppercased).Antipatterns for
CaseRule.A rule that matches words or phrases which should not be used and suggests
correct ones instead.
Hybrid chunker-disambiguator for Catalan
Catalan word form synthesizer.
Catalan Tagger
Adds Catalan suggestions to
WordRepeatBeginningRule.Avoid false alarms in the word repetition rule.
Tokenizes a sentence into words.
Pre-defined rule categories.
A rule's category.
Identifier for a category.
Category ids.
Configuration for checking a text with
RemoteLanguageTool.Builder for a
CheckConfiguration.Assign chunks (i.e.
The name of a chunk.
Keep a central registry of circuit breakers so that we can expose metrics for all of them
Is responsible for loading the necessary classes for LanguageTool
library.
Tags a word using two taggers, combining their results.
A rule that matches periods, commas and closing parenthesis preceded by whitespace and
opening parenthesis followed by whitespace.
Information about common words - use as a fallback if language identification yields low confidence.
Accept Dutch compounds that are not accepted by the speller.
A spell checker that combines Hunspell and Morfologik spell checking
to support compound words and offer fast suggestions for some misspelled
compound words.
Check consistent spelling for German compounds.
Helps to write down compound word tagging results so we can analyze the algorithm effectiveness
Checks the compound spelling of infinitive clause (Erweiterter Infinitiv mit zu)
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Checks that compounds (if in the list) are not written as separate words.
Data about words that are compounds and should thus not be written
as separate words.
Allows to tag compound words with hyphen dynamically by analyzing each part
Interface for components that take compound words and split
them into their parts.
Configuration like list of disabled rule IDs, server mode etc.
Dialog that offers the available rules so they can be turned on/off
individually.
Two words that can easily be confused - for internal use only.
Load data for
.
invalid reference
AccentuationCheckRule
Load data for
.
invalid reference
AccentuationCheckRule
Load data for
.
invalid reference
AccentuationCheckRule
LanguageTool's homophone confusion check that uses ngram lookups
to decide which word in a confusion set (from
confusion_sets.txt) suits best.Words that can easily be confused - for internal use only.
Loads a confusion set from a plain text file (UTF-8).
A string in a
ConfusionPair - for internal use only.A rule that gives Hints about the use of German filler words.
Check for consistent use of apostrophes (typographic (’) vs.
Helper class to mark errors in text.
A rule that matches words or phrases which should not be used and suggests
correct ones instead, e.g.
A person that contributed rules or code to LanguageTool.
Constants for contributors who contribute to more than one language (use to avoid duplication).
A text, typically a sentence, that doesn't contain an error,
at least not one for a specific rule.
Find correction loops, i.e.
Support for CrimeanTatar
Crimean Tatar word form synthesizer
CrimeanTatar Part-of-speech tagger.
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
Danish Part-of-speech tagger.
Prüft, dass in Bindestrich-Komposita kein Leerzeichen eingefügt wird (wie z.B.
Check for compounds written with dashes instead of hyphens (for example, Rabka — Zdrój).
Encapsulate database access.
Encapsulate database access.
Breton localization of
AbstractDateCheckFilter.Catalan localization of
AbstractDateCheckWithSuggestionsFilter.German localization of
AbstractDateCheckFilter.English localization of
AbstractDateCheckFilter.Esperanto localization of
AbstractDateCheckFilter.Spanish localization of
AbstractDateCheckFilter.French localization of
AbstractDateCheckFilter.Italian localization of
AbstractDateCheckFilter.Dutch localization of
AbstractDateCheckWithSuggestionsFilter.Polish localization of
AbstractDateCheckFilter.Russian localization of
AbstractDateCheckFilter.Ukrainian localization of
AbstractDateCheckFilter.Checks whether the date range is valid, i.e., that the starting
date happens before the end date.
A rule that matches words in the dative plural and suggests
the common plural instead.
The filter is used to create suggestions for the DATA_DEKADY rule.
Identify the language of a text.
Responsible for getting any items from LanguageTool's resource
directories.
Trivial disambiguator.
A simple demo rule as an example for how to implement your own Java-based
rule in LanguageTool.
A trivial tagger that does nothing than assign null
tags to words.
Data for
DhaNoBeirtRule.An item from a user's dictionary, fetched from a database.
Filter spelling error with phrases the users wants to have accepted.
Checks if the translation has a really different length than the source
(smaller than 30% or longer by 250%).
Checks if the translation has the same ending punctuation as the source.
Disambiguated example.
A Rule that describes a pattern of words or part-of-speech tags used for
disambiguation.
Possible disambiguator actions.
Loads
DisambiguationPatternRules from a disambiguation rules XML
file.Disambiguator interface.
Date filter that expects a 'date' argument in the format 'dd-mm-yyyy'.
A rule that matches ".." (but not "..." etc) and ",,".
Hybrid chunker-disambiguator for Dutch
Dutch word form synthesizer.
Dutch tagger.
Coherent use of Du/du, Dich/dich etc.
A rule that checks for empty lines.
Support for English - use the sub classes
BritishEnglish, AmericanEnglish,
etc.OpenNLP-based chunker.
Our chunker detects noun phrases but not whether they are singular
or plural noun phrases.
A rule that matches words or phrases which should not be used and suggests
correct ones instead, e.g.
Check for compounds written with dashes instead of hyphens.
A rule that matches words which require specific diacritics (e.g,
a la instead of à la).False friends for Dutch native speakers who write English text, based on ngrams.
False friends for French native speakers who write English text, based on ngrams.
False friends for German native speakers who write English text, based on ngrams.
False friends for non-native speakers who write English text, based on ngrams.
False friends for Spanish native speakers who write English text, based on ngrams.
A rule that matches words which should not be used and suggests
correct ones instead.
Hybrid chunker-disambiguator for English.
A
PartialPosTagFilter for English that also runs the disambiguator.A rule that matches words which are complex and suggests easier to understand alternatives.
A rule that matches redundant expression.
A rule that matches words which need a specific upper/lowercase spelling.
English word form synthesizer.
English Part-of-speech tagger.
Adds a list English adverbs to
WordRepeatBeginningRule.Word repeat rule for English, to avoid false alarms in the generic word repetition rule.
Limit the maximum number of request per IP address for a given time range.
A text, typically a sentence, that doesn't contain an error,
but still leads to an error.
A part-of-speech tagger for Esperanto.
Helper class to create error examples.
An experimental feature that may change without warning in future versions.
Used via Jackson-databind + myBatis -> return info from DB via JSON in /users/me route
Loads extra helper dictionaries in plain text format
A pattern rule for finding false friends.
Loads
PatternRules from a false friends XML file.Loads the false friend rules as bitext pattern rules.
Hybrid chunker-disambiguator for French
A
PartialPosTagFilter for French that also runs the disambiguator.French word form synthesizer.
French Tagger
Based on Dicollecte (http://www.dicollecte.org/) implemented in FSA.
Tokenizes a sentence into words.
A rule that matches words which should not be used and suggests correct ones instead.
Hybrid chunker-disambiguator for Portuguese
A rule that matches redundant expression.
Galician word form synthesizer.
Galician Part-of-speech tagger.
A rule that matches common Wikipedia errors.
A rule that matches wordy expressions.
Tokenizes a sentence into words.
Rule that finds unpaired quotes, brackets etc.
Rule that finds unpaired quotes
Support for German - use the sub classes
GermanyGerman, SwissGerman, or AustrianGerman
if you need spell checking.A rule-based German chunker for noun phrases.
Checks that compounds are not written as separate words.
Split German nouns using the jWordSplitter library.
Double punctuation rule with German-specific error message.
A rule that gives Hints about the use of German filler words.
Helper methods for working with German POS tags.
Check if to paragraphs begin with the same word.
A rule that checks the readability of German text (using the Flesch-Reading-Ease Formula)
If tooEasyTest == true, the rule tests if paragraph level > level (readability is too easy)
If tooEasyTest == false, the rule tests if paragraph level < level (readability is too difficult)
A rule checks the appearance of same words in a sentence or in two consecutive sentences.
German word form synthesizer.
German part-of-speech tagger, requires data file in
de/german.dict in the classpath.Constants used to describe the properties of German tokens.
Extends
WordRepeatBeginningRule by a list of German adverbs ("Konjunktionaladverbien")
for a more precise error message.Check if a word is repeated twice, taking into account an exception
for German where e.g.
Tokenize sentences to tokens like Google does for its ngram index.
A token as tokenized in the Google ngram index.
Queries a local Grammalecte server.
A rule that matches redundant expressions.
A rule that matches words which need a specific upper/lowercase spelling.
Adds a list Greek adverbs to
WordRepeatBeginningRule.This class is a scanner generated by
JFlex 1.4.3
on 6/12/19 12:42 AM from the specification file
/home/pminos/NetBeansProjects/workspace-languagetool/languagetool-parent/languagetool-language-modules/el/src/main/jflex/GreekWordTokenizerImpl.jflexBase class fur rules running on external servers;
see gRPC service definition in languagetool-core/src/main/proto/ml_server.proto
See #create(Language, ResourceBundle, RemoteRuleConfig, boolean, String, String, Map) for an easy way to add rules; return rule in Language::getRelevantRemoteRules
add it like this:
Internal rule to create rule matches with IDs based on Match Sub-IDs
A rule that finds hidden characters in the text
A small embedded HTTP server that checks text.
A small embedded HTTPS server that checks text.
Like
HunspellRule, but does not offer suggestions for incorrect words
as that is very slow with Hunspell.A hunspell-based spellchecking-rule.
XML Filter that ignores whitespace-only nodes.
A text, typically a pair of sentences that contains an error.
A text, typically a sentence, that contains an error.
Inflects English words according to a German POS tag.
Checks if INN number is incorrect
For internal use only.
Specific to
KOMMA_ZWISCHEN_HAUPT_UND_NEBENSATZ - helps setting the comma suggestion, if easily possible.CharSequence that noticed thread interrupts -- as might be necessary
to recover from a loose regex on unexpected challenging input.
A rule that matches words which should not be used and suggests
correct ones instead.
Hybrid chunker-disambiguator for Irish
A
PartialPosTagFilter for Irish that also runs the disambiguator.A rule that matches words which should not be used and suggests
correct ones instead.
A rule that matches words which need a specific upper/lowercase spelling.
Irish word form synthesizer.
Irish POS tagger.
Italian word form synthesizer.
Italian tagger.
Avoid false alarms in the word repetition rule.
Some constants for Localization Quality Issue Type from the
Internationalization Tag Set (ITS) Version 2.0.
The main class used for checking text against different rules:
built-in Java rules (for English: a vs.
Callback for checking if result of
JLanguageTool.check(String) is still needed.Constants for correct paragraph-rule handling.
JNA helper tools.
A rule that matches words which should not be used and suggests correct ones instead.
A Khmer rule that checks if there is a missing space before some conjunctions.
Khmer Part-of-speech tagger.
Check if a word is repeated in Khmer, e.g.
Tokenizes a sentence into words.
Base class for any supported language (English, German, etc).
Detect language per token and add that to the analysis.
Create a language by specifying the language's XML rule file.
Information about whether a language in LanguageTool is actively maintained.
A language model that provides information about ngram probabilities.
Helper methods to list all supported languages and to get language objects
by their name or language code etc.
Helps to tag numeric adjectives with letter ending
Expand lines according to their suffix, e.g.
Dummy class for UserConfig
has to be overridden by concrete linguistic service e.g.
A rule that matches words which should not be used and suggests
correct ones instead.
A rule that warns on long paragraphs.
A rule that warns on long sentences.
A rule that warns on long sentences.
ThreadPoolExecutor with some stopping logic for OOM and metrics tracking
Like
LuceneSingleIndexLanguageModel, but can merge the results of
lookups in several independent indexes to one result.Information about ngram occurrences, taken from Lucene indexes (one index per ngram level).
A synthesizer that reads the inflected form and POS information from a plain (UTF-8) text file.
A tagger that reads the POS information from a plain (UTF-8) text file.
A fake markup data object giving information on the impact of the fake markup.
A
Match is the configuration of an algorithm used to match AnalyzedTokenReadingss.Possible string case conversions.
The state of a matching process.
A rule checks a sentence for a missing comma before or after a relative clause (only for German language).
Checks that a sentence contains at least one verb.
A rule that matches words Latin and Cyrillic characters in them
A stub to allow clients to do synchronous rpc calls to service MLServer.
A stub to allow clients to do ListenableFuture-style rpc calls to service MLServer.
Base class for the server implementation of the service MLServer.
A stub to allow clients to do asynchronous rpc calls to service MLServer.
Protobuf type
lt_ml_server.AnalyzedMatchRequestProtobuf type
lt_ml_server.AnalyzedMatchRequestProtobuf type
lt_ml_server.AnalyzedSentenceProtobuf type
lt_ml_server.AnalyzedSentenceProtobuf type
lt_ml_server.AnalyzedTokenProtobuf type
lt_ml_server.AnalyzedTokenProtobuf type
lt_ml_server.AnalyzedTokenReadingsProtobuf type
lt_ml_server.AnalyzedTokenReadings
NOTE: should this be split this up into different packages?
NOTE: should this be split this up into different packages?
Protobuf type
lt_ml_server.AnalyzeResponseProtobuf type
lt_ml_server.AnalyzeResponseProtobuf type
lt_ml_server.MatchProtobuf type
lt_ml_server.MatchProtobuf enum
lt_ml_server.Match.MatchTypeProtobuf type
lt_ml_server.MatchListProtobuf type
lt_ml_server.MatchListProtobuf type
lt_ml_server.MatchRequestProtobuf type
lt_ml_server.MatchRequestProtobuf type
lt_ml_server.MatchResponseProtobuf type
lt_ml_server.MatchResponseProtobuf type
lt_ml_server.PostProcessingRequestProtobuf type
lt_ml_server.PostProcessingRequestProtobuf type
lt_ml_server.ProcessingOptionsProtobuf type
lt_ml_server.ProcessingOptionsProtobuf enum
lt_ml_server.ProcessingOptions.LevelProtobuf type
lt_ml_server.ProcessRequestProtobuf type
lt_ml_server.ProcessRequestProtobuf type
lt_ml_server.ProcessResponseProtobuf type
lt_ml_server.ProcessResponseProtobuf type
lt_ml_server.RuleProtobuf type
lt_ml_server.RuleProtobuf enum
lt_ml_server.Rule.TagProtobuf type
lt_ml_server.RuleCategoryProtobuf type
lt_ml_server.RuleCategoryProtobuf type
lt_ml_server.SuggestedReplacementProtobuf type
lt_ml_server.SuggestedReplacementProtobuf enum
lt_ml_server.SuggestedReplacement.SuggestionTypeDeprecated.
since 4.4, use GermanSpellerRule
Morfologik speller that merges results from binary (.dict) and plain text
(.txt) dictionaries.
An experimental rule for spell check (only yo variant)(set default off).
Morfologik-based spell checker.
Tags a word using a Morfologik binary dictionary.
A map containing multiple values per key, memory-optimized for case when there's only one value.
A very simple property-style configuration.
Combines the results of several
LanguageModels.Check if there is duplicated whitespace in a sentence.
A variant of
JLanguageTool that uses several threads for rule matching.Multiword tagger-chunker.
Multiword tagger-chunker.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
LanguageTool's probability check that uses ngram lookups
to decide if an ngram of the input text is so rare in our
ngram index that it should be considered an error.
A
PartialPosTagFilter for English that does not run the disambiguator.A
PartialPosTagFilter for Irish that does not run the disambiguator.A
PartialPosTagFilter for Portuguese that does not run the disambiguator.A
PartialPosTagFilter for Russian that does not run the disambiguator.A rule that gives Hints about the use of non-significant verbs.
No-op chunker.
A language that is part of languagetool-core but that hasn't any rules.
Rule for checking correct spell of ordinal numerals.
Finds spellings that were only correct in the pre-reform orthography.
Fixes the suggestion for ordinal suffixes, like "1nd" -> "1st".
Check if to paragraphs begin with the same word.
Filters rule matches so that only matches are kept where a part of the
token has a given POS tag.
A rule that gives Hints about the use of German filler words.
Authenticator that extracts username and password from URL, e.g.
Thrown when URL path or rule requested etc cannot be found.
A Rule that describes a language error as a simple pattern of words or of
part-of-speech tags.
A
PatternRule's id with an optional sub-id.Loads
PatternRules from an XML file.Matches a pattern rule against text.
Allows transformation of pattern rules to add additional matching/filtering logic
Used in
JLanguageTool.transformPatternRules(java.util.List<org.languagetool.rules.patterns.AbstractPatternRule>, org.languagetool.Language)
Return a value if a transform is applied, none otherwiseMakes XML definition of rules accessible as strings.
A part of a pattern, represents the 'token' element of the
grammar.xml.Fields that are null in most instances of PatternToken
Helper to build
PatternTokens.A rule that matches words which should not be used and suggests
correct ones instead.
Support for Persian.
A rule that matches periods, commas and closing parenthesis preceded by whitespace and
opening parenthesis followed by whitespace.
A rule that matches ".." (but not "..." etc) and "،،".
A Persian rule that checks if there is a missing space before some conjunctions.
List of Persian adverbs for WordRepeatBeginningRule
Word repeat rule for English, to avoid false alarms in the generic word repetition rule.
Wrapper for JLanguageTool instances that can be made immutable.
Caches pre-configured JLanguageTool instances to avoid costly setup time of rules, etc.
Hybrid chunker-disambiguator for Polish.
Polish word form synthesizer.
Polish POS tagger based on FSA morphological dictionaries.
A rule that matches words which should not be used and suggests correct ones instead.
Post-spelling-reform Portuguese.
This rule checks if a word without graphical accent and with a verb POS tag
should be a noun or an adjective with graphical accent.
Load data for
PortugueseAccentuationCheckRule.A rule that matches words or phrases which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests correct ones instead.
A rule that matches words which should not be used and suggests correct ones instead.
A rule that matches words which should not be used and suggests correct ones instead.
Checks that compounds (if in the list) are not written as separate words.
A rule that matches words which require specific diacritics (e.g,
a la instead of à la).A rule that gives hints on the use of Portuguese filler words.
Hybrid chunker-disambiguator for Portuguese.
Rule for simple and frequent one-to-one spelling fixes in Portuguese.
A rule that checks the readability of Portuguese text (using the Flesch-Reading-Ease Formula)
If tooEasyTest == true, the rule tests if paragraph level > level (readability is too easy)
If tooEasyTest == false, the rule tests if paragraph level < level (readability is too difficult)
A rule that matches redundant expression.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
Portuguese word form synthesizer
Portuguese Part-of-speech tagger.
Localized from the German version by Tiago F.
A rule that matches known empty expressions.
A rule that matches common Wikipedia errors.
Portuguese version of
AbstractWordCoherencyRule.A rule that matches wordy expressions.
Adds a list Portuguese adverbs to
WordRepeatBeginningRule.Palavras que se repetem no Poruguês.
Tokenizes a sentence into words.
This rule checks if an adjective doesn't agree with the previous noun and at
the same time it doesn't agree with any of the previous words.
This rule checks if an adjective doesn't agree with the previous noun and at
the same time it doesn't agree with any of the previous words.
This rule checks if an adjective doesn't agree with the previous noun and at
the same time it doesn't agree with any of the previous words.
for e.g.
for e.g.
A stub to allow clients to do synchronous rpc calls to service PostProcessingServer.
A stub to allow clients to do ListenableFuture-style rpc calls to service PostProcessingServer.
Base class for the server implementation of the service PostProcessingServer.
A stub to allow clients to do asynchronous rpc calls to service PostProcessingServer.
Checks that compounds (if in the list) are not written as separate words.
Check for compounds written with dashes instead of hyphens.
Less to more common words loaded from CSV.
Finds words for which a better variant exists.
Information about premium-only rules.
Note: this is tuned for AgreementSuggestor, be careful when using it for other use cases.
Checks that compounds (if in the list) are not written as separate words.
Check for compounds written with dashes instead of hyphens.
A rule that matches words which should not be used and suggests
correct ones instead.
Probability, but doesn't check for a range of 0 to 1.
A stub to allow clients to do synchronous rpc calls to service ProcessingServer.
A stub to allow clients to do ListenableFuture-style rpc calls to service ProcessingServer.
Base class for the server implementation of the service ProcessingServer.
A stub to allow clients to do asynchronous rpc calls to service ProcessingServer.
Find compounds that might be morphologically correct but are still probably wrong, like 'Lehrzeile'.
Protobuf type
lt_ml_server.CachedResultMatchProtobuf type
lt_ml_server.CachedResultMatchProtobuf enum
lt_ml_server.CachedResultMatch.MatchTypeProtobuf type
lt_ml_server.CachedRuleProtobuf type
lt_ml_server.CachedRuleProtobuf enum
lt_ml_server.CachedRule.TagProtobuf type
lt_ml_server.MatchPositionProtobuf type
lt_ml_server.MatchPositionProtobuf type
lt_ml_server.NewLanguageMatchProtobuf type
lt_ml_server.NewLanguageMatchA rule that matches "..", "::", "-," but not "...", "!..", "?!!", ",-" etc
TODO: spaces seem to be special, extract from regexp?
A rule that checks for a punctuation mark at the end of a paragraph.
A rule that checks for a punctuation mark at the end of a paragraph.
A rule that checks there's a '¿' character if the sentence ends with '?',
same for '¡' and '!'.
A rule that matches spaces before ?,:,; and ! (required for correct French
punctuation).
A rule that matches spaces before ?,:,; and ! (required for correct French
punctuation).
A range in a text with its (potential/guessed) language.
A rule that checks the readability of English text (using the Flesch-Reading-Ease Formula)
If tooEasyTest == true, the rule tests if paragraph level > level (readability is too easy)
If tooEasyTest == false, the rule tests if paragraph level < level (readability is too difficult)
Keep only those matches whose 'year' value is last year or in
recent years (up to
maxYearsBack years ago).A rule that detects redundant modal and auxiliary verbs.
Filters matches that match a regex.
Matches 'regexp' elements from XML rules against sentences.
Filter rule matches after a RegexPatternRule has matched already.
Create a
RegexRuleFilter.Evaluates a
RegexRuleFilter for RegexPatternRules.Check a text using a remote LanguageTool server via HTTP or HTTPS.
Check a text using a remote LanguageTool server via HTTP or HTTPS.
The result of checking text on a remote LanguageTool server.
rules in remote-rule-filter.xml, same syntax as pattern rules
used as filters for results from matches provided by remote rule
i.e.
A potential error as returned by the HTTP API of LanguageTool.
A potential error as returned by the HTTP API of LanguageTool.
Information about the remote server as returned by the server's XML response.
A rule that suggests better names for technical operation names
Loads the relevant words from
/ca/replace_operationnames.txt.Limit the maximum number of request per IP address for a given time range.
Message bundle helper class used for translation of the user interface.
A resource bundle that uses its fallback resource bundle if the
value from the original bundle is null or empty.
Is responsible for getting the necessary resources for the LanguageTool
library.
A cache to speed up text checking for use cases where sentences are checked more than once.
Extend results by adding rules matches as hidden matches.
Romanian word form synthesizer.
Romanian Part-of-speech tagger
WordRepeatBeginningRule implementation for Romanian language.Tokenizes a sentence into words.
Abstract rule class.
Resolves an XML file's external entity URIs as relative paths.
Thrown if external rule filename doesn't match the required format.
Filter rule matches after a PatternRule has matched already.
Create a
RuleFilter.Evaluates a
RuleFilter.Information about an error rule that matches text and the position of the match.
Unlike
Category, this is specific to a RuleMatch, not to a rule.Write rule matches and some meta information as JSON.
Filter rule matches.
Class to configure rule option by option panel
An object holding a set of rules with an optional possibility to fetch only the ones applicable for a given sentence
(via
RuleSet.rulesForSentence(org.languagetool.AnalyzedSentence)), to avoid even invoking the definitely inapplicable ones.Filter rule matches so that only the longest match is kept from overlapping
matches with the same rule that contains tokens with maxOccurrence > 1.
A rule-based prototype Russian chunker.
Checks that compounds (if in the list) are not written as separate words.
Check for compounds written with dashes instead of hyphens.
A rule that gives Hints about the use of Russian filler words.
Hybrid chunker-disambiguator for Russian.
A
PartialPosTagFilter for Russian that also runs the disambiguator.A rule that matches words or phrases which should not be used and suggests
correct ones instead.
Avoid false alarms in the word repetition rule.
A rule that matches words which need a specific upper/lowercase spelling.
Russian word form synthesizer.
Part-of-speech tagger.
Russian version of
AbstractWordCoherencyRule.WordRootRepeatRule.
Filter rule matches so that only the first match is kept from overlapping
matches with the same rule group (actually: the same id).
Checks if the translation for segments that have more than two words
is different.
Interface for JPanel that can persist its state.
Words that can easily be confused - for internal use only.
A range in a text that makes up a sentence.
Tokenizes text into sentences.
Checks that there's whitespace between sentences etc.
Checks that there's whitespace between sentences.
A rule that gives Hints about the use of German filler words.
A rule that gives Hints about the use of German filler words.
Super class for HTTP and HTTPS server.
Provide short (up to ~ 40 characters) descriptions for words.
Checks whether a shortened year range (such as '1998-92') is valid, i.e., that the starting
date happens before the end date.
Search for names similar to ones used before in the same text, as these
might be typos.
Simple German (see e.g.
For internal use only.
Suggestions for replacing adverbs -ment
Loads the relevant word forms from
rules/ca/replace_adverbs_ment.txt.A rule that matches words which should not be used and suggests correct ones
instead.
A rule that matches words which should not be used and suggests
correct ones instead.
Load replacement data from a UTF-8 file.
Dicritics with IEC rules.
A rule that matches lemmas found only in DNV (AVL dictionary) and suggests
alternative words.
A rule that matches lemmas found only in DNV (AVL dictionary) and suggests
alternative words.
A rule that matches lemmas found only in DNV (AVL dictionary) and suggests
alternative words.
A rule that matches words which should not be used and suggests correct ones
instead.
A rule that matches words which should not be used.
A rule that matches proper names that has been renamed
Loads the relevant words from
rules/uk/replace_renamed.txt.A rule that matches words or phrases which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests correct ones
instead.
A rule that matches words which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests correct ones
instead.
A rule that matches words which should not be used and suggests
correct ones instead.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
A rule that matches words or phrases which should not be used and suggests
correct ones instead.
A rule that matches words which should not be used and suggests correct ones instead.
A rule that matches words which should not be used and suggests correct ones
instead.
A rule that matches words for which better alternatives exist and suggests them instead.
A rule that matches words that are written by 1992 spelling rules and suggests 2019 spelling instead.
A rule that matches words that are written by 2019 spelling (not approved in 1992 version)
A rule that matches incorrect verbs (including all inflected forms) and
suggests correct ones instead.
A rule that matches incorrect verbs (including all inflected forms) and
suggests correct ones instead.
A very simple sentence tokenizer that splits on
[.!?…] followed by whitespace
or an uppercase letter.Slovak word form synthesizer.
Slovak Part-of-speech tagger based on Russian Part-of-speech tagger.
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
A rule that matches words which should not be used and suggests
correct ones instead.
Hybrid chunker-disambiguator for Spanish
Spanish word form synthesizer.
Spanish Tagger.
A rule that matches common Wikipedia errors.
Adds Spanish suggestions to
WordRepeatBeginningRule.Avoid false alarms in the word repetition rule.
Tokenizes a sentence into words.
An abstract rule for spellchecking rules.
Old to new spelling data and similar formats loaded from CSV.
Class to tokenize sentences using rules from an SRX file.
Tools for loading an SRX tokenizer file.
An object encapsulating a text pattern and the way it's matched (case-sensitivity / regular expression),
plus some optimizations over standard regular expression matching.
A convenience class to work with bitext strings.
Tools for working with strings.
Constants for printing XML rule matches.
A rule checks sentences beginning repeatedly with a subject.
A rule checks the use of very short sentences repeatedly.
A rule that gives Hints about too often used Adjectives
A rule that gives Hints about too often used Adjectives
A rule that gives Hints about too often used Nouns
A rule that gives Hints about too often used Nouns
A rule that gives Hints about too often used Verbs
A rule that gives Hints about too often used Verbs
Check subject verb agreement for verb forms "ist", "sind", "war" and "waren".
A list of strings that are expected to occur in a larger string in the given order,
with optional requirements for them to start/end the text.
classify the type of the suggestion
so that downstream tasks (e.g.
Filter suggestions against a rule.
Helper class for SuggestionChangesTest, tracks experiment configuration and results
Rules should use getInstance() != null -> getInstance().getCurrentExperiment() in constructors to fetch relevant parameters
Use isRunningExperiment if no parameters are needed
Computes features that can be stored for training a model to rank suggestions,
or when using the same model for prediction
Used directly by SuggestionChangesTest for the former,
and as a superclass for the latter
Implementing classes must provide confidence values with the SuggestedReplacement objects returned by orderSuggestions
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
Actively maintained since v6.2+
Hybrid chunker-disambiguator for Swedish.
Swedish word form synthesizer.
Swedish Part-of-speech tagger.
Helper class for
GenericUnpairedBracketsRule to identify
symbols indexed with integers.spell checking using SymSpell algorithm, implementation from
https://github.com/Lundez/JavaSymSpell in org.languagetool.rules.spelling.symspell.implementation
potential faster alternative to Morfologik, especially in generating suggestions
Part-of-speech synthesizer interface.
Protobuf type
lt_ml_server.SynthesizeRequestProtobuf type
lt_ml_server.SynthesizeRequestProtobuf type
lt_ml_server.SynthesizeRequestItemProtobuf type
lt_ml_server.SynthesizeRequestItemProtobuf type
lt_ml_server.SynthesizeResponseProtobuf type
lt_ml_server.SynthesizeResponseProtobuf type
lt_ml_server.SynthesizeResponseItemProtobuf type
lt_ml_server.SynthesizeResponseItemA stub to allow clients to do synchronous rpc calls to service SynthesizeServer.
A stub to allow clients to do ListenableFuture-style rpc calls to service SynthesizeServer.
Base class for the server implementation of the service SynthesizeServer.
A stub to allow clients to do asynchronous rpc calls to service SynthesizeServer.
Reader of simple tab-delimited bilingual files.
Deprecated.
this language is unmaintained in LT and might be removed in a future release if we cannot find contributors for it (deprecated since 3.6)
Filipino Part-of-speech tagger.
Result of
WordTagger.The part-of-speech tagger interface, whose implementations are usually language-dependent.
Tamil Part-of-speech tagger.
A rule that considers the complete text, not just one sentence after
the other.
A part of a text with markup, either plain text (to be checked by LanguageTool),
or markup (to be ignored by LanguageTool).
Exception thrown if the text to be checked is longer than the
configured maximum.
A rule that checks if adjective and following noun agree on gender and inflection
A rule that checks if noun and verb agree
A rule that checks if adjective and following noun agree on gender and inflection
A rule that checks if preposition and a noun agree on inflection etc
A rule that checks if noun and verb agree
Interface for classes that tokenize text into smaller units.
GUI-related tools.
Thrown if too many requests per time unit have been sent from one
ip address or if the aggregated size of all requests per ip per time unit
exceeds the limit.
A rule that matches place names in French which should be
translated in Breton.
Entry of a translation dictionary.
A rule that matches short dash inside words
Hybrid chunker-disambiguator for Ukrainian.
Ukrainian word form synthesizer
Ukrainian part-of-speech tagger.
Tokenizes a sentence into words.
Exception thrown if the server is overloaded, e.g.
Implements unification of features over tokens.
A rule that gives Hints about the use of German phrases.
Implements unsynchronized stack (contrary to default Java
Stack,
this one is based on ArrayList).Finds some(!) words written uppercase that should be spelled lowercase and vice versa.
Finds some(!) words written uppercase that should be spelled lowercase.
Accepts rule matches when the uppercased word of the
token parameter
has a noun reading, e.g.Checks that a sentence starts with an uppercase letter.
User-specific configuration.
An item from our users table, with some information about the user/limits/etc.
Checker for v2 of the API, which returns JSON.
A spell checker that's fast but not guaranteed to always agree with the
"real" spell checker used in LT.
To be used in LEERZEICHEN_NACH_KLAMMER.
Simple agreement checker for German verbs and subject.
List of German verb prefixes.
A rule that checks for WhiteSpaces at the begin of a paragraph
A rule that checks for a whitespace at the end of a paragraph
A rule that matches several punctuation signs such as
: ; and % preceded by whitespace.Check incorrect use of "spiegelt ...
Loads word variations words from a UTF-8 file.
Catalan version of
AbstractWordCoherencyRule.German version of
AbstractWordCoherencyRule.English version of
AbstractWordCoherencyRulePersian version of
AbstractWordCoherencyRule.Dutch version of
AbstractWordCoherencyRule.Polish version of
AbstractWordCoherencyRule.Swedish version of
AbstractWordCoherencyRule.Catalan version of
AbstractWordCoherencyRule.Reader of WordFast Translation Memory text files.
Check if three successive sentences begin with the same word, e.g.
Check if a word is repeated, e.g.
Find the potential part-of-speech tags (POS tags) for a word.
Tokenizes a sentence into words.
Check if there is a confusion of two words (which might have a similar spelling) depending on the context.
Rule-based disambiguator.
XML rule handler that loads rules from XML and throws
exceptions on errors and warnings.
Validate XML files with a given DTD or XML Schema (XSD).
Defines an input source based on the correct path to external entities.
XML handler that throws exception on error and warning, does nothing otherwise.
Defines an EntityResolver-like object (cf.
Date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.
Date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.
Date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.
New year date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.
New year date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.
New year date filter that expects a 'date' argument in the format 'yyyy-mm-dd'.