OpenGrm Advanced Usage Below are a variety of topics covered in greater depth or of more specialized interest than found in the Quick Tour. Reading the Quick...
NGramApply Description This utility intersects an n-gram model with each FST in an input FST archive. It produces an output FST archive that contains each FST after...
NGramCount Description This utility counts n-grams from an input FST archive. This produces a count FST with the same topology as the eventual normalized model,...
OpenGrm NGram COPYING Licensed under the Apache License, Version 2.0 (the "License"); you may not use these files except in compliance with the License. You may obtain...
NGramInfo Description The command-line utility ngraminfo prints various information about an n-gram model obtained from the NGramModel class and the underlying...
OpenGrm NGram Library Version 1.3.16 is now available for download. NGram is now available on conda-forge. This allows Linux (x86) and Mac OS X users who already...
NGramMake Description This operation produces a smoothed, normalized language model from an input n-gram count FST. It smooths the model in one of six ways: witten...
NGramMarginal Description (Available in versions 1.1.0 and higher.) This operation re-estimates smoothed n-gram models by imposing marginalization constraints...
NGramMerge Description This operation merges two n-gram language models or two n-gram count FSTs. The operation provides options for weighting the two input FSTs...
NGram Model Format The following gives the encoding of all n-gram models produced by the utilities here, including those with unnormalized counts, as a cyclic weighted...
NGramPerplexity Description Command-line utility to calculate the perplexity of a corpus given a model. Verbose mode gives the per-word contribution to the perplexity...
NGramPrint Description By default, only n-grams are printed (without backoff epsilon transitions), in the same format as discussed above for reading in n-gram...
OpenGrm NGram Library Quick Tour This tour is organized around the stages of n-gram model creation, modification and use: corpus I/O (ngramsymbols, farcompilestrings...
NGramRandGen Description This operation randomly generates a set of successful sentences in the input FST and outputs them in finite-state archive format. Backoff...
NGramRead Description It has flags for specifying the format of the text input, currently one of two options: By default, the text file is read as a sorted...
OpenGrm NGram README OpenGrm NGram Release 1.3 The OpenGrm NGram library is used for making and modifying n-gram language models encoded as weighted finite-state...
NGramShrink Description This operation shrinks or prunes an n-gram language model in one of three ways: count pruning: prunes based on count cutoffs for...
NGramSymbols Description Command-line utility to produce a symbol table from an input text corpus. Creates a symbol entry for every type in the corpus, as well as...
Thrax Release 0.1 (Alpha version.) Thrax Release 1.0 Removed dependency on ICU for UTF8 string parsing: --with-icu configuration flag no longer needed and...