Next available ID: 23
Filename for FST-based pair LM.
Whether to invert the pairs, i.e., the input symbols are on the output side of the pairs.
Whether pair LM is encoded as a transducer.
Filename for pre-computed transliteration pairs. Expects a TSV file with 3 columns: input string, output string and cost (negative log probability).
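The three-column layout described above can be read as in the following minimal sketch. The function name and the sample rows are hypothetical and are not taken from the library; only the column order (input string, output string, cost as negative log probability) comes from the description.

```python
import csv
import io
import math


def read_translit_candidates(tsv_file):
    """Parse rows of (input, output, cost) into a dict keyed by input string.

    Hypothetical helper for illustration; not part of the library.
    """
    candidates = {}
    reader = csv.reader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        if len(row) != 3:
            raise ValueError(f"expected 3 columns, got {len(row)}: {row!r}")
        source, target, cost = row[0], row[1], float(row[2])
        candidates.setdefault(source, []).append((target, cost))
    return candidates


# Invented sample data: two candidate outputs for one input string.
sample = "namaste\tनमस्ते\t0.105\nnamaste\tनमस्त\t2.303\n"
cands = read_translit_candidates(io.StringIO(sample))

# Lower cost means higher probability, since cost = -log(p).
best_target, best_cost = min(cands["namaste"], key=lambda tc: tc[1])
best_prob = math.exp(-best_cost)
```

Because the cost is a negative log probability, `math.exp(-cost)` recovers the probability of a pair.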
Whether pre-computed translit_cands_file items override pair LM candidates rather than being mixed with them.
Mixture weight between pair LM candidates and pre-computed pairs.
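One plausible reading of the mixture weight is linear interpolation in probability space. The sketch below assumes the weight is the share given to the pair LM candidate and the remainder goes to the pre-computed pair; the description does not say which side the weight applies to, so both the direction and the function name are assumptions.

```python
import math


def mix_costs(pair_lm_cost, precomputed_cost, weight):
    """Interpolate two negative-log-probability costs in probability space.

    Assumption: `weight` is the share of the pair LM candidate and
    (1 - weight) the share of the pre-computed pair.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    prob = (weight * math.exp(-pair_lm_cost)
            + (1.0 - weight) * math.exp(-precomputed_cost))
    # Convert the mixed probability back to a cost.
    return -math.log(prob) if prob > 0.0 else math.inf


# Pair LM assigns p = 0.2, pre-computed pairs assign p = 0.6, equal weights.
mixed = mix_costs(-math.log(0.2), -math.log(0.6), 0.5)
```

With equal weights the mixed probability is 0.4, so `mixed` equals `-log(0.4)`.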
Filename for language model for full-string transliteration. Expects an FST encoded model.
Whether the language model requires closure (e.g., a word-level wordpiece model).
Symbol in language model FST corresponding to OOV symbols.
Additional cost accrued by OOV symbols.
Symbol in the language model FST corresponding to epsilon.
The maximum number of candidate transliterations per word.
Threshold on scores for word candidates, expressed as the difference in log probability from the highest-probability candidate.
A minimum posterior probability for any transliteration candidate at a word-position, as part of final pruning in addition to max_word_cands.
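The three pruning criteria above (a cap on the number of candidates, a score threshold relative to the best candidate, and a minimum posterior) could combine roughly as follows. This is a sketch under assumptions: apart from max_word_cands, the parameter names are hypothetical, and the order of the checks and the normalization over all candidates at the word position are guesses rather than the library's documented behavior.

```python
import math


def prune_candidates(cands, max_word_cands, score_threshold, min_posterior):
    """Prune a list of (string, cost) pairs, cost = negative log probability."""
    if not cands:
        return []
    ranked = sorted(cands, key=lambda c: c[1])  # lowest cost (best) first
    best_cost = ranked[0][1]
    # Assumption: posteriors are normalized over all candidates for the word.
    total = sum(math.exp(-cost) for _, cost in ranked)
    kept = []
    for text, cost in ranked[:max_word_cands]:
        if cost - best_cost > score_threshold:
            break  # too far below the best candidate in log probability
        if math.exp(-cost) / total < min_posterior:
            break  # posterior too small; later candidates are smaller still
        kept.append((text, cost))
    return kept


# Invented candidates with probabilities 0.6, 0.3, 0.08 and 0.02.
cands = [("a", -math.log(0.6)), ("b", -math.log(0.3)),
         ("c", -math.log(0.08)), ("d", -math.log(0.02))]
kept = prune_candidates(cands, max_word_cands=3,
                        score_threshold=2.5, min_posterior=0.05)
```

Since candidates are sorted by cost, both the threshold check and the posterior check can stop at the first failure.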
Internal word-piece prefix. If non-empty, will assume language model is constructed of word-pieces.
Allows caching to be turned off to prevent memory leaks when running as a service.
Maximum number of tokens to transliterate in parallel using different cores.
Whether to apply language model at the word (rather than sentence) level.
Filename of word-piece model for on-the-fly word-piece segmentation.
String prefix indicating word-initial wordpieces. Uses default if not set.
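To illustrate what a word-initial prefix means in practice, the sketch below rejoins a sequence of wordpieces into words. The marker "▁" and the function name are hypothetical, chosen only for the example; the actual default prefix is not stated here.

```python
def join_wordpieces(pieces, word_initial_prefix):
    """Rejoin wordpieces into words; a prefixed piece starts a new word."""
    words = []
    for piece in pieces:
        if piece.startswith(word_initial_prefix):
            # Strip the marker and open a new word.
            words.append(piece[len(word_initial_prefix):])
        elif words:
            # Continuation piece: append to the current word.
            words[-1] += piece
        else:
            # Leading piece without a marker: treat it as a word start.
            words.append(piece)
    return words


# "▁" is an assumed marker for illustration only.
words = join_wordpieces(["▁trans", "liter", "ation", "▁model"], "▁")
```

An internal word-piece prefix works the other way round: the marker appears on continuation pieces instead of on word-initial ones.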
Whether to sample from the k-best candidates rather than returning all of them.
Random seed for sampling; defaults to std::random_device if not set. This is mainly provided for testing purposes, best to leave unset.
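The effect of fixing the seed can be sketched as below. This assumes candidates are drawn in proportion to their probability, which the description does not confirm; the function name is hypothetical. Passing `seed=None` lets Python seed from system entropy, roughly analogous to falling back to std::random_device.

```python
import math
import random


def sample_from_kbest(kbest, seed=None):
    """Draw one candidate from (string, cost) pairs, weighted by exp(-cost).

    Assumption: sampling is proportional to probability.
    """
    rng = random.Random(seed)  # seed=None: seeded from system entropy
    texts = [text for text, _ in kbest]
    weights = [math.exp(-cost) for _, cost in kbest]
    return rng.choices(texts, weights=weights, k=1)[0]


# Invented k-best list with probabilities 0.7, 0.2 and 0.1.
kbest = [("x", -math.log(0.7)), ("y", -math.log(0.2)), ("z", -math.log(0.1))]
first = sample_from_kbest(kbest, seed=7)
second = sample_from_kbest(kbest, seed=7)
```

A fixed seed makes repeated calls return the same draw, which is useful in tests; leaving it unset gives different draws across runs.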