Get desktop application:
View/edit binary Protocol Buffers messages
The "Mean Opinion Score - Listening Quality Objective" for the degraded signal, following the comparison to the reference signal. Produced using an SVR model.
Mean of FVNSIM
Mean of per patch mean frequency band similarity across all patches. The order of the elements is lowest to highest in frequency bands.
Mean of 10th percentile frequency band similarity across all patches.
Stddev of per patch mean frequency band similarity across all patches. The order of the elements is lowest to highest in frequency bands.
Mean/Average energy over patches for each frequency band in the degraded signal.
Lowest to highest center frequency bands:
The patch comparison results for each patch compared between the reference and degraded signals.
If ViSQOl was used at the command line to process a reference and degraded filepath pair for comparison, this will hold the reference filepath.
If ViSQOl was used at the command line to process a reference and degraded filepath pair for comparison, this will hold the degraded filepath.
If the degraded audio was additionally aligned, this will store the value applied, in seconds. If it is positive, then the degraded audio was later than the reference, if it is negative then the reference audio was later than the degraded. To align audio, apply the lag to the audio file that was later by prepending 0 or moving indices.
Contains info related to the similarity result for each patch.
Used in:
Similarity score for this patch.
Stored lowest to highest.
The time (in sec) where this patch starts in the reference signal.
The time (in sec) where this patch end in the reference signal.
The time (in sec) where this patch starts in the degraded signal.
The time (in sec) where this patch ends in the degraded signal.
Contains info related to the input signals.
Used in:
The sample rate of the input signals. Both signals must have the same sample rate. This value must be supplied.
Contains config info for ViSQOL
Used in:
Not yet supported.
The path to a svr model file. If not supplied, the default model is used.
If true, the input audio files will be compared using the ViSQOL speech mode. See use_unscaled_speech_mos_mapping below for details regarding the mapping of NSIM to MOS-LQO.
Not yet supported.
Currently, only 48k samples rates are supported by ViSQOL Audio - (Speech mode does not have this restriction). To override this and run with non-48k input, set this bool to true.
When using the speech comparison mode, a value of false for this bool will cause a perfect NSIM score of 1.0 to be mapped to a perfect MOS-LQO of 5.0. If this bool is instead set to true, a perfect NSIM score will instead be mapped to a MOS-LQO of ~4.x.
If provided, this value will be used to determine how far the comparison algorithm will search to find the most optimal match for a given reference frame.
If true, use a deep lattice model to map similarity to MOS. This generally provides more accurate results than the default polynomial or SVR. This is recommended unless comparing to historic conformance scores. The binary default for this is `true`.