Synthesizing text into speech.
The name of the model, which specifies basic synthesis functionality. Currently it must be left empty; do not set it.
Text to synthesize, given in one of the supported text synthesis markups.
Raw text (e.g. "Hello, Alice").
Optional hints for synthesis.
Optional. Default: 22050 Hz, linear 16-bit signed little-endian PCM, with a WAV header.
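To make the default output format concrete, the following is a minimal Python sketch that wraps raw 16-bit signed little-endian PCM samples in a WAV header at 22050 Hz, using only the standard library. The helper names `wrap_pcm_in_wav` and `describe_wav` are hypothetical, and the mono channel count is an assumption, since the description above does not state it.

```python
import io
import struct
import wave

DEFAULT_SAMPLE_RATE_HZ = 22050  # documented default sampling frequency
SAMPLE_WIDTH_BYTES = 2          # 16-bit samples
NUM_CHANNELS = 1                # assumption: mono; not stated in the source


def wrap_pcm_in_wav(pcm: bytes, sample_rate_hz: int = DEFAULT_SAMPLE_RATE_HZ) -> bytes:
    """Prepend a WAV header to raw 16-bit little-endian PCM data."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as writer:
        writer.setnchannels(NUM_CHANNELS)
        writer.setsampwidth(SAMPLE_WIDTH_BYTES)
        writer.setframerate(sample_rate_hz)
        writer.writeframes(pcm)
    return buf.getvalue()


def describe_wav(data: bytes) -> tuple[int, int, int, int]:
    """Return (sample rate, sample width in bytes, channels, frame count)."""
    with wave.open(io.BytesIO(data), "rb") as reader:
        return (
            reader.getframerate(),
            reader.getsampwidth(),
            reader.getnchannels(),
            reader.getnframes(),
        )


# Four samples packed as signed 16-bit little-endian integers.
samples = struct.pack("<4h", 0, 1000, -1000, 32767)
wav_bytes = wrap_pcm_in_wav(samples)
```

Reading the header back with `describe_wav(wav_bytes)` is a quick way to check that a payload really matches the format a client expects before handing it to a player.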
Part of synthesized audio.
Used in:
Sequence of bytes of the synthesized audio, in the format specified in output_audio_spec.
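Because each message carries only part of the synthesized audio, a client typically has to join the byte sequences in arrival order. The sketch below illustrates this with a hypothetical `AudioChunk` stand-in class and a hypothetical `data` field; the real generated message type and field names are not shown in this reference and may differ.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class AudioChunk:
    """Hypothetical stand-in for the generated audio chunk message."""
    data: bytes  # assumed field name for the chunk's byte sequence


def join_audio(chunks: Iterable[AudioChunk]) -> bytes:
    """Concatenate chunk payloads in the order they were received."""
    return b"".join(chunk.data for chunk in chunks)


# Simulated stream: three parts, one of them empty.
stream = [AudioChunk(b"RIFF"), AudioChunk(b""), AudioChunk(b"\x01\x02")]
audio = join_audio(stream)
```

Joining with `b"".join` avoids repeated reallocation compared with appending to a `bytes` object inside a loop, which matters for long utterances.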
Used in:
The audio format specified in request parameters.
The audio format specified inside the container metadata.
Used in:
Used in:
Linear PCM audio with 16-bit signed little-endian samples.
Data is encoded using the OPUS audio codec and packaged in the OGG container format.
Data is encoded as MPEG-1/2 Layer III audio and stored in the MP3 format.
Used in:
A hint for the TTS engine that specifies characteristics of the synthesized audio.
ID of the speaker to use.
Hint to change the speech rate.
Hint to specify the pronunciation character for the speaker.
Used in:
Encoding type.
Sampling frequency of the signal.
Used in:
Linear PCM audio with 16-bit signed little-endian samples.