Skip to main content
Skip table of contents

SDK specifics for Voice Synthesis

SDK

VSDK-CSDK

Language count

65

Voice count

201

Creation of custom voice possible1

Yes (in studio)

Emotion simulation2

Yes (on request)

Emotion presets3

No

Voice quality choice4

neural-tts-arm, neural-tts-x86, embedded-compact, embedded-pro, embedded-high, embedded-premium, premium-high

Not all voices are available in every quality.

Language list

afb-APG, arb-001, ben-IN, bho-IN-JH, bul-BG, cat-ES, cat-ES-VC, ces-CZ, cmn-CN, cmn-CND, cmn-TW, dan-DK, deu-DE, ell-GR, eng-AU, eng-GB, eng-GB-SCT, eng-IE, eng-IN, eng-US, eng-ZA, eus-ES, fas-APG, fin-FI, fra-BE, fra-CA, fra-FR, glg-ES-GA, heb-IL, hin-IN, hrv-HR, hun-HU, ind-ID, ita-IT, jpn-JP, kan-IN-KA, kor-KR, mar-IN, msa-MY, nld-BE, nld-NL, nor-NO, pol-PL, por-BR, por-PT, ron-RO, rus-RU, slk-SK, slv-SL, spa-AR, spa-CL, spa-CO, spa-ES, spa-MX, swe-SE, tam-IN-TN, tel-IN, tha-TH, tur-TR, ukr-UA, vie-VN, yue-HK, zho-CN-SC, zho-CN-SH, zho-CN-SN

Voice size

Embedded-compact

800KiB ↦ 30 MiB

Embedded-pro

3.4MiB ↦ 114 MiB

Embedded-high

27MiB ↦ 316MiB

Embedded-premium

40MiB ↦ 527 MiB

Premium-high

191MiB ↦ 266 MiB

Neural-tts-x86

81MiB ↦ 124 MiB

Neural-tts-arm

81MiB ↦ 124 MiB

SDK code size

WINDOWS - X86_64 | 25 MB
LINUX - X86_64 | 9 MB
ARMV7HF | 7 MB
ARMV8 | 9 MB
ANDROID 6.0 (API 23) | 55 MB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

Hardware supported

MPU


  1. Creation of custom voice possible: You can modify the voice using Ssml markups which change the pitch, the rate, the timbre, the volume, …

  2. Emotion simulation: You can change the voice style using the Ssml markups i.e lively, neutral, formal, conversational, apologetic, didactic, … You can check this page for more details about the supported styles of each voice.

  3. Emotion presets: You can play recorded emotion audio by writing the name of the record in your voice synthesis. You can check vsdk-baratinoo voices features for more details about the supported emotion presets by voice.

  4. Neural voices limitation: Neural voices doesn’t support bookmark marker, textunit, word, … markers, change of rate, pitch, volume and timbre.

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.