SDK

VSDK-CSDK

VSDK-BARATINOO

VSDK-VTAPI

Language count

65

8

30

Voice count

181

15

85

Creation of custom voice possible1

Yes (in studio)

Yes (in studio)

Yes (in studio)

Emotion simulation2

Yes (on request)

No

No

Emotion presets3

No

Yes

No

Voice quality choice

embedded-compact, embedded-pro, embedded-high, embedded-premium, premium-high

Not all voices are available in every quality.

default

D22, P22

Not all voices are available in every quality.

Not all qualities are available for every OS.

Language list

afb-APG, arb-001, ben-IN, bho-IN-JH, bul-BG, cat-ES, cat-ES-VC, ces-CZ, cmn-CN, cmn-CND, cmn-TW, dan-DK, deu-DE, ell-GR, eng-AU, eng-GB, eng-GB-SCT, eng-IE, eng-IN, eng-US, eng-ZA, eus-ES, fas-APG, fin-FI, fra-BE, fra-CA, fra-FR, glg-ES-GA, heb-IL, hin-IN, hrv-HR, hun-HU, ind-ID, ita-IT, jpn-JP, kan-IN-KA, kor-KR, mar-IN, msa-MY, nld-BE, nld-NL, nor-NO, pol-PL, por-BR, por-PT, ron-RO, rus-RU, slk-SK, slv-SL, spa-AR, spa-CL, spa-CO, spa-ES, spa-MX, swe-SE, tam-IN-TN, tel-IN, tha-TH, tur-TR, ukr-UA, vie-VN, yue-HK, zho-CN-SC, zho-CN-SH, zho-CN-SN

arb-MA, deu-DE, eng-GB, eng-US, fra-FR, ita-IT, nld-NL, spa-ES

arb-001, ces-CZ, cmn-CN, cmn-TW, deu-DE, eng-AU, eng-GB, eng-IN, eng-US, fra-CA, fra-FR, hin-IN, hun-HU, ind-ID, ita-IT, jpn-JP, kor-KR, nor-NO, pol-PL, por-BR, por-PT, ron-RO, rus-RU, slk-SK, spa-AR, spa-ES, spa-MX, swe-SE, tha-TH, yue-CN

Voice size

Embedded-compact

800KiB ↦ 30 MiB

Embedded-pro

3.4MiB ↦ 114 MiB

Embedded-high

27MiB ↦ 316MiB

Embedded-premium

40MiB ↦ 527 MiB

Premium-high

191MiB ↦ 266 MiB

default

50MiB ↦ 300MiB

D22

4MiB ↦ 36MB

P22

126MiB ↦ 450MiB

SDK code size

~50 MiB

~25 MiB

~5 MiB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

Hardware supported

MPU

MPU

MPU


  1. Creation of custom voice possible: You can modify the voice using Ssml markups which change the pitch, the rate, the timbre, the volume, …

  2. Emotion simulation: You can change the voice style using the Ssml markups i.e lively, neutral, formal, conversational, apologetic, didactic, … You can check this page for more details about the supported styles of each voice.

  3. Emotion presets: You can play recorded emotion audio by writing the name of the record in your voice synthesis. You can check vsdk-baratinoo voices features for more details about the supported emotion presets by voice.