Skip to main content
Skip table of contents

SDK specifics

Below, you will find an overview of the different SDKs and their specificities regarding your uses of widgets.
Each SDK has different specificities and their uses will depend on what you are looking for in terms of technical characteristics but also according to the characteristics of your computer and your operating system.

Dictation

SDK

VSDK-VASR

Language count

2

Language list

eng-US fra-FR

Vocabulary Customization

YES

Resource size

~77 MB ↦ ~123 MB

SDK code size

WINDOWS - X86_64 | 17 MB
LINUX - X86_64 | 42 MB
ARMV7HF | 21 MB
ARMV8 | 33 MB
ANDROID 7.0 (API 24) | 27 MB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 7.0 (API 24)

Hardware supported

MPU

Grammar Base Recognition

SDK

VSDK-VASR

VSDK-CSDK

VSDK-TNL

Language count

6

41

7

Language list

deu-DE eng-US fra-FR ita-IT spa-ES nld-NL

afb-APG bul-BG ces-CZ cmn-CN cmn-TW dan-DK deu-DE ell-GR eng-AU eng-CN eng-GB eng-IN eng-US fas-APG fin-FI fra-CA fra-FR heb-IL hin-IN hun-HU ind-ID ita-IT jpn-JP kor-KR msa-MY nld-NL nor-NO pol-PL por-BR por-PT rus-RU slk-SK spa-ES spa-MX swe-SE tha-TH tur-TR yue-CN yue-HK zho-CN-SC zho-CN-SH

cmn-CN deu-DE eng-GB eng-US fra-FR jpn-JP spa-ES

Dynamic data

Yes

Yes

Yes

Phonetic

Yes

Yes

No

Phonetic in dynamic data

Yes

Yes

No

Tag annotation

Yes

Yes

No

Intermediate results

Yes

Yes

Yes

Voice activity detection

Yes

Yes

Yes

Confidence score

Yes

Yes

No

Event detection

Yes

Yes

Yes

Model + resource size

~33 MB

~15 MB

~6 MB

SDK code size

WINDOWS - X86_64 | 17 MB
LINUX - X86_64 | 42 MB
ARMV7HF | 21 MB
ARMV8 | 33 MB
ANDROID 7.0 (API 24) | 27 MB

WINDOWS - X86_64 | 25 MB
LINUX - X86_64 | 9 MB
ARMV7HF | 7 MB
ARMV8 | 9 MB
ANDROID 6.0 (API 23) | 55 MB

WINDOWS - X86_64 | 7 MB
LINUX - X86_64 | 2.5 MB
ARMV7HF | 2 MB
ARMV8 | 2.5 MB
ANDROID 6.0 (API 23) | 4.2 MB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 LINUX - ARMV7HFLINUX - ARMV8 ANDROID 7.0 (API 24)

WINDOWS - X86_64 LINUX - X86_64 LINUX - ARMV7HF LINUX - ARMV8 ANDROID 6.0 (API 23)

WINDOWS - X86_64 LINUX - X86_64 LINUX - ARMV7HF LINUX - ARMV8 ANDROID 6.0 (API 23)

Hardware supported

MPU

MPU

MPU

Voice Biometrics

Feature

VSDK-TSSV

VSDK-IDVOICE

Authentication from file

Yes

Yes

Authentication from streaming (microphone)

Yes

Yes

Identification from file

Yes

Yes

Identification from streaming (microphone)

Yes

No

Text dependent

Yes

Yes

Text independent

Yes

Yes

Resource size

< 1 MB

~230 MB

Voice template size

~50 kB / user

~5 kB / user

SDK code size

WINDOWS - X86_64 | 4 MB
LINUX - X86_64 | 1.8 MB
ARMV7HF | 1.4 MB
ARMV8 | 1.8 MB
ANDROID 6.0 (API 23) | 11 MB

WINDOWS - X86_64 | 3.9 MB
LINUX - X86_64 | 1.7 MB
ARMV8 | 1.7 MB
ANDROID 6.0 (API 23) | 12 MB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

WINDOWS - X86_64 LINUX - X86_64 ARMV8 ANDROID 6.0 (API 23)

Hardware supported

MPU

MPU

Enrollment feature (per utterance)

VSDK-TSSV

VSDK-IDVOICE

Is accepted

Yes

Yes

SNR level

Yes

Yes

SNR is acceptable

Yes

Yes

Contains speech

Yes

Yes

Speech begin time

Yes

No

Speech end time

Yes

No

Speech duration

Yes

Yes

Speech duration is acceptable

Yes

Yes

Is peak clipped

Yes

No

Is band limited

Yes

No

Is consistent (relative to previous ones, text dependent only)

Yes

No

Is phrase verified (text dependent only)

Yes

No

Voice Synthesis

SDK

VSDK-CSDK

VSDK-BARATINOO

Language count

65

8

Voice count

181

15

Creation of custom voice possible1

Yes (in studio)

Yes (in studio)

Emotion simulation2

Yes (on request)

No

Emotion presets3

No

Yes

Voice quality choice

embedded-compact, embedded-pro, embedded-high, embedded-premium, premium-high

Not all voices are available in every quality.

default

Language list

afb-APG, arb-001, ben-IN, bho-IN-JH, bul-BG, cat-ES, cat-ES-VC, ces-CZ, cmn-CN, cmn-CND, cmn-TW, dan-DK, deu-DE, ell-GR, eng-AU, eng-GB, eng-GB-SCT, eng-IE, eng-IN, eng-US, eng-ZA, eus-ES, fas-APG, fin-FI, fra-BE, fra-CA, fra-FR, glg-ES-GA, heb-IL, hin-IN, hrv-HR, hun-HU, ind-ID, ita-IT, jpn-JP, kan-IN-KA, kor-KR, mar-IN, msa-MY, nld-BE, nld-NL, nor-NO, pol-PL, por-BR, por-PT, ron-RO, rus-RU, slk-SK, slv-SL, spa-AR, spa-CL, spa-CO, spa-ES, spa-MX, swe-SE, tam-IN-TN, tel-IN, tha-TH, tur-TR, ukr-UA, vie-VN, yue-HK, zho-CN-SC, zho-CN-SH, zho-CN-SN

arb-MA, deu-DE, eng-GB, eng-US, fra-FR, ita-IT, nld-NL, spa-ES

Voice size

Embedded-compact

800KiB ↦ 30 MiB

Embedded-pro

3.4MiB ↦ 114 MiB

Embedded-high

27MiB ↦ 316MiB

Embedded-premium

40MiB ↦ 527 MiB

Premium-high

191MiB ↦ 266 MiB

default

50MiB ↦ 300MiB

SDK code size

WINDOWS - X86_64 | 25 MB
LINUX - X86_64 | 9 MB
ARMV7HF | 7 MB
ARMV8 | 9 MB
ANDROID 6.0 (API 23) | 55 MB

WINDOWS - X86_64 | 2.6 MB
LINUX - X86_64 | 1.1 MB
ARMV7HF | 816 KB
ARMV8 | 1.1 MB
ANDROID 6.0 (API 23) | 17.7 MB

Platform supported

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23)

Hardware supported

MPU

MPU


  1. Creation of custom voice possible: You can modify the voice using Ssml markups which change the pitch, the rate, the timbre, the volume, …

  2. Emotion simulation: You can change the voice style using the Ssml markups i.e lively, neutral, formal, conversational, apologetic, didactic, … You can check this page for more details about the supported styles of each voice.

  3. Emotion presets: You can play recorded emotion audio by writing the name of the record in your voice synthesis. You can check vsdk-baratinoo voices features for more details about the supported emotion presets by voice.

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.