Providers specifics for Voice Synthesis
Provider | Cerence | Voxygen | Readspeaker |
---|---|---|---|
Language count | 67 | 8 | 30 |
Voice count | |||
Creation of custom voice possible1 | Yes (in studio) | Yes (in studio) | Yes (in studio) |
Emotion simulation2 | Yes (on request) | No | No |
Emotion presets3 | No | Yes | No |
Voice quality choice |
Not all voices are available in every quality. |
|
Not all voices are available in every quality. Not all qualities are available for every OS. |
Language list |
|
|
|
SDK name | CSDK | Baratinoo | VTAPI |
Voice size |
800KiB ↦ 30 MiB
3.4MiB ↦ 114 MiB
27MiB ↦ 316MiB
40MiB ↦ 527 MiB
191MiB ↦ 266 MiB |
50MiB ↦ 300MiB |
4MiB ↦ 36MB
126MiB ↦ 450MiB |
SDK code size | ~50 MiB | ~25 MiB | ~5 MiB |
Platform supported | WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23) | WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23) | WINDOWS - X86_64 LINUX - X86_64 ARMV7HF ARMV8 ANDROID 6.0 (API 23) |
Hardware supported | MPU | MPU | MPU |
Creation of custom voice possible: You can modify the voice using Ssml markups which change the pitch, the rate, the timbre, the volume, …
Emotion simulation: You can change the voice style using the Ssml markups i.e lively, neutral, formal, conversational, apologetic, didactic, … You can check this page for more details about the supported styles of each voice.
Emotion presets: You can play recorded emotion audio by writing the name of the record in your voice synthesis. You can check Voxygen voices features for more details about the supported emotion presets by voice.