Introduction
Overview
Vivoka is a platform that combines different providers of a voice technologies in a one product, which makes testing and integration seamless.
Technologies
Speech Recognition — transcribes audio to text.
Wake up Word — listens for a phrase that is meant to activate a device when spoken (e.g. ‘Ok, Google!’).
Speech Synthesis — synthesize speech from text.
Voice Biometrics — lets you authorize or identify person based on their voice.
Speech Enhancement — helps you handle noise in your audio (usually used together with speech recognition).
NLU — helps you parse natural language text to commands.
What do we support?
Technologies work offline — our solutions work on embedded low-resource devices with no access to the Internet.
Supported languages — we support up to 41 languages for speech-to-text and up to 65 languages for text-to-speech.
Platforms — Linux amd/arm, Android, Windows.
Our products
Right now we have 4 main products.
To help you with configuration and testing we have:
VDK-Studio — allows you to configure and test technologies, and export them for integration.
Console — a user space where you can manage your projects, request access and download the necessary resources.
To help you with integration we have:
VSDK — low-level library (available for C++ and Android).
VDK-Service — high-level
VSDKwrapper (rest API server).
How does it work?
Let’s walk through the whole journey together.
1. Exploration
In the beginning of your project, when you already know what voice technologies you need, you will have to search for a right technology provider.
VDK-Studio simplify this search process for you. Their primary goal is to save your time and effort. This application provides an interface that lets you configure technologies and try them without writing any code.
Later you will be able to export your project ready for deployment.
2. Development
Chosen technologies might still require configuration, which you can do directly in VDK-Studio. During this phase our “How to guides” and responsive support will be here to accompany you.
3. Integration
Now you are ready to export your project and start deployment. Here comes VDK-Service or it’s low-level alternative VSDK. They provide a layer to help you connect exported project with your code base.
Our big advantage is the fact that regardless of a provider or a technology — you will stay within the same ecosystem.
Imagine how complicated the process would be without this layer. Let’s say you found a speech recognition tech from one provider and speech synthesis from another. It probably means that they will have different support, documentation and integration process. This path is risky and will cost you much more time!