Skip to main content
Skip table of contents

Audio Asset Management

Some features in VDK-Studio Web require audio files to function properly. We’ve designed a seamless workflow to make file management easy and efficient:

image-20251124-153224.png

Audio asset manager preview

Use the dedicated upload modal to add your audio files directly. Files are automatically converted to a standardized format and sample rate, ensuring full compatibility with our technology.

The built-in Audio Asset Manager gives you complete control:

  • Upload new audio files in seconds

  • Rename files for better organization

  • Delete unnecessary files to keep your workspace clean

  • Transcribe audio files and access the text version instantly

How to manage my Audio Assets ?

Hub 5.png

Empty Audio Asset library

The first time you open your project’s Audio Asset library, you can either start recording (1) audio from scratch or import audio files (2) directly from your computer. You can also import a ZIP file with audio files and their transcriptions using the Bulk Import / Export transcribed files button (3).

(1) How to record my voice ?

How to record my voice ?

Hit start recording to capture your voice live. Recordings last up to 10 seconds and stop automatically. Audio files record at 16 kHz sample rate in WAV format.

d1214c3d-e0d9-4597-ac0e-ed40d6769ae0.png

Let it end or click Stop Record

When your audio asset library isn’t empty, you’ll still see the record button next to the import file button.

image-20251127-091223.png

The small version

(2) How to import files ?

Introduction to the file format

We recommend uploading lossless audio files for the best recognition by our technologies. Lossy files work too, but note that performance may drop.

Each file has a size and duration limit: 20MB and 10 seconds max. You can upload as many files as you want—no limit there, aside from decency. We limit duration to 10 seconds because voice recognition works best with short sentences, like commands or wake words. This limit suits any unit tests you want to try.

Please note you can upload audio files with at least a 16 kHz sample rate. Any file not in WAV 16 kHz will quickly be converted automatically to this format to work with our technologies.

How to import files ?

Hub 8.png

The upload files modal

Click browse files or drag your audio files onto the cloud icon. Please ensure your files match the detailed table's expectations, or the server will reject them.

Hub 82.png

Files added before sending to the server

You can add or remove files (2) until you decide it’s ready for upload (1).

Hub 83.png

Our file was refused because it last over 10 seconds

Each successfully uploaded file gets a green checkmark. Each failed upload shows a red warning icon you can mouse over to see why the server rejected it.

(3) I want to import transcribed audio files

The bulk import follows all constraints outlined in the How to import files? section.

Hub 51.png

Download ZIP file

To understand the ZIP format, you can Download the Template (1a) zip file or download the project’s previously uploaded (or recorded) files using the Download Current Assets button (1b).

Modify the extracted ZIP by adding your audio files and editing the assets.csv reference file. This file contains audio_path and audio_transcription properties.

CSV Structure

The CSV file must contain two required columns (in any order):

  • audio_path: Path or filename of the audio file

  • audio_transcription: Transcription text for the audio

Example CSV (assets.csv):

CSV
audio_path,audio_transcription
ready.wav,ready
start_picking.wav,start picking

With extra columns (ignored):

CSV
category,audio_path,audio_transcription,duration,notes
sounds,ready.wav,ready,0.5s,example sound
sounds,start_picking.wav,start picking,0.8s,another example

Path Transformation

When using paths in the CSV, they are transformed to filenames:

  • Forward slashes / are replaced with underscores _

  • Backslashes \\ are replaced with underscores _

Example:

CSV
audio_path,audio_transcription
user1/audio.wav,Hello world

Becomes:

  • user1_audio.wav in the VDK Studio

Click the next button (2) when your zip is ready to be uploaded.

Upload Zip file

image-20260210-093544.png

Drop your ZIP file

Import Behavior

When you import this ZIP (or a modified version):

  1. CSV is parsed with flexible column ordering

  2. New files are imported as new assets

  3. Existing files (same name AND size) are skipped to enable continuous synchronization

  4. Modified files (same name but different size) are updated

  5. Invalid files are skipped with detailed error messages

image-20260210-093637.png

You will get this error if no assets.csv is found

image-20260210-095717.png

Import Results

You can make consecutive uploads and see what has been wrong, successfull or ignored during the process.

image-20260210-095906.png

Finally, our files are imported with their corresponding transcriptions !

Transcriptions are mandatory for Unit Testing. Please take a look at our dedicated documentation.

Hub 7.png

A detailled overview of the Audio

  • You can find the Record Audio button (1) and the Import File button (2) in the page's header toolbar.

  • Each Audio Asset file can be played back (3) or deleted permanently (4)

  • Files can be renamed (5) and transcribed (6). Transcription is required for the Unit Testing features.

  • You can select multiple files using the dedicated checkboxes (7) and delete the selected files permanently (8).

  • A search bar (9) is available to help you find paginated (10) Audio Assets that are not immediatly visible.

  • The total number of Audio Assets (11) in your project is displayed in the secondary menu.

You will find our Audio Asset Management in several places within the VDK-Studio Web. For example, whenusing Unit Tests or the Speech Enhancement widget, you may be asked to attach an audio file to test the feature. The Audio Asset Manager will appear either directly on the page or within a dedicated modal.

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.