C8C.AI
Voice & Data Collection · 2026
Audio Sample Reel

The kinds of conversational data we deliver.

Each clip below is roughly two minutes of consented, naturally-spoken audio captured for AI training. Together they cover studio capture, remote multispeaker sessions, domain dialogue, code-switching, and accent variation. Below the samples is a snapshot of our active locale coverage. Press play to listen — more samples are available on request.

Samples

11 categories
01 / Studio

Studio · Two-Speaker Conversation (Colombian Spanish)

es-CO studio channel-separated

Spontaneous two-speaker Colombian Spanish recorded in studio. Stereo, channel-separated (one speaker per channel), 16-bit / 44.1 kHz — the format diarization and ASR pipelines expect. studio capture · stereo · channel-separated · 16/44.1

02 / Audition Examples (×2)

Voice Talent Auditions — Line Read + Contextual Read

to add EN + ES

Two audition clips showing how we vet voice talent before booking — a scripted line-read plus a contextual read. Pulled from C8C's PII-redacted auditions library covering New York, Bogotá, and Medellín pools. candidate pool: auditions-portal

Andrew to select 2 clips
03 / Single Speaker

Single-Speaker Capture · Studio or Remote

to add

A solo-speaker reference — one talent, headset-quality remote capture or booth, demonstrating the per-speaker signal we deliver in every multi-speaker session as channel-separated tracks. format: WAV · headset or booth

Andrew to select clip
04 / Scottish Conversation

Scottish English · Two-Speaker Conversation

en-AB to add

Native Scottish English between two speakers — strong regional accent, naturally paced, the kind of accent diversity off-the-shelf datasets rarely cover. remote session · two-track mix

Andrew to select clip
05 / Group Spanish

Multispeaker Spanish · Live Group Session (≈2 min)

es-LATAM to add

Three-or-more speaker live Spanish session — natural overlap, back-channelling, and interruption. Each speaker captured on a separate track; mixed for preview here, channel-separated source on request. 3+ speakers · per-speaker tracks · remote real-time capture

Andrew to select clip
06 / Code-Switching

Bilingual Code-Switching

to add bilingual

Natural mid-utterance code-switching from a bilingual speaker — how multilingual users actually talk, and the kind of input ASR/NLU systems must handle if they're going to ship to global markets. format: WAV · anonymized

Andrew to select clip
07 / English — Meeting / Call Center

Welsh English · Call Center Two-Speaker

en-WL domain 2-speaker

Welsh English call-center scenario — agent-and-customer dialogue with realistic turn-taking, clarification requests, and natural emotion. Stereo channel-separated for clean speaker separation. studio capture · stereo · channel-separated

08 / English — Medical

Irish English · Healthcare Telemedicine

en-IE domain telemedicine

Irish English healthcare scenario — a clinician-and-patient exchange in the telemedicine register. Two-speaker, stereo, channel-separated. studio capture · stereo · channel-separated · 16/44.1

09 / Highly Expressive

Cinematic / Emotional Speech

to add expressive

Highly emotive, performance-driven speech — wider pitch and intensity range than conversational data, useful for emotion classifiers, expressive TTS, and prosody-aware models. format: WAV · performance capture

Andrew to select clip
10 / Scripted

Scripted Read · Studio Quality

to add scripted

Studio-quality scripted read — clean signal, controlled pacing, and consistent loudness. The format we deliver for TTS training, prompt banks, and QA reference recordings. format: WAV · studio booth

Andrew to select clip
11 / Utterances

Short Utterances · Wake Words / Commands

to add utterance

Short, isolated utterances — wake words, commands, prompt phrases — delivered as clip-per-utterance with metadata. Used for keyword spotters, voice-control models, and Lombard / noise-robustness sets. format: WAV per utterance · with metadata

Andrew to select clip

Local Language Coverage

35+ languages

Available

In delivery or ready to deliver
en-US American English
en-IE Irish English
en-WL Welsh English
en-AB Scottish English
en-SG Singapore English
es-SV Salvadoran Spanish
es-CO Colombian Spanish
es-MX Mexican Spanish
es-US US Spanish ↔ EN
es-ES European Spanish ↔ EN
pt-BR Brazilian Portuguese ↔ EN
pt-PT European Portuguese ↔ EN
fr-CA Canadian French ↔ EN
fr-FR European French ↔ EN
de-DE Standard German ↔ EN
it-IT Italian ↔ EN
hi-IN Hindi ↔ EN
he-IL Hebrew ↔ EN
tl-PH Tagalog (code-switch)

Source on request

Sourced for project
ar-AE Arabic (UAE) ↔ EN
ru-RU Russian ↔ EN
nl-NL Dutch ↔ EN
no-NO Norwegian ↔ EN
da-DK Danish ↔ EN
fi-FI Finnish ↔ EN
sv-SE Swedish ↔ EN
ta-IN Tamil ↔ EN
th-TH Thai ↔ EN
vi-VN Vietnamese ↔ EN
tr-TR Turkish ↔ EN
ko-KR Korean ↔ EN
ja-JP Japanese ↔ EN
zh-CN Chinese (Simplified) ↔ EN
zh-HK Chinese (Hong Kong) ↔ EN
zh-TW Chinese (Taiwan) ↔ EN