Speech Data Company

Custom speech data for teams building voice products

We run speech recording, transcription, and dataset delivery workflows tailored to your technical requirements, from targeted recruitment to structured output.

Most voice products serve fewer than 30 of the world's 7,000+ languages.

Spirelight closes that gap.

All in one platform

The contributors and software you need, all in one place.

Vetted speakers in the right languages, dialects, and age bands record straight from our browser-based capture tools. No extra apps, no third-party software, no fragmented workflow.

Live progress, batch downloads

Live progress, validated formats, and batch downloads as you go.

Watch every hour land, then pull validated batches straight into your pipeline whenever you need them. No waiting for final delivery to start training.

What We Do

Built for projects where the details matter

Some teams need more than generic annotation or off-the-shelf audio. They need the right speakers, the right scenarios, the right formats, and a workflow that holds together from collection to delivery.

spirelight · session
Live session · Iberian Spanish

Scripted monologues, dialect-tagged

Recording
  • Maria · Madrid · 32 · Done
  • Javier · Sevilla · 41 · Recording
  • Ana · Bilbao · 27 · Queued
WAV · 48 kHz · stereo · Prompt set 02 / 12
spirelight · transcript
en-IE_002_dialogue_03.json · QA · 2 reviewers
  1. 00:00.42 S1 Could you walk me through the booking flow you used last Tuesday?
  2. 00:03.10 S2 Sure, I opened the app, tapped the search bar, then… (flagged)
  3. 00:06.94 S1 Got it. Any pauses or hesitations there?
  4. 00:09.38 S2 Yeah, [pause 1.2s] I had to scroll to find the right date.
Word-level timestamps · Speaker-aware · Diarised
manifest.json
{
  "project": {
    "id": "sl-9241",
    "language": "es-ES",
    "hours": 3000
  },
  "audio": {
    "format": "wav",
    "sample_rate": 48000,
    "channels": 2
  },
  "transcripts": {
    "format": "jsonl",
    "timestamps": "word",
    "diarised": true
  },
  "delivery": {
    "channel": "s3-bucket",
    "checksums": "sha256",
    "batches": true
  }
}
01

Speech collection

Remote or on-site recording projects with targeted contributors, configurable prompts, and controlled capture flows.

  • On-site or remote capture
  • Dual-channel and dialogue setups
  • Audio plus video when needed
02

Transcription

Manual, machine-assisted, or hybrid transcription with review layers, timestamps, and speaker-aware structure.

  • Word-level timestamps
  • Reviewer sampling during production
  • Domain terminology handling
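
As a rough illustration of what word-level timestamps and speaker-aware structure allow downstream, the sketch below reads a small JSONL transcript and sums speech time per speaker. The field names (`speaker`, `words`, `text`, `start`, `end`) and the sample values are assumptions made for this example, not Spirelight's actual transcript schema.

```python
import json
from collections import defaultdict

# Hypothetical JSONL transcript: one diarised segment per line.
# Field names and timings are illustrative assumptions only.
SAMPLE_JSONL = """\
{"speaker": "S1", "words": [{"text": "Could", "start": 0.42, "end": 0.61}, {"text": "you", "start": 0.61, "end": 0.74}]}
{"speaker": "S2", "words": [{"text": "Sure", "start": 3.10, "end": 3.42}]}
"""


def load_segments(jsonl_text):
    """Parse one JSON object per non-empty line."""
    return [json.loads(line) for line in jsonl_text.splitlines() if line.strip()]


def speech_seconds_by_speaker(segments):
    """Sum word durations (end - start) for each speaker label."""
    totals = defaultdict(float)
    for segment in segments:
        for word in segment["words"]:
            totals[segment["speaker"]] += word["end"] - word["start"]
    return dict(totals)


segments = load_segments(SAMPLE_JSONL)
totals = speech_seconds_by_speaker(segments)
for speaker, seconds in sorted(totals.items()):
    print(f"{speaker}: {seconds:.2f}s of speech")
```

Because every word carries its own start and end time, the same structure could also feed alignment checks or pause analysis without re-processing the audio.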
03

Dataset delivery

Audio, transcripts, metadata, and manifests packaged to match your pipeline.

  • JSON and manifest delivery
  • Custom metadata schemas
  • Bucket transfer or API handoff
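
A receiving pipeline might check a manifest before ingesting a batch. The sketch below uses the example manifest values shown on this page; the list of required fields and their types is an assumption about what a consumer could enforce, not a published Spirelight specification.

```python
import json

# Manifest values taken from the example manifest on this page.
MANIFEST_TEXT = """
{
  "project": {"id": "sl-9241", "language": "es-ES", "hours": 3000},
  "audio": {"format": "wav", "sample_rate": 48000, "channels": 2},
  "transcripts": {"format": "jsonl", "timestamps": "word", "diarised": true},
  "delivery": {"channel": "s3-bucket", "checksums": "sha256", "batches": true}
}
"""

# Assumed consumer-side expectations: section -> field -> exact Python type.
REQUIRED = {
    "project": {"id": str, "language": str, "hours": int},
    "audio": {"format": str, "sample_rate": int, "channels": int},
    "transcripts": {"format": str, "timestamps": str, "diarised": bool},
    "delivery": {"channel": str, "checksums": str, "batches": bool},
}


def validate_manifest(manifest):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    for section, fields in REQUIRED.items():
        block = manifest.get(section)
        if not isinstance(block, dict):
            problems.append(f"missing section: {section}")
            continue
        for name, expected in fields.items():
            if name not in block:
                problems.append(f"missing field: {section}.{name}")
            elif type(block[name]) is not expected:
                # Exact type match, so a boolean is not accepted as an integer.
                problems.append(f"wrong type: {section}.{name}")
    return problems


manifest = json.loads(MANIFEST_TEXT)
problems = validate_manifest(manifest)
print("manifest ok" if not problems else problems)
```

Returning a list of problems rather than raising on the first one lets a pipeline log everything wrong with a batch in a single pass.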
Why Spirelight

More specific than off-the-shelf data

Targeted Crowd Recruitment

Source speakers by language, dialect, location, age, gender, or project-specific criteria.

Flexible Production Setup

Configure monologues, dialogues, dual-channel capture, audio plus video, or structured prompt flows depending on the task.

Collection & Transcription Together

Avoid workflow fragmentation by handling recording, transcription, QA, and packaging in one production chain.

In-Production Quality Control

Review files while the project is running and catch issues before they become delivery problems.

Fast Scaling When Needed

Built for projects that need to scale quickly without falling back on generic data.

European Language Strength

A strong fit for multilingual and dialect-sensitive projects across European markets.

Technical Capabilities

Built for technical speech data requirements

Many projects fail not because the idea is wrong, but because the data is too broad, too noisy, badly structured, or impossible to reproduce. We work at the level where those details matter.

Speech Collection Workflow
Workflow: Collection
01

Remote & On-site Collection

We source speakers by language, dialect, location, age, and gender. Whether it's controlled recording on-site or distributed remote capture, we handle the recruitment and execution.

  • Remote recording workflows
  • On-site recording setups
  • Monologues and dialogues
  • Custom prompts and scenarios
Audio Engineering
Mode: Audio Capture
02

Technical Audio Engineering

Built for technical speech requirements. We configure monologue, dialogue, and dual-channel capture setups, and run noise-environment checks before any recording begins.

  • Dual-channel capture
  • Speaker-separated recordings
  • Audio plus video capture
  • Hardware checks before recording
Quality Assurance
Process: Verification
03

Human-in-the-Loop QA

Manual, machine-assisted, or hybrid transcription with multiple review layers. We catch issues while the project is running, not at the end.

  • Human review layers
  • Machine-assisted transcription
  • Word-level timestamps
  • Domain terminology handling
Structured Delivery
Format: Deployment
04

Structured Delivery

Audio, transcripts, and metadata packaged to match your pipeline. We deliver via bucket transfer or direct API handoff with full manifest validation.

  • Manifest and checksum packaging
  • Bucket delivery or API handoff
  • Custom metadata schemas
  • JSON and manifest delivery
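
Checksum packaging lets the receiving side confirm that a batch arrived intact. The following is a minimal sketch of SHA-256 verification, assuming a hypothetical mapping of file names to expected hex digests; the file names and the shape of that mapping are illustrative, not the actual delivery layout.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large audio files are never read in one go."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_batch(batch_dir, expected):
    """Return the file names that are missing or whose digest differs.

    `expected` maps relative file names to hex digests (hypothetical layout).
    """
    failed = []
    for name, want in expected.items():
        path = Path(batch_dir) / name
        if not path.is_file() or sha256_of(path) != want:
            failed.append(name)
    return failed


# Demo with a throwaway file standing in for a delivered recording.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "utt_0001.wav"
    sample.write_bytes(b"not real audio")
    good = {"utt_0001.wav": hashlib.sha256(b"not real audio").hexdigest()}
    bad = {"utt_0001.wav": "0" * 64, "utt_0002.wav": "0" * 64}
    ok_result = verify_batch(tmp, good)
    bad_result = verify_batch(tmp, bad)

print("intact batch failures:", ok_result)
print("corrupted batch failures:", bad_result)
```

Running a check like this on each batch as it lands means corrupted or incomplete transfers are caught before any file reaches training.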
Use Cases

Designed around your actual requirements

Automotive Voice

Command phrases, in-car scenarios, multilingual prompt sets, and structured dialogue data for voice interfaces across regions and accents.

Wake Word Datasets

Trigger word collection across demographic groups with controlled recording conditions and environmental variation.

Multilingual Assistants

Cross-language training data for virtual assistants covering multiple European languages and regional variants.

Call Simulation

Dialogue capture with role-play scenarios, separated speakers, and real conversational variation for customer interaction systems.

Accessibility Research

Combined audio and video capture with controlled consent flows and structured research delivery for assistive technology.

STT Evaluation

Domain-specific audio with timestamped transcription output for speech-to-text system testing and improvement.

TTS Datasets

Controlled scripts, expressive prompts, higher fidelity requirements, and linked speaker metadata for text-to-speech training.

Dialect Coverage

Language coverage projects targeting specific dialect regions with metadata-rich speaker profiles and geographic targeting.

Track Record

Projects we have delivered

Multilingual Production

Large multilingual dialogue collection

Type
Dialogue recording
Complexity
5+ languages, tight timeline
Handled
Recruitment, recording, transcription, QA
Delivery
Structured JSON + audio bundles
Controlled Recording

On-site controlled recording workflow

Type
On-site capture
Complexity
Hardware control, environment specs
Handled
Setup, capture, quality gates, packaging
Delivery
Dual-channel WAV + metadata CSV
High-Volume Pipeline

Transcription and QA at scale

Type
Transcription pipeline
Complexity
Multi-reviewer, domain-specific
Handled
Transcription, review layers, consistency
Delivery
Timestamped JSON + manifests
Team

The team behind the projects

Andreas Kromann

CEO
Commercial lead and project design
Emil Thorsson

CFO
Operations, compliance coordination, and delivery support
Gustav Aggeboe

CTO
Platform architecture and technical implementation
Joyi Ulfat

Senior Project Manager
Production oversight and project execution
Mateo Thelen

Project Manager
Coordination of contributors, workflows, and delivery steps
Pekka Larjovuori

Crowd Source Expert
Recruitment strategy and crowd operations
Get Started

Need a speech dataset that matches your real requirements?

Tell us what you need to collect, how it should be structured, and where the difficult parts are. We will help scope the workflow.

Or email us directly at hello@spirelight.net
10,000+
Verified contributors across Europe
50+
Languages & dialects captured
100%
Human QA on every project
48h
Average response time to a project brief