Cabin Chinese Voice Assistant Control Command Dataset

#voice recognition #natural language processing #voice command control #automated driving #smart driving #in-car voice control #autonomous driving technology #driving safety systems
  • 500 hours
  • 1.3G
  • WAV
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

Currently, smart driving technology is becoming increasingly widespread, but there is significant room for improvement in the accuracy and reaction speed of its voice assistants. In-car voice control systems often encounter issues of misrecognition and incorrect reactions, leading to poor user experience. Some existing solutions have very limited recognition rate and response speed due to lack of optimization for the driving environment. This dataset aims to improve voice recognition accuracy, response speed, and stability in noisy environments through a rich sample of voice commands. Data collection is conducted using high-fidelity audio equipment in the cabins of various vehicle models, covering multiple traffic environments such as urban, suburban, and highways; quality control is stringent, including multi-round manual annotation, consistency verification, system evaluation, and expert review; the annotation team consists of traffic engineering experts and voice technology engineers, totaling 30 people; data preprocessing involves background noise reduction, audio segmentation, and voice enhancement; the final data is stored in WAV format, organized by environment and command type.

Dataset Insights

Sample Examples

759fe10d522f6a010eec5918e0b16e22.wav

  • 759fe10d522f6a010eec5918e0b16e22.wav
    00:00

Technical Specifications

FieldTypeDescription
file_namestringFile name
durationstringDuration
audio_ratestringAudio sample rate
audio_channelstringAudio channel
voice_commandstringThe driving control command recognized from the audio file.
speaker_genderstringThe gender category of the speaker in the audio, such as male or female.
speaker_age_groupstringThe age range of the speaker, such as child, teenager, or adult.
background_noise_levelstringThe degree of environmental background noise in the audio, such as low, medium, or high.
language_dialectstringThe type of language or dialect used in the audio.
emotionstringThe emotion expressed by the speaker in the audio, such as anger, happiness, or calm.
speaking_ratefloatThe speaking rate of the speaker in the audio, measured in words per minute.
accentstringThe accent characteristic of the speaker in the audio, such as Mandarin or Cantonese accent.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What are the main applications of this dataset?
The Cockpit Voice Assistant Control Command Dataset is primarily used in driving scenarios, including the training and optimization of voice recognition systems to improve the accuracy and response speed of voice assistants during driving.
What type of voice commands does this dataset include?
The dataset includes various types of driving-related voice commands, such as navigation, phone control, and music playback, aimed at providing a wide range of command data to support multifunctional voice assistant needs.
How was the dataset collected?
The dataset was collected by simulating various driving environments and capturing voice commands in different driving scenarios to ensure the reliability and effectiveness of the data in diverse acoustic environments.
What performance aspects can be improved using this dataset?
Using this dataset can significantly improve the accuracy of voice recognition systems in noisy environments, the quick responsiveness to voice commands, and the diversity in language recognition, enhancing driving safety and user experience.
What contributions does this dataset make to the development of voice assistant systems?
This dataset provides rich voice command resources for the development of voice assistant systems, helping developers improve voice recognition algorithms and enhance the practicality and user satisfaction of the systems in real-world applications.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Cabin Chinese Voice Assistant Control Command Dataset},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/f933869c7c62ef4773817b684ceb52ba},
  urldate={2026-02-04},
  keywords={driving voice dataset, in-car voice assistant, traffic voice recognition data, automotive voice control},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches