Dangerous Driving Behaviors (e.g., Fatigue) Audio Detection Dataset in Driver Training

#audio classification #behavior detection #fatigue detection #speech recognition #driving safety #driver training #fatigue detection #traffic safety
  • 500 hours
  • 1.3G
  • WAV
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

In the current driver training industry, dangerous behaviors such as fatigued driving remain one of the main causes of traffic accidents. Existing fatigue detection solutions mostly rely on video monitoring or physiological signal monitoring, which have issues such as high equipment costs and privacy protection difficulties. This dataset provides a more feasible solution for dangerous behavior detection through audio data, aiming to improve detection effectiveness and coverage. Data collection is conducted using high-sensitivity microphones installed in driving simulators to gather sound data during different driving states. Multiple rounds of annotation and consistency checks are employed to ensure high data quality. The annotation team consists of over 20 experts in the traffic safety field. Data underwent preprocessing steps like noise reduction and feature extraction and is stored in WAV format, organized by driving scenarios.

Dataset Insights

Sample Examples

7fb1c1d7e1ac3d597a9245a96f68eb70.wav

  • 7fb1c1d7e1ac3d597a9245a96f68eb70.wav
    00:00

Technical Specifications

FieldTypeDescription
file_namestringFile name
durationstringDuration
audio_ratestringAudio sample rate
audio_channelstringAudio channel
audio_durationfloatThe total playback duration of the audio file in seconds.
num_speakersintThe number of simultaneous speakers present in the audio.
primary_languagestringThe primary language used in the audio file.
noise_levelfloatThe level of background noise in the audio, usually measured in decibels (dB).
emotion_detectedstringThe primary emotion detected in the audio, such as fatigue or anger.
speech_ratefloatThe average speech rate of the speaker, measured in words per minute (WPM).
keyword_detectedstringList of significant keywords detected in the audio.
clarity_scorefloatThe clarity score of the speech in the audio, usually on a scale from 0 to 1.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

How does this dataset help improve drivers' safety awareness?
By analyzing audio data to identify risky behaviors such as fatigue or distraction, it helps raise safety awareness.
How was the audio data in this dataset collected?
The audio data was collected using sensor devices installed in vehicles during driving training.
What is the significance of this dataset for the safety industry in detecting hazardous driving behaviors?
This dataset provides key insights into driver behavior, aiding in the development of more effective safety measures.
What research or applications is this dataset suitable for?
This dataset is suitable for research in machine learning and artificial intelligence, particularly in the fields of driving behavior analysis and safety system development.
Can this dataset be used for real-time detection of driver status?
Yes, with the appropriate model training, this dataset can be used for real-time monitoring and detection of driver status.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Dangerous Driving Behaviors (e.g., Fatigue) Audio Detection Dataset in Driver Training},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/da7d5a167dbbbb310a36b85b7af60f53?dataset_scene_cate_type=6},
  urldate={2026-02-04},
  keywords={dangerous driving audio detection dataset, driver training audio, fatigue detection audio data, traffic safety audio dataset},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches