Audio Dataset for Abnormal Sound Extraction in Open Office Environment

#Information extraction #anomaly detection #classification #Acoustic analysis #intelligent security monitoring
  • 500 hours
  • 1.5G
  • WAV
  • CC-BY-NC-SA 4.0
  • MOBIUSI INC
Updated: 2026-02-04

AI Analysis & Value Prop

In open office environments, detecting abnormal sounds matters for both security and work efficiency. The acoustic environment is complex and the sound types are diverse, which makes traditional monitoring solutions ineffective: existing approaches are limited in accuracy and real-time capability and struggle with sudden, abnormal acoustic events. This dataset targets both needs, accurate sound information extraction and real-time monitoring. Data is collected with directional microphones in real office environments and covers various sudden sounds that occur during working hours. Quality control combines multiple rounds of manual annotation with machine verification to ensure data consistency and high precision. The annotation team of more than 20 people consists of experienced acoustic engineers and data scientists. Preprocessing applies noise reduction filters and feature enhancement techniques. Audio is stored in WAV format and organized by sound event type.
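Because the audio is stored as WAV, a downloaded file can be sanity-checked with Python's standard `wave` module before any modeling. The sketch below is not part of the dataset's tooling. It assumes 16-bit PCM samples (the listing does not state the bit depth), and the function name `wav_summary` is hypothetical.

```python
import math
import struct
import wave


def wav_summary(path):
    """Return header facts and an RMS level for a 16-bit PCM WAV file.

    Assumption: samples are 16-bit PCM; other widths are rejected.
    """
    with wave.open(path, "rb") as wf:
        channels = wf.getnchannels()
        sample_rate = wf.getframerate()
        sample_width = wf.getsampwidth()  # bytes per sample
        n_frames = wf.getnframes()
        raw = wf.readframes(n_frames)

    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM")

    # Unpack little-endian signed 16-bit samples (channels interleaved).
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    if samples:
        rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    else:
        rms = 0.0

    return {
        "channels": channels,
        "sample_rate": sample_rate,
        "duration_s": n_frames / sample_rate,
        "rms": rms,
    }
```

Calling `wav_summary` on one of the sample files listed under Sample Examples would give its channel count, sample rate, duration in seconds, and a raw RMS amplitude. The RMS here is in sample units, not decibels, and is not claimed to match the dataset's `sound_intensity` field.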

Dataset Insights

Sample Examples

  • 32e14fee5eb63cdc26153634b09dbfe3.wav
  • 67f98a8021cfc6bc03c92196d9d83919.wav

Technical Specifications

Field                   Type     Description
file_name               string   File name
duration                string   Duration
audio_rate              string   Audio sample rate
audio_channel           string   Audio channel
sound_type              string   The specific type of abnormal sound occurring in the audio, such as: knocking, screaming, machine noise, etc.
sound_intensity         float    The loudness or volume level of the abnormal sound in the audio.
sound_duration          float    The duration for which the abnormal sound lasts in the audio, measured in seconds.
sound_frequency_range   string   The frequency range of the abnormal sound in the audio, typically expressed in Hertz (Hz).
background_noise_level  float    The average loudness or volume level of the environmental background noise in the audio.
disturbance_index       integer  A quantified index based on the intensity of impact of the abnormal sound on the office environment.
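The field list above maps naturally onto a typed record. The following is a minimal sketch, assuming the metadata arrives as rows of strings keyed by field name (for example from `csv.DictReader`); the listing does not say how the metadata is delivered, so that format, the class name `SoundRecord`, and the helpers `parse_record` and `filter_by_type` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SoundRecord:
    # Field names and types follow the specification table.
    file_name: str
    duration: str
    audio_rate: str
    audio_channel: str
    sound_type: str
    sound_intensity: float
    sound_duration: float  # seconds
    sound_frequency_range: str
    background_noise_level: float
    disturbance_index: int


def parse_record(row):
    """Convert a mapping of strings into a typed SoundRecord.

    Assumption: every field is present and numeric fields parse cleanly.
    """
    return SoundRecord(
        file_name=row["file_name"],
        duration=row["duration"],
        audio_rate=row["audio_rate"],
        audio_channel=row["audio_channel"],
        sound_type=row["sound_type"],
        sound_intensity=float(row["sound_intensity"]),
        sound_duration=float(row["sound_duration"]),
        sound_frequency_range=row["sound_frequency_range"],
        background_noise_level=float(row["background_noise_level"]),
        disturbance_index=int(row["disturbance_index"]),
    )


def filter_by_type(records, sound_type, min_disturbance=0):
    """Keep records of one sound type at or above a disturbance threshold."""
    return [
        r
        for r in records
        if r.sound_type == sound_type and r.disturbance_index >= min_disturbance
    ]
```

Keeping `duration`, `audio_rate`, and `audio_channel` as strings mirrors the table rather than guessing at their units or encoding; a real loader would likely normalize them once the actual value formats are known.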

Compliance Statement

Authorization Type: CC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial Use: Requires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and Anonymization: No PII, no real company names, simulated scenarios follow industry standards
Compliance System: Compliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What is the Open Office Environment Anomalous Sound Extraction Audio Dataset?
This is an audio dataset containing anomalous sounds from open office environments, designed for automatic sound information extraction and anomaly detection.
What application areas is this dataset suitable for?
This dataset is suitable for automatic sound information extraction and anomaly detection in general daily environments.
What are the advantages of using this dataset for anomaly detection?
Using this dataset for anomaly detection can enhance the ability to identify and respond to anomalous sounds in open office environments, thereby improving safety and efficiency.
What kind of data support can this type of dataset provide for research?
This dataset provides real audio samples from open office environments, supporting researchers in developing and testing anomalous sound detection algorithms.


Cite this Work

@dataset{Mobiusi2026,
  title={Audio Dataset for Abnormal Sound Extraction in Open Office Environment},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/589eb44e2f501a43d3131832d731da80?cate=1},
  urldate={2026-02-04},
  keywords={abnormal sound dataset, open office environment audio, sound information extraction},
  version={1.0}
}

Using this in research? Please cite us.
