Waiting Room Prompt Chinese Speech Recognition Audio Dataset

#Speech Recognition #Natural Language Processing #Speech to Text #Hospital Speech Systems #Health Consultation #Intelligent Voice Assistant
  • 500 hours
  • 1.6G
  • MP3
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

In modern hospitals, managing waiting room information and enhancing patient experience remains a challenge. Existing speech systems are mostly limited to simple dialogues or fixed responses, making it hard to handle complex oral inputs. The construction of the Waiting Room Prompt Speech Recognition Audio Dataset aims to provide more accurate speech recognition models and offer more efficient waiting room management solutions for hospitals and clinics. Data is collected in real hospital environments using various recording devices, including directional microphones and portable recording equipment, to ensure diversity in background noise. The data undergoes multi-round annotation and expert review, with an annotation team comprising linguistics experts and medical practitioners, exceeding 20 members. The data is subjected to noise filtering, speech enhancement, and other preprocessing steps before being stored in MP3 format, organized by speaker, scenario, and other labels.The core advantage of this dataset lies in its high-quality annotation accuracy, achieving over 95% consistency and completeness. It innovatively employs voice noise filtering and enhancement technology to accurately simulate real hospital environment usage conditions. In terms of application value, it can reduce the workload of medical assistants and improve patient experience; compared to similar datasets, it offers unique support for professional medical terms and dialects, with its scarcity reflecting in the difficult-to-obtain real hospital environment recordings. It is also suitable for other high-noise environments, such as large customer service centers, and provides good scalability and versatility.

Dataset Insights

Sample Examples

f78c56990ae449a28499823db438136b.wav

  • f78c56990ae449a28499823db438136b.wav
    00:00

Technical Specifications

FieldTypeDescription
file_namestringFile name
durationstringDuration
audio_ratestringAudio sample rate
audio_channelstringAudio channel
language_spokenstringThe language spoken in the audio.
speaker_genderstringThe gender of the speaker in the audio.
speaker_age_groupstringThe age group of the speaker in the audio, e.g., child, adult, senior.
accent_typestringThe type of accent exhibited by the speaker in the audio.
background_noise_levelstringThe level of background noise in the audio, e.g., high, medium, low.
speech_to_noise_ratiostringThe degree of speech-to-noise ratio in the audio.
dialogue_typestringThe type of dialogue in the audio, e.g., multi-party conversation, monologue.
emotional_tonestringThe emotional tone of the speaker in the audio, e.g., angry, calm, happy.
speech_pacestringThe speaking pace of the speaker in the audio, e.g., slow, medium, fast.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What is the waiting room prompt speech recognition audio dataset?
The waiting room prompt speech recognition audio dataset is a collection of audio data aimed at improving speech recognition efficiency in healthcare settings.
What types of data are included in the waiting room prompt speech recognition audio dataset?
The dataset includes audio data relevant to healthcare settings to enhance the accuracy and efficiency of speech recognition.
How is this dataset applied in the healthcare industry?
By improving speech recognition, the waiting room prompt speech recognition audio dataset can help optimize communication processes in hospitals and clinics, enhancing patient experience.
What are the benefits of using the waiting room prompt speech recognition audio dataset?
Using this dataset can improve the accuracy of speech recognition systems in medical contexts, reducing misunderstandings and thus increasing service efficiency.
Which fields can benefit from the waiting room prompt speech recognition audio dataset?
Any field related to healthcare and patient communication can benefit from this dataset, including hospitals, clinics, and health service providers.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Waiting Room Prompt Chinese Speech Recognition Audio Dataset},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/0d21586f0e2ff59b973e6e714acd10e8},
  urldate={2026-02-04},
  keywords={Medical Speech Recognition Dataset, Waiting Room Audio Data, Health Consultation Speech},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches