Public Restroom Guide Chinese Voice Dataset

#voice recognition #natural language processing #acoustic environment analysis #smart home #public facility management #human-computer interaction
  • 500 hours
  • 1.2G
  • WAV
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

In the process of modern urbanization, public restrooms, as infrastructure, have become an important reflection of urban civilization due to their service convenience and user experience. However, due to the lack of language guidance, users often face difficulties in locating and unclear usage instructions, affecting the user experience. Existing voice guidance systems still lack robustness, recognition accuracy, and interaction naturalness in public environments, failing to fully meet user needs. This dataset aims to provide high-quality voice samples for training voice recognition systems to enhance the intelligence level of public facility guidance.The audio data in this dataset is collected using professional recording equipment in various real simulated scenarios, covering speakers of different genders, ages, and accents. Meanwhile, environmental noise is used to enhance diversity and naturalness. To ensure data quality, multiple rounds of annotation and consistency checks are combined, and reviewed by an expert team with backgrounds in acoustics and linguistics. Data preprocessing steps include denoising, segmentation, and standardization of audio signals, finally stored in WAV format, and systematically organized by language and scenario.The core advantages of this dataset are reflected in its annotation accuracy of over 98%; strict control over the consistency and integrity of voice samples; use of new data augmentation technologies such as acoustic feature transformation and multi-path enhancement to improve system adaptability. By enhancing the accuracy of voice recognition guidance systems, it solves actual difficulties in public facility navigation. Compared with similar datasets, it provides richer context and acoustic feature data. Its rarity lies in covering a wide range of user group voice samples, with high universality and scalability, providing a reference standard for other similar voice applications.

Dataset Insights

Sample Examples

65b61544c8c941be3426d568ecc15380.wav

  • 65b61544c8c941be3426d568ecc15380.wav
    00:00

Technical Specifications

FieldTypeDescription
file_namestringFile name
durationstringDuration
audio_ratestringAudio sample rate
audio_channelstringAudio channel
languagestringThe language type in the audio, such as Mandarin, English, etc.
gender_of_speakerstringThe gender of the speaker in the recording, which could be male or female.
age_group_of_speakerstringThe age group of the speaker in the recording, such as child, youth, middle-aged, elderly.
accentstringAccent characteristics in the recording, such as American English, British English, etc.
environment_noise_levelstringThe level of environmental noise in the recording, such as low noise, medium noise, high noise.
speaking_speedstringThe speaking speed in the recording, such as slow, normal, fast.
scripted_or_unscriptedstringDetermines whether the recording is scripted or spontaneous.
dialogue_or_monologuestringWhether the content of the recording is a dialogue or a monologue.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What is the Public Restroom Guidance Speech Dataset?
The Public Restroom Guidance Speech Dataset is a high-quality audio dataset designed to improve user experience in public facilities.
What is the data modality of the Public Restroom Guidance Speech Dataset?
The data modality of the dataset is audio.
Which industry is the Public Restroom Guidance Speech Dataset applicable to?
This dataset is applicable to the general daily use industry.
What is the main purpose of the Public Restroom Guidance Speech Dataset?
The main purpose of this dataset is to improve user experience in public facilities.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Public Restroom Guide Chinese Voice Dataset},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/e79a747ca52ab48654e97812e895875c?dataset_scene_cate_type=2},
  urldate={2026-02-04},
  keywords={public restroom voice guidance, voice recognition training data, public facility intelligence},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches