MOBIUSI INC5cdbba1dd8c92fdba581aec0bcba902d.wav
| Field | Type | Description |
|---|---|---|
| file_name | string | File name |
| duration | string | Duration |
| audio_rate | string | Audio sample rate |
| audio_channel | string | Audio channel |
| speaker_gender | string | Indicates the gender of the speaker, such as male or female. |
| speaker_accent | string | Describes the type of accent of the speaker, such as American English or British English. |
| speech_speed | double | Measures the speed of speech, i.e., the number of words per second. |
| background_noise_level | double | Reflects the intensity of background noise in the audio, usually expressed in decibels. |
| speech_intelligibility | string | Assesses the clarity of the speech, including options like clear, medium, and unclear. |
| topic_category | string | Indicates the topic category of the academic speech, such as Science, Art, or History. |
| transcription_quality | string | Evaluation of the quality of the audio transcription, such as high, medium, low. |
| Authorization Type | CC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike) |
| Commercial Use | Requires exclusive subscription or authorization contract (monthly or per-invocation charging) |
| Privacy and Anonymization | No PII, no real company names, simulated scenarios follow industry standards |
| Compliance System | Compliant with China's Data Security Law / EU GDPR / supports enterprise data access logs |

Post a request and let data providers reach out to you.
@dataset{Mobiusi2026,
title={Academic Speech Chinese Speech Recognition Audio Dataset},
author={MOBIUSI INC},
year={2026},
url={https://www.mobiusi.com/datasets/c87d99d8ae60a439b392f0a0de0f7231},
urldate={2026-02-04},
keywords={Academic speech recognition, content media speech dataset, TTS audio data},
version={1.0}
}Using this in research? Please cite us.