Key Information Extraction Dataset for Doctor-Patient Conversations in Consultation Rooms

#Speech recognition #natural language processing #information extraction #Medical dialogue analysis #intelligent medical assistant #clinical data mining
  • 500 hours
  • 1.4G
  • WAV
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

In the medical field, doctor-patient conversations contain a wealth of key information that is crucial for improving diagnostic efficiency and patient satisfaction. However, current technological solutions often lack accuracy and reliability when identifying and extracting this key information, especially in noisy environments and when dealing with multilingual and medical terminologies. This dataset aims to improve key information extraction capabilities in audio dialogues, meeting the specific needs of the healthcare industry. Data collection was conducted using professional recording equipment in consultation room environments to ensure realistic scenario simulation. Quality control involved a three-round annotation process, consistency checks, and medical expert review, with the annotation team consisting of professionals with medical backgrounds. Data preprocessing includes audio denoising, segmentation, and transcription, with the final data stored in WAV format and organized in JSON format as transcribed dialogue texts.

Dataset Insights

Sample Examples

4d50e757efe8ecccfeadf913b490ef1e.wav

  • 4d50e757efe8ecccfeadf913b490ef1e.wav
    00:00
  • a0e6b37c0060a8887eb18ec3bb6bfbd1.wav
    00:00

Technical Specifications

FieldTypeDescription
file_namestringFile name
durationstringDuration
audio_ratestringAudio sample rate
audio_channelstringAudio channel
dialogue_typestringIndicates whether the dialogue takes place during an initial consultation, a follow-up, or a subsequent visit.
speaker_rolestringIdentifies whether the speaker is the doctor or the patient.
emotionstringEmotion detected from the speaker's audio, such as calm, anxiety, anger, etc.
languagestringThe language used in the dialogue, such as Chinese, English, etc.
speech_intelligibilitystringThe clarity of speech, such as clear, mumbling, etc.
keywordsstringMedical keywords mentioned in the dialogue, such as symptoms, drug names, etc.
duration_silencefloatTotal duration of silence periods in the dialogue (in seconds).
speech_ratefloatAverage speech rate of the speaker (words per second).
dialogue_turnsintThe number of turns in the dialogue, indicating the number of times speakers alternately speak.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What is the primary purpose of this dataset?
The dataset is used to enhance the information extraction capabilities of medical dialogue systems.
For which fields of study is this dataset applicable?
This dataset is applicable to research in the healthcare sector, particularly in improving the information extraction capabilities of dialogue systems.
What is the modality of the dataset?
The modality of the dataset is audio.
How can this dataset be used to support the development of medical dialogue systems?
By analyzing and extracting key information from doctor-patient dialogues, this dataset can aid in developing smarter medical dialogue systems.
What unique advantages does this dataset offer?
This dataset focuses on extracting critical details from medical dialogues, enhancing the precision and efficiency of information extraction.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Key Information Extraction Dataset for Doctor-Patient Conversations in Consultation Rooms},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/a2968b707540461e2930ef1f4278b05b},
  urldate={2026-02-04},
  keywords={doctor-patient conversation dataset, medical audio data, key information extraction audio},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches