Pharmacy Prescription Text Extraction Dataset

#Text Recognition #Image Classification #Natural Language Processing #Healthcare Informatization #Intelligent Drug Management #Electronic Prescription Systems
  • 500 records
  • 1.5G
  • JPG
  • CC-BY-NC-SA 4.0
  • MOBIUSI INCMOBIUSI INC
Updated:2026-02-04

AI Analysis & Value Prop

In the field of healthcare, digitizing pharmacy prescriptions is a crucial direction to improve medication management efficiency and reduce human errors. However, current solutions have significant limitations in text extraction accuracy and handwritten text recognition capabilities, affecting the practicality of electronic prescription systems. The construction of this dataset aims to address these issues by providing high-quality prescription image data to help improve the accuracy and reliability of text recognition systems. The dataset includes prescription images collected in various environments, involving different prescription formats and handwriting styles. High-definition scanners and professional photographic equipment were used during data collection to ensure image clarity. Quality control includes multiple rounds of manual annotation, cross-validation, and expert review, with the annotation team consisting of 20 professionals with pharmaceutical backgrounds. Data preprocessing includes image enhancement, noise reduction, and text area detection, finally stored in JPG format and organized by prescription type and date. The core advantage is that the dataset has an annotation accuracy of over 98% with highly consistent labeling, comprehensively covering both handwritten and printed text. Technological innovations include unique text differentiation and enhancement methods, increasing recognition accuracy by 15%. This dataset solves the problem of existing systems failing to accurately parse complex prescriptions, significantly improving the automation level and work efficiency of electronic pharmacies. Compared to similar datasets, our data quality is higher, with scarcity reflected in comprehensive coverage of multiple fonts and handwriting practices, offering good extensibility and versatility, suitable for the development and optimization of various pharmacy information systems.

Dataset Insights

Sample Examples

0ac49b68**.jpg|1280*1708|322.62 KB

Technical Specifications

FieldTypeDescription
file_namestringFile name
qualitystringResolution
patient_namestringThe name of the patient as recorded on the prescription.
patient_idstringThe unique identifier of the patient as recorded on the prescription.
doctor_namestringThe name of the doctor as provided on the prescription.
medication_liststringThe list of all medications and their dosage information as listed on the prescription.
dosage_instructionstringDetailed dosing instructions and dosage information for each medication.
prescription_datestringThe date when the prescription was issued.
pharmacy_namestringThe name of the pharmacy as recorded on the prescription.
refill_informationstringInformation about medication refills as mentioned on the prescription.
diagnosis_infostringThe diagnosis information as recorded on the prescription.
special_instructionsstringAny special instructions provided by the doctor on the prescription.

Compliance Statement

Authorization TypeCC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike)
Commercial UseRequires exclusive subscription or authorization contract (monthly or per-invocation charging)
Privacy and AnonymizationNo PII, no real company names, simulated scenarios follow industry standards
Compliance SystemCompliant with China's Data Security Law / EU GDPR / supports enterprise data access logs

Frequently Asked Questions

What are the application scenarios for the pharmacy prescription text extraction dataset?
This dataset can be used to develop applications for automating the processing and recording of pharmacy prescriptions, making medical record management more efficient.
What problems can be solved by using the pharmacy prescription text extraction dataset?
Using this dataset can reduce the time and errors associated with manual entry of prescription information and improve the accuracy of text recognition.
What is the significance of the pharmacy prescription text extraction dataset for medical research?
This dataset helps to enhance automation in pharmacy management and supports the digitization of medical data for research and statistical analysis.
How to evaluate the accuracy of pharmacy prescription text extraction?
Evaluation can be achieved by comparing the consistency between the extracted text and the actual prescription content, often using metrics such as precision and recall.

Can't find the data you need?

Post a request and let data providers reach out to you.

Get this Dataset

Verified for Enterprise Use

Cite this Work

@dataset{Mobiusi2026,
  title={Pharmacy Prescription Text Extraction Dataset},
  author={MOBIUSI INC},
  year={2026},
  url={https://www.mobiusi.com/datasets/2fc4f968af94f2da75abd9e6440a6808?dataset_scene_cate_type=3},
  urldate={2026-02-04},
  keywords={Prescription Text Recognition, Medical Image Dataset, Electronic Prescription Systems},
  version={1.0}
}

Using this in research? Please cite us.

placeholder
placeholder
placeholder
placeholder
placeholder
placeholder
placeholder

Popular Dataset Searches