MOBIUSI INC| Field | Type | Description |
|---|---|---|
| file_name | string | File name |
| quality | string | Resolution |
| language_type | string | The type of language contained in the image, such as English, Chinese, French, etc. |
| character_count | int | The total number of characters in the image. |
| text_position | string | The specific position of characters within an image, such as top, center, bottom, etc. |
| font_size | int | The font size of characters within an image. |
| font_type | string | The specific font type used in an image. |
| ocr_accuracy | float | The accuracy of optical character recognition, ranging from 0 to 1. |
| distortion_level | string | The clarity and distortion level of characters in an image, such as no distortion, slight distortion, severe distortion. |
| Authorization Type | CC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike) |
| Commercial Use | Requires exclusive subscription or authorization contract (monthly or per-invocation charging) |
| Privacy and Anonymization | No PII, no real company names, simulated scenarios follow industry standards |
| Compliance System | Compliant with China's Data Security Law / EU GDPR / supports enterprise data access logs |

Post a request and let data providers reach out to you.
@dataset{Mobiusi2025,
title={Multilingual Character Detection Dataset},
author={MOBIUSI INC},
year={2025},
url={https://www.mobiusi.com/datasets/1c2070e5f0f438d199da424998917cfc},
urldate={2025-08-28},
keywords={Multilingual Character Detection,Industrial Image Dataset,Quality Control in Exports,Character Recognition Dataset},
version={1.0}
}Using this in research? Please cite us.