MOBIUSI INC

| Field | Type | Description |
|---|---|---|
| file_name | string | File name |
| duration | string | Duration |
| quality | string | Resolution |
| code_language | string | The programming language used in the code file. |
| lines_of_code | int | The total number of lines in the code file. |
| code_complexity | string | A measure of the complexity of the code (e.g., cyclomatic complexity). |
| number_of_functions | int | The number of functions or methods defined in the code file. |
| number_of_classes | int | The number of classes defined in the code file. |
| dependencies | string | A list of external libraries or modules that the code file depends on. |
| code_comments_ratio | float | The ratio of comment lines to the total number of lines in the code file. |
| code_style | string | The coding style or standard that the code file adheres to (e.g., PEP 8). |

| Item | Details |
|---|---|
| Authorization Type | CC-BY-NC-SA 4.0 (Attribution–NonCommercial–ShareAlike) |
| Commercial Use | Requires exclusive subscription or authorization contract (monthly or per-invocation charging) |
| Privacy and Anonymization | No PII, no real company names, simulated scenarios follow industry standards |
| Compliance System | Compliant with China's Data Security Law / EU GDPR / supports enterprise data access logs |
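
As a rough illustration of the field table, the sketch below models one record as a Python dataclass, using the field names and types listed there. It is not the dataset's official loader: the class name `CodeFileRecord`, the helpers `comment_ratio` and `validate`, and all sample values are hypothetical, and the way comment lines are counted (lines starting with `#`, blank lines included in the total) is an assumption, since the table does not define it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CodeFileRecord:
    """One dataset record; field names and types follow the field table."""
    file_name: str
    duration: str
    quality: str
    code_language: str
    lines_of_code: int
    code_complexity: str
    number_of_functions: int
    number_of_classes: int
    dependencies: str  # typed as string in the table; the delimiter is not specified
    code_comments_ratio: float
    code_style: str


def comment_ratio(source: str, comment_prefix: str = "#") -> float:
    """Ratio of comment lines to total lines (hypothetical helper).

    Assumption: a comment line is any line whose first non-blank
    characters are `comment_prefix`; blank lines count toward the total.
    """
    lines = source.splitlines()
    if not lines:
        return 0.0
    comments = sum(1 for line in lines if line.lstrip().startswith(comment_prefix))
    return comments / len(lines)


def validate(record: CodeFileRecord) -> List[str]:
    """Return a list of basic consistency problems (empty if none found)."""
    problems = []
    for name in ("lines_of_code", "number_of_functions", "number_of_classes"):
        if getattr(record, name) < 0:
            problems.append(f"{name} must be non-negative")
    if not 0.0 <= record.code_comments_ratio <= 1.0:
        problems.append("code_comments_ratio must be between 0 and 1")
    return problems


# Made-up sample values, for illustration only.
source = "# add two numbers\ndef add(a, b):\n    return a + b\n"
record = CodeFileRecord(
    file_name="example.py",
    duration="n/a",
    quality="n/a",
    code_language="Python",
    lines_of_code=len(source.splitlines()),
    code_complexity="1",
    number_of_functions=1,
    number_of_classes=0,
    dependencies="",
    code_comments_ratio=comment_ratio(source),
    code_style="PEP 8",
)
print(validate(record))
```

A check like `validate` can be run over records before use to catch negative counts or ratios outside the 0 to 1 range.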

@dataset{Mobiusi2026,
title={Code Generation Benchmark Dataset},
author={MOBIUSI INC},
year={2026},
url={https://www.mobiusi.com/datasets/e4992ef4823435a23d0d533bf3a013a7?dataset_scene_id=19},
urldate={2026-02-04},
keywords={code generation, benchmark dataset, information technology},
version={1.0}
}

Using this in research? Please cite us.