
Mirage
How It Works
Mock and Synthetic data generation
Mirage offers mock and synthetic data generation to enable agencies to generate alternative forms of data when the original dataset is too sensitive to use or share, or where more data is needed for software testing or machine learning applications.
- Mock data is created using rule-based approaches, designed to mimic the structure and format of real-world data. It does not require any input data so does not pose any privacy risk, but the resultant data is not highly realistic. Thus, it is more suited for scenarios requiring predictable and repeatable datasets.
Start generating mock data with our quick start guide.
(Available through both Web UI and API.) - Synthetic data is generated using statistical models and gen AI techniques that learn patterns and relationships in real-world data. This requires the user to input data to train the data generation model. This is suitable for scenarios where statistical patterns in an existing dataset need to be retained, or if you require a more realistic dataset.
Start generating synthetic data with our quick start guide.
(Available through Web UI, API coming soon.)
Last updated 28 Apr 2026
Was this article useful?
Did this page help you? - Yes
Thanks for letting us know that this page is useful for you!
If you've got a moment, please tell us what we did right so that we can do more of it.
Did this page help you? - No
Thanks for letting us know that this page still needs work to be done.
If you've got a moment, please tell us how we can make this page better.
Sent. Thank you for the feedback!

Realistic data, safely generated for government innovation.