Decision Intelligence

What is synthetic data?

A field guide to the various species of fake data: Part 1

Cassie Kozyrkov
Mar 24, 2025

Synthetic data is data that you’re planning to treat as if it came from the place/group you wish it came from. (It didn’t.)

Synthetic data is, to put it bluntly, fake data. As in, data that’s not actually from the population you’re interested in. (Population is a technical term in data science, which I explain here in blog form and here in video form.)

Sy…
