Recently, I stumbled across the promotion of an interesting talk organized by the Amsterdam FinTech week: “synthetic data, real opportunities.” Much to my disappointment, out of a 9-people panel, there wasn’t a single woman in the panel.

Synthetic data and privacy as research fields owes a lot to the work of pioneer women. For example, the research of Dr. Latanya Sweeney on re-identification has been critical, much as the research from Cynthia Dwork on differential privacy.

Today, synthetic data is no longer confined to the academics field, as proved by the 50+ tech companies actively developing solutions and services. In…


Are you looking for a synthetic data company? Or simply seeking an overview of this fast-growing market? Search no more. Here is a list of companies providing structured or unstructured synthetic data products and services.

I’ve divided the lists into 1) providers of synthetic data for structured data (tabular data) and 2) providers of synthetic data for unstructured data (image & video). I’ve added at the end comments on the ecosystem growth and investment trends. For more details on the structured/unstructured segmentation, check out my post “types of synthetic data and real-life examples”.

Disclaimer: I work at Statice, one of…


Key take-aways

  • Insurers have at their disposal a new ecosystem of data sources
  • Regulations, privacy concerns, and legacy technology impact insurers’ agility and competitiveness.
  • The data privacy and quality guarantees of privacy-preserving technologies make them an opportunity for insurers.
  • Privacy-preserving synthetic data can be used along the data value lifecycle to train fraud detection models or aggregate customer data for analysis.

Evolving practices, new demands from consumers, and increasing competitive pressure push insurance players to make the most out of the data they collect. …


Journey into the world of data privacy — Episode 07

In this series, I share the learnings of my journey into the field of data privacy.

Episode 1: How “anonymous” is anonymized data?
Episode 2:
PETs: the technologies organization should consider adopting
Episode 3:
Introduction to privacy-preserving synthetic data
Episode 4:
10 use-cases for privacy-preserving synthetic data
Episode 5:
Data privacy and protection techniques
Episode 6:
Types of synthetic data and real-life examples

January 28th, Data Privacy Day

On January 28th, the world will celebrate Data Privacy Day, a national day initiated in 2007 by the Council of Europe. This date marks the signing of…


Journey into the world of data privacy — Episode 06

In this series, I share the learnings of my journey into the field of data privacy.

Episode 1: How “anonymous” is anonymized data?
Episode 2:
PETs: the technologies organization should consider adopting
Episode 3:
Introduction to privacy-preserving synthetic data
Episode 4:
10 use-cases for privacy-preserving synthetic data
Episode 5:
Data privacy and protection techniques
Episode 7:
List of events and resources for Data Privacy day 2021

This post presents the different synthetic data types that currently exist: text, media (video, image, sound), and tabular synthetic data.

After a brief definition…


Strict data regulations and cumbersome data governance processes are causing innovation inertia in banks and financial institutions. Where data should drive product development and fuel analysis, we see slow and tedious processes preventing teams from accessing, sharing, and leveraging data. This post explores these challenges, as well as how to regain the ability to work with data safely and efficiently.

The digital transformation presupposes access to data

Data is central for financial institutions on the path to digital transformation. It fuels operational efficiency, helps enterprises build personalized customer experiences, and allows developing competitive products. …


Cet article reprend une partie des informations présentées dans différents postes en anglais publiés ici et sur le blog de Statice. Il répond aux points suivants :

Qu’est-ce que la donnée synthétique ? Définition, origine et typologies

Comment génère-t-on des données synthétiques ?

Quelles sont les applications de la donnée synthétique ?

La donnée synthétique comme outil d’anonymisation des données personnelles

Qu’est-ce que la donnée synthétique

La donnée synthétique, synthetic data en anglais, est une donnée générée artificiellement. Cette approche se différencie de la collecte et production de données “réelles”, par exemple la collecte de données utilisateurs ou de données de santé. …


If your are evaluating privacy-preserving synthetic data, three questions are usually important:

  • Does it solve my problem?
  • Does the technology fit my requirements?
  • What’s the Return on Investment (ROI) of implementing privacy-preserving synthetic data?

We already addressed the problem and application question in this story. This one focuses on the last item: what’s the added-value of synthetic data?

The costs of data inertia and insufficient privacy-preservation mechanisms

Data is critical to the operations of data-driven companies. Forrester reported insights-driven businesses growing at an average of more than 30% annually. However, there are a few obstacles in the way of “the data-driven enterprise”, especially when it comes to personal data…


Our team recently held a webinar on synthetic data in the financial industry. We discussed how synthetic data could actually represent an opportunity to tackle internal data restrictions and general slowness. We got interesting questions from the audience that we answer in this article:

  • How long would it take to create synthetic data, for a dataset of one million customer data records, for example?
  • From the perspective of complying with the GDPR, which tests do a data officer must do before releasing a synthetic dataset?
  • When generating synthetic data, how can we keep the statistical characteristics present in the original…

Journey into the world of data privacy — Episode 05

In this series, I share the learnings of my journey into the field of data privacy.

Episode 1: How “anonymous” is anonymized data?
Episode 2:
PETs: the technologies organization should consider adopting
Episode 3:
Introduction to privacy-preserving synthetic data
Episode 4:
10 use-cases for privacy-preserving synthetic data
Episode 6:
Types of synthetic data and real-life examples
Episode 7:
List of events and resources for Data Privacy day 2021

The economics, legal, and corporate implications of data privacy are now too strong to be ignored. In the last decades, different privacy-enhancing…

Elise Devaux

Tech enthusiast, digital marketing manager. Working at Statice, startup specialized in synthetic data for privacy-preserving data applications 👉 www.statice.ai

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store