Blog
Community

Synthetic Data in 2024: The Year In Review
Check out how 2024 has been the biggest year for synthetic data. Google, Apple, Meta, OpenAI all emphasized the importance of using synthetic data in their AI model development. While Snowflake, Databricks, DataCebo and several others released new tooling required to create synthetic data.

Why we changed the SDV license to BSL (and how that impacts our users)
The Business Source License allows the SDV team to continue innovating while also enabling many types of usage.

3 user-centric growth strategies for open source
Our open source grew faster when we adopted a user-centric mindset. Here are 3 strategies we used along the way.

The Most Important Open Source Demographic That No One Thinks About
How we define a user in 2023 to build a community around synthetic data.

Your Feedback in Action, Part 2: Data Workflow
After thousands of downloads, see how the synthetic data workflow in the SDV has evolved based on feedback from users.

Your Feedback in Action, Part 1: Data Models
After thousands of downloads, see how SDV's machine learning models have evolved based on feedback from users.

Meet the Synthetic Data Vault
Welcome to the SDV Blog! The SDV is a comprehensive, open source software for synthetic data generation. Join our growing community as we create an ecosystem to solve real world problems!
Become part of our community
Join our Slack community to discuss your synthetic data projects and connect with other users.
Join our SlackExplore our blog
Read our newest insights about synthetic data, updates on our products, and successful use cases.
Read our blog