Neha Patki and Romain Palazzo
28 August, 2023
See how we improved synthetic data without modifying the generative algorithms or affecting performance.
Neha Patki and Arnav Modi
19 October, 2023
It is said that Generative AI requires large amounts of data for training. We've tested the SDV and found that a smaller subsample is all you need.
02 May, 2023
Can you anonymize PII without sacrificing usability? Explore the latest techniques in anonymization.
14 April, 2023
The Business Source License allows the SDV team to continue innovating while also enabling many types of usage.
08 March, 2023
The SDV 1.0 library has formalized key principles for generative AI. See what's new and enrich your synthetic data project.
27 February, 2023
Use the SDV Flights Synthesizer to simulate disruptive scenarios and improve software resilience.
09 February, 2023
A 2022 year-end review of the SDV and what you can expect in 2023.
07 February, 2023
Understand categorical data to help you create higher quality synthetic data
26 January, 2023
Our open source grew faster when we adopted a user-centric mindset. Here are 3 strategies we used along the way.
23 January, 2023
How we define a user in 2023 to build a community around synthetic data.
10 January, 2023
Imbalanced data can prevent your projects from succeeding. Will synthetic data work? Explore the rationale behind label balancing.
Santiago Gomez Paz
20 December, 2022
It can be difficult to verify the progress that a GAN is making. What if we combined it with easily interpretable metrics and visualizations?
Let's put synthetic data to work.