Skip to content

gianlucatruda/TableDiffusion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TableDiffusion

arxivbadge githubbadge blogbadge

This is the supporting code for the paper Generating tabular datasets under differential privacy.

Please check out a quick overview on my blog.

TableDiffusion is a project focused on providing differentially-private generative models for sensitive tabular data. The goal is to enable the synthesis of data that maintains the statistical properties of the original dataset while ensuring the privacy of individuals' information.

The most notable model from this work is TableDiffusion, the first differentially-private diffusion model for tabular data. See tablediffusion/models/table_diffusion.py

⚠️ Disclaimer: This codebase is intended for research purposes only and is not ready for production use. The current implementation may not preserve privacy guarantees due to seed and sampler settings that are not suitable for a production environment.

Paper explanation on YouTube

https://youtu.be/2QRrGWoXOb4

Paper presentation on YouTube

Citing this work

Truda, Gianluca. "Generating tabular datasets under differential privacy." arXiv preprint arXiv:2308.14784 (2023).

@article{truda2023generating,
  title={Generating tabular datasets under differential privacy},
  author={Truda, Gianluca},
  journal={arXiv preprint arXiv:2308.14784},
  year={2023}
}