[PR #3003] [MERGED] Add TabGAN - synthetic tabular data generation with GANs, Diffusion, and LLMs #6710

Closed
opened 2026-04-17 07:38:01 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/3003
Author: @Diyago
Created: 3/28/2026
Status: Merged
Merged: 3/29/2026
Merged by: @JinyangWang27

Base: masterHead: add-tabgan


📝 Commits (2)

  • 13b8410 Add TabGAN - synthetic tabular data generation library
  • fed92e7 Remove extra empty line, place TabGAN in alphabetical order

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

Add TabGAN — Synthetic Tabular Data Generation

TabGAN is a Python library for generating high-quality synthetic tabular data using multiple generative approaches through a unified API:

  • CTGAN (Conditional Tabular GAN) for mixed data types
  • ForestDiffusion (tree-based diffusion) for structured data
  • GReaT (Large Language Models) for semantic dependencies

Key Features

  • Unified API across GANs, Diffusion Models, and LLMs
  • Adversarial filtering ensures distribution consistency
  • Privacy metrics (DCR, NNDR, membership inference)
  • Constraint enforcement (range, uniqueness, formula, regex)
  • HTML quality reports with distribution comparisons
  • sklearn TabGANTransformer for Pipeline integration
  • 100K+ PyPI downloads, 115 tests, Apache 2.0

Paper: Tabular GANs for uneven distribution


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/vinta/awesome-python/pull/3003 **Author:** [@Diyago](https://github.com/Diyago) **Created:** 3/28/2026 **Status:** ✅ Merged **Merged:** 3/29/2026 **Merged by:** [@JinyangWang27](https://github.com/JinyangWang27) **Base:** `master` ← **Head:** `add-tabgan` --- ### 📝 Commits (2) - [`13b8410`](https://github.com/vinta/awesome-python/commit/13b84109e03fa38f10c757fab12d499176724634) Add TabGAN - synthetic tabular data generation library - [`fed92e7`](https://github.com/vinta/awesome-python/commit/fed92e79f39c84b76106abeab37d0787729e872f) Remove extra empty line, place TabGAN in alphabetical order ### 📊 Changes **1 file changed** (+1 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `README.md` (+1 -0) </details> ### 📄 Description ## Add [TabGAN](https://github.com/Diyago/Tabular-data-generation) — Synthetic Tabular Data Generation [TabGAN](https://github.com/Diyago/Tabular-data-generation) is a Python library for generating high-quality synthetic tabular data using multiple generative approaches through a unified API: - **CTGAN** (Conditional Tabular GAN) for mixed data types - **ForestDiffusion** (tree-based diffusion) for structured data - **GReaT** (Large Language Models) for semantic dependencies ### Key Features - Unified API across GANs, Diffusion Models, and LLMs - Adversarial filtering ensures distribution consistency - Privacy metrics (DCR, NNDR, membership inference) - Constraint enforcement (range, uniqueness, formula, regex) - HTML quality reports with distribution comparisons - sklearn `TabGANTransformer` for Pipeline integration - 100K+ PyPI downloads, 115 tests, Apache 2.0 *Paper: [Tabular GANs for uneven distribution](https://arxiv.org/abs/2010.00638)* --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-17 07:38:01 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/awesome-python#6710