[PR #2274] [CLOSED] add Pandas-Profiling #6139

Closed
opened 2026-04-17 07:20:21 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/2274
Author: @aquemy
Created: 12/16/2022
Status: Closed

Base: masterHead: master


📝 Commits (1)

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

What is this Python project?

pandas-profiling generates profile reports from a pandas DataFrame.
The reports are like a richer version of DataFrame.describe with a lot of information that any data scientist is looking for: distribution description, missing values, type inference, zero, high correlation, skewness, etc.

The reports can be generated in PDF or HTML. They contain alerts that give a clear information on the issues with a given dataset.

The library can also generate comparison reports between two datasets, so it can be used to compare two versions of the same dataset, or some new data once a model is in production (sort of model monitoring but for data).

It is well maintained with active development, has 10k stars on GitHub and it is used by many companies, including large ones.

What's the difference between this Python project and similar ones?

As far as I know, there is no alternative library.

--

Anyone who agrees with this pull request could submit an Approve review to it.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/vinta/awesome-python/pull/2274 **Author:** [@aquemy](https://github.com/aquemy) **Created:** 12/16/2022 **Status:** ❌ Closed **Base:** `master` ← **Head:** `master` --- ### 📝 Commits (1) - [`b19df8e`](https://github.com/vinta/awesome-python/commit/b19df8e18cd16caf14afb2030bb7ed8f5e88f29f) add Pandas-Profiling ### 📊 Changes **1 file changed** (+1 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `README.md` (+1 -0) </details> ### 📄 Description ## What is this Python project? `pandas-profiling` generates profile reports from a `pandas` DataFrame. The reports are like a richer version of `DataFrame.describe` with a lot of information that any data scientist is looking for: distribution description, missing values, type inference, zero, high correlation, skewness, etc. The reports can be generated in PDF or HTML. They contain alerts that give a clear information on the issues with a given dataset. The library can also generate comparison reports between two datasets, so it can be used to compare two versions of the same dataset, or some new data once a model is in production (sort of model monitoring but for data). It is well maintained with active development, has 10k stars on GitHub and it is used by many companies, including large ones. ## What's the difference between this Python project and similar ones? As far as I know, there is no alternative library. -- Anyone who agrees with this pull request could submit an *Approve* review to it. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-17 07:20:21 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/awesome-python#6139