[PR #2772] Add Chinese-Elite #2085

New Issue

GiteaMirror · 2025-11-06T13:29:10-06:00

GiteaMirror commented

2025-11-06 13:29:10 -06:00

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/2772
Author: @anonym-g
Created: 10/16/2025
Status: 🔄 Open

Base: master ← Head: master

📝 Commits (1)

1ef6c24 Update README.md

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

Hi, I would like to add the "Chinese-Elite" project to the Miscellaneous section. It's a full-featured open-source tool built with a Python backend that doesn't fit neatly into the library-focused categories.

Chinese-Elite - A tool that automatically maps and analyzes relationship networks of Chinese elites by parsing public data using LLMs, in Python.

What is this Python project?

"Chinese-Elite" is an end-to-end tool with a backend written entirely in Python.

It uses Large Language Models to automatically extract entities and relationships from unstructured text (e.g., Wikipedia) to build a knowledge graph.

The project features a fully automated data pipeline (run_pipeline.py and GitHub Actions) that handles data fetching, graph merging, and cleaning.

The Python backend also generates all data files required by its interactive visualization frontend.

What's the difference between this Python project and similar ones?

It stands apart from typical data projects by its methodology and scope:

vs. Web Scraping Projects: It performs semantic extraction using an LLM to understand context and complex relationships, rather than relying on brittle, rule-based scrapers (e.g., CSS selectors).
vs. Standalone Scripts/Notebooks: It is engineered as a fully automated, self-updating pipeline via GitHub Actions for continuous operation, unlike manual, one-off analysis scripts.
vs. Data Exploration Tools: Its primary role is to generate a structured graph dataset from raw text. Tools like Datasette are used to explore existing structured data, so "Chinese-Elite" operates a step earlier in the data lifecycle.

Anyone who agrees with this pull request could submit an Approve review to it.

_{🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.}

## 📋 Pull Request Information **Original PR:** https://github.com/vinta/awesome-python/pull/2772 **Author:** [@anonym-g](https://github.com/anonym-g) **Created:** 10/16/2025 **Status:** 🔄 Open **Base:** `master` ← **Head:** `master` --- ### 📝 Commits (1) - [`1ef6c24`](https://github.com/vinta/awesome-python/commit/1ef6c24f9520491129cc92ef515dbcb58620cd6a) Update README.md ### 📊 Changes **1 file changed** (+1 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `README.md` (+1 -0) </details> ### 📄 Description Hi, I would like to add the "Chinese-Elite" project to the `Miscellaneous` section. It's a full-featured open-source tool built with a Python backend that doesn't fit neatly into the library-focused categories. > [Chinese-Elite](https://github.com/anonym-g/Chinese-Elite) - A tool that automatically maps and analyzes relationship networks of Chinese elites by parsing public data using LLMs, in Python. ## What is this Python project? "Chinese-Elite" is an end-to-end tool with a backend written entirely in Python. It uses Large Language Models to automatically extract entities and relationships from unstructured text (e.g., Wikipedia) to build a knowledge graph. The project features a fully automated data pipeline (`run_pipeline.py` and GitHub Actions) that handles data fetching, graph merging, and cleaning. The Python backend also generates all data files required by its interactive visualization frontend. ## What's the difference between this Python project and similar ones? It stands apart from typical data projects by its methodology and scope: * **vs. Web Scraping Projects:** It performs **semantic extraction** using an LLM to understand context and complex relationships, rather than relying on brittle, rule-based scrapers (e.g., CSS selectors). * **vs. Standalone Scripts/Notebooks:** It is engineered as a **fully automated, self-updating pipeline** via GitHub Actions for continuous operation, unlike manual, one-off analysis scripts. * **vs. Data Exploration Tools:** Its primary role is to **generate** a structured graph dataset from raw text. Tools like Datasette are used to explore *existing* structured data, so "Chinese-Elite" operates a step earlier in the data lifecycle. --- Anyone who agrees with this pull request could submit an *Approve* review to it. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>

GiteaMirror added the pull-request label 2025-11-06 13:29:10 -06:00

Sign in to join this conversation.

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/awesome-python#2085