[PR #836] [CLOSED] Add weboob to Web Crawling section #747

Closed
opened 2025-11-06 13:01:45 -06:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/836
Author: @hydrargyrum
Created: 2/21/2017
Status: Closed

Base: masterHead: patch-1


📝 Commits (1)

  • 74c4407 Add weboob to Web Crawling section

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

What is this Python project?

It's a framework for scraping HTML sites, and aggregating data from multiple sites from a same category (e.g. banking sites, news sites, video sites, etc.).
There are ready-made modules for popular websites and ready-apps to interact with them.
Think youtube-dl applied to other domains than video!

What's the difference between this Python project and similar ones?

  • It's possible to scrape new websites with declarative-style extraction rules
  • It provides a standardized API for categories of sites for dedicated tasks (e.g. banking, web forums, video sites, news sites, music lyrics sites, etc.)
    • Scraped websites are grouped in those categories
  • Scraped websites are grouped in categories for a dedicated task:
  • The project comes with many existing backends for real-life websites
  • It has an internal upgrade system

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/vinta/awesome-python/pull/836 **Author:** [@hydrargyrum](https://github.com/hydrargyrum) **Created:** 2/21/2017 **Status:** ❌ Closed **Base:** `master` ← **Head:** `patch-1` --- ### 📝 Commits (1) - [`74c4407`](https://github.com/vinta/awesome-python/commit/74c4407db6a086398c12b21a1500ee99fa63b27f) Add weboob to Web Crawling section ### 📊 Changes **1 file changed** (+1 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `README.md` (+1 -0) </details> ### 📄 Description ## What is this Python project? It's a framework for scraping HTML sites, and aggregating data from multiple sites from a same category (e.g. banking sites, news sites, video sites, etc.). There are ready-made modules for popular websites and ready-apps to interact with them. Think youtube-dl applied to other domains than video! ## What's the difference between this Python project and similar ones? * It's possible to scrape new websites with declarative-style extraction rules * It provides a standardized API for categories of sites for dedicated tasks (e.g. banking, web forums, video sites, news sites, music lyrics sites, etc.) * Scraped websites are grouped in those categories * Scraped websites are grouped in categories for a dedicated task: * The project comes with many existing backends for real-life websites * It has an internal upgrade system --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2025-11-06 13:01:45 -06:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/awesome-python#747