[PR #2599] Add Crawlee for Python #1916

Open
opened 2025-11-06 13:25:47 -06:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/2599
Author: @souravjain540
Created: 7/24/2024
Status: 🔄 Open

Base: masterHead: patch-1


📝 Commits (1)

📊 Changes

1 file changed (+2 additions, -0 deletions)

View changed files

📝 README.md (+2 -0)

📄 Description

What is this Python project?

Crawlee for Python is a web scraping and browser automation library that quickly builds reliable scrapers in Python.

What's the difference between this Python project and similar ones?

Similar to Scrapy.

  • You can scrape in both HTTP and headless browser modes.
  • Use BeautifoulSoupCrawler to scrape static pages and PlaywrightCrawler to scrape dynamic JS-rendered websites.
  • No need to install any plugins or set up middleware.
  • Completely typed hint in Python.
  • work with windows as well.

Anyone who agrees with this pull request could submit an Approve review to it.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/vinta/awesome-python/pull/2599 **Author:** [@souravjain540](https://github.com/souravjain540) **Created:** 7/24/2024 **Status:** 🔄 Open **Base:** `master` ← **Head:** `patch-1` --- ### 📝 Commits (1) - [`4b9d569`](https://github.com/vinta/awesome-python/commit/4b9d56976e0a5842483355700991f157984424ea) Add Crawlee for Python ### 📊 Changes **1 file changed** (+2 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `README.md` (+2 -0) </details> ### 📄 Description ## What is this Python project? [Crawlee for Python](https://github.com/apify/crawlee-python/) is a web scraping and browser automation library that quickly builds reliable scrapers in Python. ## What's the difference between this Python project and similar ones? Similar to Scrapy. - You can scrape in both HTTP and headless browser modes. - Use BeautifoulSoupCrawler to scrape static pages and PlaywrightCrawler to scrape dynamic JS-rendered websites. - No need to install any plugins or set up middleware. - Completely typed hint in Python. - work with windows as well. Anyone who agrees with this pull request could submit an *Approve* review to it. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2025-11-06 13:25:47 -06:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/awesome-python#1916