[PR #954] [CLOSED] added hodor to "HTML Manipulation" #12222

Closed
opened 2026-04-27 01:44:58 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/954
Author: @cyriac
Created: 10/16/2017
Status: Closed

Base: master ← Head: patch-1


📝 Commits (1)

  • 4c0a439 added hodor to "HTML Manipulation"

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

What is this Python project?

Hodor is a configuration-driven wrapper on top of lxml and cssselect. It helps extract information based on the rules it is given. A rule can be based on either XPath or CSS.
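
To illustrate the configuration-driven idea, here is a minimal sketch. It is not Hodor's actual API: the rule format (`xpath`, `attr`, `many`), the `extract` helper, and the sample page are all hypothetical. It uses the standard library's `xml.etree.ElementTree`, which supports only a limited XPath subset, rather than lxml and cssselect, so CSS rules are not shown.

```python
import xml.etree.ElementTree as ET

# Hypothetical rule format (an assumption, not Hodor's schema):
# field name -> XPath expression, optional attribute to read,
# and whether to return every match or only the first.
CONFIG = {
    "title": {"xpath": ".//h1"},
    "links": {"xpath": ".//a", "attr": "href", "many": True},
}

# Well-formed sample markup; ElementTree cannot parse broken HTML.
PAGE = """<html><body>
<h1>Example page</h1>
<p><a href="/one">One</a> <a href="/two">Two</a></p>
</body></html>"""


def extract(markup, config):
    """Apply each rule in config to the markup and collect the results."""
    root = ET.fromstring(markup)
    result = {}
    for field, rule in config.items():
        nodes = root.findall(rule["xpath"])
        attr = rule.get("attr")
        # Read the named attribute if given, otherwise the node's text.
        values = [
            node.get(attr) if attr else "".join(node.itertext()).strip()
            for node in nodes
        ]
        if rule.get("many"):
            result[field] = values
        else:
            result[field] = values[0] if values else None
    return result


data = extract(PAGE, CONFIG)
print(data)
```

The point of the design is that adding a new field means adding one entry to the configuration, not writing new parsing code.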

What's the difference between this Python project and similar ones?

It is a simple-to-use wrapper that leverages lxml and cssselect to write simple scrapers from configuration, with built-in support for pagination, robots.txt, grouping results, and proxies. This helps you write web scrapers with minimal lines of code.

--

Anyone who agrees with this pull request could vote for it by adding a 👍 to it, and usually, the maintainer will merge it when votes reach 20.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2026-04-27 01:44:58 -05:00
Reference: github-starred/awesome-python#12222