[PR #1748] Adding hub to the awesome list #1518

Open
opened 2025-11-06 13:17:53 -06:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/vinta/awesome-python/pull/1748
Author: @sparkingdark
Created: 4/1/2021
Status: 🔄 Open

Base: master ← Head: adding-hubpackage


📝 Commits (1)

  • c7fd752 Adding hub to the awesome list

📊 Changes

1 file changed (+1 additions, -0 deletions)

View changed files

📝 README.md (+1 -0)

📄 Description

What is this Python project?

Hub - Fastest unstructured dataset management for TensorFlow/PyTorch by activeloop.ai. Stream & version-control data. Converts large data into a single numpy-like array on the cloud, accessible on any machine.

Describe features.

  • Store and retrieve large datasets with version control
  • Collaborate as in Google Docs: multiple data scientists can work on the same data in sync, without interruptions
  • Access from multiple machines simultaneously
  • Deploy anywhere - locally, on Google Cloud, S3, Azure, and Activeloop (by default - and for free!)
  • Integrate with your ML tools like NumPy, Dask, Ray, PyTorch, or TensorFlow
  • Create arrays as big as you want. You can store images as big as 100k by 100k!
  • Keep the shape of each sample dynamic. This way you can store small and big arrays in one array.
  • Visualize any slice of the data in a matter of seconds without redundant manipulations
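
To make the dynamic-shape point concrete, here is a minimal pure-Python sketch of the general idea of keeping differently shaped samples in one container. It is not Hub's API or implementation: `RaggedStore`, `flat`, and `index` are hypothetical names chosen for illustration, and the sketch assumes a simple flat buffer plus a per-sample offset and shape record.

```python
from math import prod


class RaggedStore:
    """Toy container: variable-shaped samples packed into one flat buffer.

    Hypothetical illustration only; this is not how Hub is implemented.
    """

    def __init__(self):
        self.flat = []    # all sample values, concatenated in insertion order
        self.index = []   # one (offset, shape) record per sample

    def append(self, values, shape):
        # The declared shape must account for every value in the sample.
        if len(values) != prod(shape):
            raise ValueError("shape does not match number of values")
        self.index.append((len(self.flat), tuple(shape)))
        self.flat.extend(values)

    def __len__(self):
        return len(self.index)

    def __getitem__(self, i):
        # Slice the sample's values back out of the shared buffer.
        offset, shape = self.index[i]
        return self.flat[offset:offset + prod(shape)], shape


store = RaggedStore()
store.append([1, 2, 3, 4], (2, 2))       # a small 2x2 sample
store.append(list(range(12)), (3, 4))    # a larger 3x4 sample
values, shape = store[1]                 # values and shape of the second sample
```

Both samples live in the same `flat` buffer, and the index is what lets each one keep its own shape.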

What's the difference between this Python project and similar ones?

Enumerate comparisons.

It is more oriented toward deep learning and machine learning than similar projects, and it makes handling data easier.

Anyone who agrees with this pull request could submit an Approve review to it.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2025-11-06 13:17:53 -06:00

Reference: github-starred/awesome-python#1518