From ca3d421a6df73ebfccb210c00a8dd6a95af68bbe Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Lo=C3=AFc?= Date: Wed, 8 Feb 2017 23:27:20 +0100 Subject: [PATCH] Add Pachyderm to this list Pachyderm is a data lake that offers complete version control for data and leverages the container ecosystem to provide reproducible data processing. --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index f468ab3..9da7713 100644 --- a/README.md +++ b/README.md @@ -50,6 +50,7 @@ Your contributions are always welcome! * [Apache Hadoop](http://hadoop.apache.org/) - framework for distributed processing. Integrates MapReduce (parallel processing), YARN (job scheduling) and HDFS (distributed file system). * [Tigon](https://github.com/caskdata/tigon) - High Throughput Real-time Stream Processing Framework. +* [Pachyderm](http://pachyderm.io/) - Pachyderm is a data storage platform built on Docker and Kubernetes to provide reproducible data processing and analysis. ## Distributed Programming