Add Pachyderm to this list

Pachyderm is a data lake that offers complete version control for data and leverages the container ecosystem to provide reproducible data processing.
This commit is contained in:
Loïc
2017-02-08 23:27:20 +01:00
committed by GitHub
parent c96f972a9a
commit ca3d421a6d

View File

@@ -50,6 +50,7 @@ Your contributions are always welcome!
* [Apache Hadoop](http://hadoop.apache.org/) - framework for distributed processing. Integrates MapReduce (parallel processing), YARN (job scheduling) and HDFS (distributed file system).
* [Tigon](https://github.com/caskdata/tigon) - High Throughput Real-time Stream Processing Framework.
* [Pachyderm](http://pachyderm.io/) - Pachyderm is a data storage platform built on Docker and Kubernetes to provide reproducible data processing and analysis.
## Distributed Programming