awesome-data-pipeline
github.com/kennethanceyer/awesome-data-pipeline
Awesome list for data pipelines
GitHub Stars: 37
Curated Resources: 80
Categories: 3
Last Refreshed: 4 hours ago
What's inside
Components
- Apache Hive: Data Warehouse
(Apache foundation / Hadoop-friendly / MapReduce / Free).
- Apache Airflow: Workflow Management
(Apache foundation / Airbnb / Open Source / Free).
- Apache Airpal: Data Analysis
(Apache foundation / Airbnb / Query Editor / Open Source / Free).
- Argo: Workflow Management
(CNCF foundation / Kubernetes-friendly / Open Source / Free).
- Apache Arrow: Data Format
(Apache foundation / Data Format / Open Source / Free).
- Apache Avro: Data Format
(Apache foundation / Data Format / Open Source / Free).
Community
- Airflow Summit: Open Source / Foundation
- Databricks | Data + AI Summit: Vendors
- Kafka Summit: Open Source / Foundation
- Snowflake | Snowflake Summit: Vendors
Materials
- Databricks: Dummies Guide
- Manning - Data Pipelines with Apache Airflow: Books
- O'Reilly - Data Pipelines Pocket Reference: Books
- Snowflake: Dummies Guide
Showing a sample of 80 resources. View the full list on GitHub.