awesome-beam
github.com/pabloem/awesome-beam
A curated list of awesome resources for Apache Beam
What's inside
Courses
- Apache Beam | A Hands-On course to build Big data Pipelines
- Apache Beam Katas
Beam Wrappers
- Apache Beam in Kotlin to reduce boilerplate
- Scio - Scala wrapper for Apache Beam
- thurber - Clojure wrapper for Apache Beam
Community
Notebooks
- Apache Zeppelin
Web-based notebook that enables interactive data analytics with pluggable backends, plotting, etc.
Official Resources
Blogs and Blog Posts
- Coding Apache Beam in your Web Browser and Running it in Cloud Dataflow
How to create and run a Beam Pipeline on Dataflow using Code Editor.
- Java Development Environments for Beam on GCP
How to set up a development environment for Java Dataflow / Beam jobs.
- Official Beam Blog
- Profiling Apache Beam Python Pipelines
Medium
- Profiling Dataflow Pipelines (Java)
Medium
- Python Development Environments for Beam on GCP
How to set up a development environment for Python Dataflow jobs.
Talks
- slides
Excellent talk using Beam's Python SDK to run machine learning over a dataset of images.
- slides
A talk about the Spark runner's implementation, performance improvements, and roadmap.
- slides
A talk about how Beam enables Python pipelines to run on top of Flink.
- slides
A Transform Developer's perspective. Alex Van Boxel.
- slides
The Beam runner for Samza - Xinyu Liu
- slides
Showing a sample of 32 resources. View the full list on GitHub.