- Experience with Python programming.
- Experience with the Linux command line.
Audience
- Developers
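Apache Beam is an open-source, unified programming model for defining and executing parallel data processing pipelines. Its strength lies in running both batch and streaming pipelines with the same model, with execution carried out by one of Beam's supported distributed processing back-ends: Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow. Apache Beam is useful for ETL (Extract, Transform, and Load) tasks such as moving data between different storage media and data sources, transforming data into a more desirable format, and loading data into a new system.
In this instructor-led, live training (onsite or remote), participants will learn how to implement the Apache Beam SDKs in a Java or Python application that defines a data processing pipeline for decomposing a large data set into smaller chunks for independent, parallel processing.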
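By the end of this training, participants will be able to:
- Install and configure Apache Beam.
- Use a single programming model to carry out both batch and stream processing from within their Java or Python application.
- Execute pipelines across multiple environments.
Course Format
- Part lecture, part discussion, exercises, and heavy hands-on practice
Note
- This course will be available in Scala in the future. Please contact us to arrange.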
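Introduction
- Apache Beam vs. MapReduce, Spark Streaming, Kafka Streaming, Storm, and Flink
Installing and Configuring Apache Beam
Overview of Apache Beam Features and Architecture
- Beam Model, SDKs, Beam Pipeline Runners
- Distributed processing back-ends
Understanding the Apache Beam Programming Model
- How a pipeline is executed
Running a Sample Pipeline
- Preparing a WordCount pipeline
- Executing the pipeline locally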
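Designing a Pipeline
- Planning the structure, choosing the transforms, and determining the input and output methods
Creating the Pipeline
- Writing the driver program and defining the pipeline
- Using Apache Beam classes
- Data sets, transforms, I/O, data encoding, etc.
Executing the Pipeline
- Executing the pipeline locally, on remote machines, and on a public cloud
- Choosing a runner
- Runner-specific configurations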
Testing and Debugging Apache Beam
- Using type hints to emulate static typing
- Managing Python pipeline dependencies
Processing Bounded and Unbounded Datasets
- Windowing and triggers
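Making Your Pipelines Reusable and Maintainable
Creating New Data Sources and Sinks
- Apache Beam Source and Sink API
Integrating Apache Beam with Other Big Data Systems
- Apache Hadoop, Apache Spark, Apache Kafka
Troubleshooting
Summary and Conclusion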