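- Comfortable with the Java programming language (most programming exercises are in Java)
- Comfortable in a Linux environment (able to use the Linux command line and edit files with vi or nano)
- Basic knowledge of Hadoop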
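Lab environment
Zero install: there is no need to install Hadoop software on students' machines. A working Hadoop cluster will be provided for students.
Students will need the following
- An SSH client (Linux and Mac already include one; PuTTY is recommended for Windows)
- A browser to access the cluster; Firefox is recommended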
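Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course covers data management in HDFS and advanced use of Pig, Hive, and HBase in depth. These advanced programming techniques will benefit experienced Hadoop developers.
Audience: developers
Duration: three days
Format: lectures (50%) and hands-on labs (50%).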
Part 1: Data Management in HDFS
- Various data formats (JSON / Avro / Parquet)
- Compression schemes
- Data masking
- Labs: analyzing different data formats; enabling compression
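As a rough illustration of why the compression topic matters for row-oriented text formats such as JSON, the sketch below compresses a batch of repetitive JSON-like records and compares sizes. It is not taken from the course labs: it uses the JDK's built-in GZIP stream as a stand-in for a Hadoop compression codec, and the class name `CompressionSketch`, the helper names, and the record fields are all hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPOutputStream;

public class CompressionSketch {
    // Build n JSON-like records (hypothetical fields); the field names repeat on every line,
    // which is the kind of redundancy a general-purpose compressor exploits.
    static byte[] sampleRecords(int n) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < n; i++) {
            sb.append("{\"id\":").append(i)
              .append(",\"status\":\"active\",\"country\":\"US\"}\n");
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    // Compress a byte array with the JDK's GZIP implementation (a stand-in for a cluster codec).
    static byte[] gzip(byte[] input) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(input);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) {
        byte[] raw = sampleRecords(10_000);
        byte[] packed = gzip(raw);
        // Print both sizes so the effect of compression can be compared directly.
        System.out.printf("raw=%d bytes, gzip=%d bytes%n", raw.length, packed.length);
    }
}
```

On a real cluster the codec choice also affects whether files can be split across tasks, which this local sketch does not model.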
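Part 2: Advanced Pig
- User-defined functions
- Introduction to Pig libraries (Elephant Bird / DataFu)
- Loading complex structured data with Pig
- Pig tuning
- Labs: advanced Pig scripting; parsing complex data types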
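Part 3: Advanced Hive
- User-defined functions
- Compressed tables
- Hive performance tuning
- Labs: creating compressed tables; evaluating table formats and configuration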
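Part 4: Advanced HBase
- Advanced schema modeling
- Compression
- Bulk data ingest
- Wide-table / tall-table comparison
- HBase and Pig
- HBase and Hive
- HBase performance tuning
- Labs: tuning HBase; accessing HBase data from Pig and Hive; using Phoenix for data modeling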
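The wide-table versus tall-table comparison listed above comes down to where each measurement lives: in a column of one long row, or in a row of its own. The following minimal sketch shows the two layouts as plain string keys, without the HBase client library. The sensor scenario, the column family `d`, and every name in `RowKeySketch` are hypothetical and only illustrate the idea.

```java
public class RowKeySketch {
    // Wide layout: one row per sensor; every reading is a separate column in that row.
    static String wideRowKey(String sensorId) {
        return sensorId;
    }

    // Column name for a reading in the wide layout: hypothetical family "d", timestamp as qualifier.
    static String wideColumn(long timestampMillis) {
        return "d:" + timestampMillis;
    }

    // Tall layout: one row per reading; the timestamp becomes part of the row key.
    // Zero-padding to 13 digits keeps lexicographic order equal to chronological order
    // for non-negative millisecond timestamps, since HBase sorts row keys byte-wise.
    static String tallRowKey(String sensorId, long timestampMillis) {
        return sensorId + "#" + String.format("%013d", timestampMillis);
    }

    public static void main(String[] args) {
        long t1 = 1_700_000_000_000L;
        long t2 = t1 + 60_000L;
        System.out.println("wide: row=" + wideRowKey("sensor-7") + " column=" + wideColumn(t1));
        System.out.println("tall: row=" + tallRowKey("sensor-7", t1));
        System.out.println("tall: row=" + tallRowKey("sensor-7", t2));
    }
}
```

With the tall layout, a time-range query for one sensor maps to a contiguous row scan; with the wide layout, all readings for a sensor are fetched from a single row, which can grow very large.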