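- Basic Linux administration skills
- Basic programming skills
Audience:
This course is intended for IT professionals who want to store and process large-scale data sets in a distributed system environment.
Goal:
Gain in-depth knowledge of Hadoop cluster administration.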
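1: HDFS (17%)
- Describe the function of the HDFS daemons
- Describe the normal operation of an Apache Hadoop cluster, both in data storage and in data processing
- Identify the features of current computing systems that motivated systems such as Apache Hadoop
- Classify the major goals of HDFS design
- Given a scenario, identify an appropriate use case for HDFS Federation
- Identify the components and daemons of an HDFS HA-Quorum cluster
- Analyze the role of HDFS security (Kerberos)
- Given a scenario, determine the best data serialization choice
- Describe the file read and write paths
- Identify the commands for manipulating files in the Hadoop File System Shell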
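2: YARN and MapReduce version 2 (MRv2) (17%)
- Understand how upgrading a cluster from Hadoop 1 to Hadoop 2 affects cluster settings
- Understand how to deploy MapReduce v2 (MRv2 / YARN), including all YARN daemons
- Understand the basic design strategy of MapReduce v2 (MRv2)
- Determine how YARN handles resource allocation
- Identify the workflow of a MapReduce job running on YARN
- Determine which files must be changed, and how, to migrate a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) running on YARN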
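3: Hadoop Cluster Planning (16%)
- Principal points to consider when choosing the hardware and operating systems to host an Apache Hadoop cluster
- Analyze the options when selecting an operating system
- Understand kernel tuning and disk swapping
- Given a scenario and workload pattern, identify a hardware configuration appropriate to the scenario
- Given a scenario, determine the ecosystem components the cluster needs to run in order to fulfill the SLA
- Cluster sizing: given a scenario and frequency of execution, identify the specific requirements of the workload, including CPU, memory, storage, and disk I/O
- Disk sizing and configuration, including JBOD versus RAID, SANs, virtualization, and disk sizing requirements in a cluster
- Network topologies: understand network usage in Hadoop (for both HDFS and MapReduce) and, given a scenario, propose or identify the key network design components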
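As a rough illustration of the storage side of the cluster-sizing objective above, the following sketch estimates raw disk capacity and node count. It assumes the HDFS default replication factor of 3; the function names, the 25% scratch-space reserve, and the per-node disk size are hypothetical values chosen for illustration, not figures from the course.

```python
import math


def raw_storage_tb(data_tb, replication=3, scratch_fraction=0.25):
    """Estimate raw disk (TB) needed to hold data_tb of user data in HDFS.

    replication: HDFS replication factor (3 is the HDFS default).
    scratch_fraction: share of raw disk reserved for intermediate and
        temporary data (an assumed planning figure, not a Hadoop setting).
    """
    if not 0 <= scratch_fraction < 1:
        raise ValueError("scratch_fraction must be in [0, 1)")
    return data_tb * replication / (1 - scratch_fraction)


def nodes_needed(data_tb, disk_per_node_tb, replication=3, scratch_fraction=0.25):
    """Number of worker nodes needed for storage alone, rounded up."""
    raw = raw_storage_tb(data_tb, replication, scratch_fraction)
    return math.ceil(raw / disk_per_node_tb)
```

Under these assumptions, `raw_storage_tb(100)` gives 400.0 TB of raw disk, and `nodes_needed(100, 48)` gives 9 nodes with 48 TB of disk each. CPU, memory, and disk I/O requirements would have to be estimated separately from the workload.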
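4: Hadoop Cluster Installation and Administration (25%)
- Given a scenario, identify how the cluster handles disk and machine failures
- Analyze a logging configuration and the logging configuration file format
- Understand the basics of Hadoop metrics and cluster health monitoring
- Identify the function and purpose of the tools available for cluster monitoring
- Be able to install all the ecosystem components in CDH 5, including (but not limited to): Impala, Flume, Oozie, Hue, Manager, Sqoop, Hive, and Pig
- Identify the function and purpose of the tools available for managing the Apache Hadoop file system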
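5: Resource Management (10%)
- Understand the overall design goals of each of the Hadoop schedulers
- Given a scenario, determine how the FIFO Scheduler allocates cluster resources
- Given a scenario, determine how the Fair Scheduler allocates cluster resources under YARN
- Given a scenario, determine how the Capacity Scheduler allocates cluster resources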
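To give a feel for the idea behind fair scheduling, here is a minimal sketch of max-min fair sharing of a single resource among applications. It is a simplified illustration only: the actual YARN Fair Scheduler also deals with queues, weights, minimum shares, and preemption, none of which are modeled here. The function name and the example demands are hypothetical.

```python
def max_min_fair_share(capacity, demands):
    """Split capacity among apps using max-min fairness.

    demands maps an app name to the amount of resource it asks for.
    Apps asking for less than an equal split get their full demand;
    the leftover is re-split equally among the remaining apps.
    """
    allocation = {name: 0.0 for name in demands}
    unmet = dict(demands)          # apps whose demand is not yet satisfied
    left = float(capacity)

    while unmet and left > 1e-9:
        share = left / len(unmet)  # equal split of what is still free
        satisfied = [n for n, d in unmet.items() if d <= share]
        if not satisfied:
            # Every remaining app wants more than an equal share,
            # so each one simply receives the equal share.
            for name in unmet:
                allocation[name] += share
            break
        for name in satisfied:
            allocation[name] += unmet[name]
            left -= unmet[name]
            del unmet[name]
    return allocation
```

For example, with 100 units of capacity and demands of 20, 50, and 70, the app asking for 20 is fully served and the other two receive 40 each. A FIFO policy, by contrast, would serve the applications strictly in arrival order.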
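6: Monitoring and Logging (15%)
- Understand the functions and features of Hadoop's metric collection abilities
- Analyze the NameNode and JobTracker Web UIs
- Understand how to monitor cluster daemons
- Identify and monitor CPU usage on master nodes
- Describe how to monitor swap and memory allocation on all nodes
- Identify how to view and manage Hadoop's log files
- Interpret a log file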