Course Code: hadoopsparkforadmin
Duration: 35 hours
Prerequisites:
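  • System administration experience
  • Experience with the Linux command line
  • An understanding of big data concepts

Audience

  • System administrators
  • Database administrators (DBAs)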
Overview:

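Apache Hadoop is a popular data processing framework for processing large data sets across many computers.

This instructor-led, live training (online or onsite) is aimed at system administrators who wish to learn how to set up, deploy, and manage Hadoop clusters within their organization.

By the end of this training, participants will be able to:

  • Install and configure Apache Hadoop.
  • Understand the four major components of the Hadoop ecosystem: HDFS, MapReduce, YARN, and Hadoop Common.
  • Use the Hadoop Distributed File System (HDFS) to scale a cluster to hundreds or thousands of nodes.
  • Set up HDFS to operate as the storage engine for on-premise Spark deployments.
  • Set up Spark to access alternative storage solutions such as Amazon S3 and NoSQL database systems such as Redis, Elasticsearch, Couchbase, Aerospike, etc.
  • Carry out administrative tasks such as provisioning, managing, monitoring, and securing an Apache Hadoop cluster.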

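Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request a customized training for this course, please contact us to arrange.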
Course Outline:

Introduction

  • Introduction to cloud computing and big data solutions
  • Overview of Apache Hadoop features and architecture

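Setting up Hadoop

  • Planning a Hadoop cluster (on-premise, cloud, etc.)
  • Selecting the operating system and Hadoop distribution
  • Provisioning resources (hardware, network, etc.)
  • Downloading and installing the software
  • Sizing the cluster for flexibility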

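Working with HDFS

  • Understanding the Hadoop Distributed File System (HDFS)
  • Overview of the HDFS command reference
  • Accessing HDFS
  • Performing basic file operations on HDFS
  • Using S3 as a complement to HDFS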

Overview of MapReduce

  • Understanding the data flow in the MapReduce framework
  • Map, shuffle, sort, and reduce
  • Demo: Computing the top salaries
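
To make the map, shuffle, sort, and reduce stages listed above concrete, here is a minimal single-process sketch in plain Python. It is not the course's demo and does not use the Hadoop API; the sample records, the department key, and all function names are hypothetical, chosen only to illustrate how the highest salary per key could be computed in that four-stage style.

```python
from collections import defaultdict

# Hypothetical sample records: (department, salary). Not from the course material.
RECORDS = [
    ("engineering", 120000),
    ("sales", 65000),
    ("engineering", 135000),
    ("sales", 72000),
    ("support", 58000),
]


def map_phase(records):
    """Map: emit one (key, value) pair per input record."""
    for department, salary in records:
        yield department, salary


def shuffle_and_sort(pairs):
    """Shuffle: group values by key. Sort: return the groups ordered by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(grouped):
    """Reduce: collapse each key's values to one result (here, the maximum)."""
    return {key: max(values) for key, values in grouped}


def max_salary_by_department(records):
    """Chain the stages in the order a MapReduce job would run them."""
    return reduce_phase(shuffle_and_sort(map_phase(records)))


if __name__ == "__main__":
    print(max_salary_by_department(RECORDS))
```

On a real cluster the map and reduce functions would run in parallel on different nodes, and the framework itself would perform the shuffle and sort between them; the sketch only mirrors the order of the stages.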

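Working with YARN

  • Understanding resource management in Hadoop
  • Working with the ResourceManager, NodeManager, and ApplicationMaster
  • Scheduling jobs under YARN
  • Scheduling for large numbers of nodes and clusters
  • Demo: Job scheduling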

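Integrating Hadoop with Spark

  • Setting up storage for Spark (HDFS, Amazon S3, NoSQL, etc.)
  • Understanding Resilient Distributed Datasets (RDDs)
  • Creating an RDD
  • Implementing RDD transformations
  • Demo: Implementing a text search program for movie titles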

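Managing a Hadoop Cluster

  • Monitoring Hadoop
  • Securing a Hadoop cluster
  • Adding and removing nodes
  • Running performance benchmarks
  • Tuning a Hadoop cluster to optimize performance
  • Backup, recovery, and business continuity planning
  • Ensuring high availability (HA)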

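Upgrading and Migrating a Hadoop Cluster

  • Assessing workload requirements
  • Upgrading Hadoop
  • Migrating from on-premise to cloud and vice versa
  • Recovering from failures

Troubleshooting

Summary and Conclusion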

Sites Published:

United Arab Emirates - Hadoop and Spark for Administrators

Qatar - Hadoop and Spark for Administrators

Egypt - Hadoop and Spark for Administrators

Saudi Arabia - Hadoop and Spark for Administrators

South Africa - Hadoop and Spark for Administrators

Brasil - Hadoop and Spark for Administrators

Canada - Hadoop and Spark for Administrators

中国 - Hadoop and Spark for Administrators

香港 - Hadoop and Spark for Administrators

澳門 - Hadoop and Spark for Administrators

台灣 - Hadoop and Spark for Administrators

USA - Hadoop and Spark for Administrators

Österreich - Hadoop and Spark for Administrators

Schweiz - Hadoop and Spark for Administrators

Deutschland - Hadoop and Spark for Administrators

Czech Republic - Hadoop and Spark for Administrators

Denmark - Hadoop and Spark for Administrators

Estonia - Hadoop and Spark for Administrators

Finland - Hadoop and Spark for Administrators

Greece - Hadoop and Spark for Administrators

Magyarország - Hadoop and Spark for Administrators

Ireland - Hadoop and Spark for Administrators

Luxembourg - Hadoop and Spark for Administrators

Latvia - Hadoop and Spark for Administrators

España - Hadoop and Spark for Administrators

Italia - Hadoop and Spark for Administrators

Lithuania - Hadoop and Spark for Administrators

Nederland - Hadoop and Spark for Administrators

Norway - Hadoop and Spark for Administrators

Portugal - Hadoop and Spark for Administrators

România - Hadoop and Spark for Administrators

Sverige - Hadoop and Spark for Administrators

Türkiye - Hadoop and Spark for Administrators

Malta - Hadoop and Spark for Administrators

Belgique - Hadoop and Spark for Administrators

France - Hadoop and Spark for Administrators

日本 - Hadoop and Spark for Administrators

Australia - Hadoop and Spark for Administrators

Malaysia - Hadoop and Spark for Administrators

New Zealand - Hadoop and Spark for Administrators

Philippines - Hadoop and Spark for Administrators

Singapore - Hadoop and Spark for Administrators

Thailand - Hadoop and Spark for Administrators

Vietnam - Hadoop and Spark for Administrators

India - Hadoop and Spark for Administrators

Argentina - Hadoop and Spark for Administrators

Chile - Hadoop and Spark for Administrators

Costa Rica - Hadoop and Spark for Administrators

Ecuador - Hadoop and Spark for Administrators

Guatemala - Hadoop and Spark for Administrators

Colombia - Hadoop and Spark for Administrators

México - Hadoop and Spark for Administrators

Panama - Hadoop and Spark for Administrators

Peru - Hadoop and Spark for Administrators

Uruguay - Hadoop and Spark for Administrators

Venezuela - Hadoop and Spark for Administrators

Polska - Hadoop and Spark for Administrators

United Kingdom - Hadoop and Spark for Administrators

South Korea - Hadoop and Spark for Administrators

Pakistan - Hadoop and Spark for Administrators

Sri Lanka - Hadoop and Spark for Administrators

Bulgaria - Hadoop and Spark for Administrators

Bolivia - Hadoop and Spark for Administrators

Indonesia - Hadoop and Spark for Administrators

Kazakhstan - Hadoop and Spark for Administrators

Moldova - Hadoop and Spark for Administrators

Morocco - Hadoop and Spark for Administrators

Tunisia - Hadoop and Spark for Administrators

Kuwait - Hadoop and Spark for Administrators

Oman - Hadoop and Spark for Administrators

Slovakia - Hadoop and Spark for Administrators

Kenya - Hadoop and Spark for Administrators

Nigeria - Hadoop and Spark for Administrators

Botswana - Hadoop and Spark for Administrators

Slovenia - Hadoop and Spark for Administrators

Croatia - Hadoop and Spark for Administrators

Serbia - Hadoop and Spark for Administrators

Bhutan - Hadoop and Spark for Administrators

Nepal - Hadoop and Spark for Administrators

Uzbekistan - Hadoop and Spark for Administrators