Course Code: hadoopdeva
Duration: 21 hours
Prerequisites:
  • comfortable with Java programming language (most programming exercises are in java)
  • comfortable in Linux environment (be able to navigate Linux command line, edit files using vi / nano)
  • a working  knowledge of Hadoop.

Lab environment

Zero Install: There is no need to install hadoop software on students’ machines! A working hadoop cluster will be provided for students.

Students will need the following

  • an SSH client (Linux and Mac already have ssh clients, for Windows Putty is recommended)
  • a browser to access the cluster. We recommend Firefox browser
Overview:

Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course delves into data management in HDFS, advanced Pig, Hive, and HBase.  These advanced programming techniques will be beneficial to experienced Hadoop developers.

Audience: developers

Duration: three days

Format: lectures (50%) and hands-on labs (50%).

Course Outline:

Section 1: Data Management in HDFS

  • Various Data Formats (JSON / Avro / Parquet)
  • Compression Schemes
  • Data Masking
  • Labs : Analyzing different data formats;  enabling compression

Section 2: Advanced Pig

  • User-defined Functions
  • Introduction to Pig Libraries (ElephantBird / Data-Fu)
  • Loading Complex Structured Data using Pig
  • Pig Tuning
  • Labs : advanced pig scripting, parsing complex data types

Section 3 : Advanced Hive

  • User-defined Functions
  • Compressed Tables
  • Hive Performance Tuning
  • Labs : creating compressed tables, evaluating table formats and configuration

Section 4 : Advanced HBase

  • Advanced Schema Modelling
  • Compression
  • Bulk Data Ingest
  • Wide-table / Tall-table comparison
  • HBase and Pig
  • HBase and Hive
  • HBase Performance Tuning
  • Labs : tuning HBase; accessing HBase data from Pig & Hive; Using Phoenix for data modeling
Sites Published:

United Arab Emirates - Advanced Hadoop for Developers

Qatar - Advanced Hadoop for Developers

Egypt - Advanced Hadoop for Developers

Saudi Arabia - Advanced Hadoop for Developers

South Africa - Advanced Hadoop for Developers

Brasil - Advanced Hadoop for Developers

Canada - Advanced Hadoop for Developers

中国 - Advanced Hadoop for Developers

香港 - Advanced Hadoop for Developers

澳門 - Advanced Hadoop for Developers

台灣 - Advanced Hadoop for Developers

USA - Advanced Hadoop for Developers

Österreich - Advanced Hadoop for Developers

Schweiz - Advanced Hadoop for Developers

Deutschland - Advanced Hadoop for Developers

Czech Republic - Advanced Hadoop for Developers

Denmark - Advanced Hadoop for Developers

Estonia - Advanced Hadoop for Developers

Finland - Advanced Hadoop for Developers

Greece - Advanced Hadoop for Developers

Magyarország - Advanced Hadoop for Developers

Ireland - Advanced Hadoop for Developers

Luxembourg - Advanced Hadoop for Developers

Latvia - Advanced Hadoop for Developers

España - Hadoop Avanzado para Desarrolladores

Italia - Advanced Hadoop for Developers

Lithuania - Advanced Hadoop for Developers

Nederland - Advanced Hadoop for Developers

Norway - Advanced Hadoop for Developers

Portugal - Advanced Hadoop for Developers

România - Advanced Hadoop for Developers

Sverige - Advanced Hadoop for Developers

Türkiye - Advanced Hadoop for Developers

Malta - Advanced Hadoop for Developers

Belgique - Hadoop pour Développeurs, niveau Acvancé

France - Hadoop pour Développeurs, niveau Acvancé

日本 - Advanced Hadoop for Developers

Australia - Advanced Hadoop for Developers

Malaysia - Advanced Hadoop for Developers

New Zealand - Advanced Hadoop for Developers

Philippines - Advanced Hadoop for Developers

Singapore - Advanced Hadoop for Developers

Thailand - Advanced Hadoop for Developers

Vietnam - Advanced Hadoop for Developers

India - Advanced Hadoop for Developers

Argentina - Hadoop Avanzado para Desarrolladores

Chile - Hadoop Avanzado para Desarrolladores

Costa Rica - Hadoop Avanzado para Desarrolladores

Ecuador - Hadoop Avanzado para Desarrolladores

Guatemala - Hadoop Avanzado para Desarrolladores

Colombia - Hadoop Avanzado para Desarrolladores

México - Hadoop Avanzado para Desarrolladores

Panama - Hadoop Avanzado para Desarrolladores

Peru - Hadoop Avanzado para Desarrolladores

Uruguay - Hadoop Avanzado para Desarrolladores

Venezuela - Hadoop Avanzado para Desarrolladores

Polska - Advanced Hadoop for Developers

United Kingdom - Advanced Hadoop for Developers

South Korea - Advanced Hadoop for Developers

Pakistan - Advanced Hadoop for Developers

Sri Lanka - Advanced Hadoop for Developers

Bulgaria - Advanced Hadoop for Developers

Bolivia - Hadoop Avanzado para Desarrolladores

Indonesia - Advanced Hadoop for Developers

Kazakhstan - Advanced Hadoop for Developers

Moldova - Advanced Hadoop for Developers

Morocco - Advanced Hadoop for Developers

Tunisia - Advanced Hadoop for Developers

Kuwait - Advanced Hadoop for Developers

Oman - Advanced Hadoop for Developers

Slovakia - Advanced Hadoop for Developers

Kenya - Advanced Hadoop for Developers

Nigeria - Advanced Hadoop for Developers

Botswana - Advanced Hadoop for Developers

Slovenia - Advanced Hadoop for Developers

Croatia - Advanced Hadoop for Developers

Serbia - Advanced Hadoop for Developers

Bhutan - Advanced Hadoop for Developers

Nepal - Advanced Hadoop for Developers

Uzbekistan - Advanced Hadoop for Developers