- comfortable with Java programming language (most programming exercises are in java)
- comfortable in Linux environment (be able to navigate Linux command line, edit files using vi / nano)
- a working knowledge of Hadoop.
Lab environment
Zero Install: There is no need to install hadoop software on students’ machines! A working hadoop cluster will be provided for students.
Students will need the following
- an SSH client (Linux and Mac already have ssh clients, for Windows Putty is recommended)
- a browser to access the cluster. We recommend Firefox browser
Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course delves into data management in HDFS, advanced Pig, Hive, and HBase. These advanced programming techniques will be beneficial to experienced Hadoop developers.
Audience: developers
Duration: three days
Format: lectures (50%) and hands-on labs (50%).
Section 1: Data Management in HDFS
- Various Data Formats (JSON / Avro / Parquet)
- Compression Schemes
- Data Masking
- Labs : Analyzing different data formats; enabling compression
Section 2: Advanced Pig
- User-defined Functions
- Introduction to Pig Libraries (ElephantBird / Data-Fu)
- Loading Complex Structured Data using Pig
- Pig Tuning
- Labs : advanced pig scripting, parsing complex data types
Section 3 : Advanced Hive
- User-defined Functions
- Compressed Tables
- Hive Performance Tuning
- Labs : creating compressed tables, evaluating table formats and configuration
Section 4 : Advanced HBase
- Advanced Schema Modelling
- Compression
- Bulk Data Ingest
- Wide-table / Tall-table comparison
- HBase and Pig
- HBase and Hive
- HBase Performance Tuning
- Labs : tuning HBase; accessing HBase data from Pig & Hive; Using Phoenix for data modeling
United Arab Emirates - Advanced Hadoop for Developers
Qatar - Advanced Hadoop for Developers
Egypt - Advanced Hadoop for Developers
Saudi Arabia - Advanced Hadoop for Developers
South Africa - Advanced Hadoop for Developers
Brasil - Advanced Hadoop for Developers
Canada - Advanced Hadoop for Developers
中国 - Advanced Hadoop for Developers
香港 - Advanced Hadoop for Developers
澳門 - Advanced Hadoop for Developers
台灣 - Advanced Hadoop for Developers
USA - Advanced Hadoop for Developers
Österreich - Advanced Hadoop for Developers
Schweiz - Advanced Hadoop for Developers
Deutschland - Advanced Hadoop for Developers
Czech Republic - Advanced Hadoop for Developers
Denmark - Advanced Hadoop for Developers
Estonia - Advanced Hadoop for Developers
Finland - Advanced Hadoop for Developers
Greece - Advanced Hadoop for Developers
Magyarország - Advanced Hadoop for Developers
Ireland - Advanced Hadoop for Developers
Luxembourg - Advanced Hadoop for Developers
Latvia - Advanced Hadoop for Developers
España - Hadoop Avanzado para Desarrolladores
Italia - Advanced Hadoop for Developers
Lithuania - Advanced Hadoop for Developers
Nederland - Advanced Hadoop for Developers
Norway - Advanced Hadoop for Developers
Portugal - Advanced Hadoop for Developers
România - Advanced Hadoop for Developers
Sverige - Advanced Hadoop for Developers
Türkiye - Advanced Hadoop for Developers
Malta - Advanced Hadoop for Developers
Belgique - Hadoop pour Développeurs, niveau Acvancé
France - Hadoop pour Développeurs, niveau Acvancé
日本 - Advanced Hadoop for Developers
Australia - Advanced Hadoop for Developers
Malaysia - Advanced Hadoop for Developers
New Zealand - Advanced Hadoop for Developers
Philippines - Advanced Hadoop for Developers
Singapore - Advanced Hadoop for Developers
Thailand - Advanced Hadoop for Developers
Vietnam - Advanced Hadoop for Developers
India - Advanced Hadoop for Developers
Argentina - Hadoop Avanzado para Desarrolladores
Chile - Hadoop Avanzado para Desarrolladores
Costa Rica - Hadoop Avanzado para Desarrolladores
Ecuador - Hadoop Avanzado para Desarrolladores
Guatemala - Hadoop Avanzado para Desarrolladores
Colombia - Hadoop Avanzado para Desarrolladores
México - Hadoop Avanzado para Desarrolladores
Panama - Hadoop Avanzado para Desarrolladores
Peru - Hadoop Avanzado para Desarrolladores
Uruguay - Hadoop Avanzado para Desarrolladores
Venezuela - Hadoop Avanzado para Desarrolladores
Polska - Advanced Hadoop for Developers
United Kingdom - Advanced Hadoop for Developers
South Korea - Advanced Hadoop for Developers
Pakistan - Advanced Hadoop for Developers
Sri Lanka - Advanced Hadoop for Developers
Bulgaria - Advanced Hadoop for Developers
Bolivia - Hadoop Avanzado para Desarrolladores
Indonesia - Advanced Hadoop for Developers
Kazakhstan - Advanced Hadoop for Developers
Moldova - Advanced Hadoop for Developers
Morocco - Advanced Hadoop for Developers
Tunisia - Advanced Hadoop for Developers
Kuwait - Advanced Hadoop for Developers
Oman - Advanced Hadoop for Developers
Slovakia - Advanced Hadoop for Developers
Kenya - Advanced Hadoop for Developers
Nigeria - Advanced Hadoop for Developers
Botswana - Advanced Hadoop for Developers
Slovenia - Advanced Hadoop for Developers
Croatia - Advanced Hadoop for Developers
Serbia - Advanced Hadoop for Developers
Bhutan - Advanced Hadoop for Developers