Course Code: greenplumarch
Duration: 21 hours
Prerequisites:
  • An understanding of relational databases and SQL
  • Experience with data warehousing or analytical systems
  • Familiarity with Linux command line operations

Audience

  • Data architects and engineers
  • Database administrators and technical leads
  • BI developers and analytics specialists working with Greenplum
Overview:

Greenplum is an open-source massively parallel processing (MPP) data platform designed for analytics and large-scale data environments.

This instructor-led, live training (online or onsite) is aimed at intermediate-level to advanced-level technical users who wish to understand the internal architecture of Greenplum, build optimized data models, and apply high-performance SQL in distributed environments.

By the end of this training, participants will be able to:

  • Understand Greenplum’s shared-nothing architecture and parallelism model.
  • Design distributed table structures and partitions optimized for query performance.
  • Use indexing strategies, storage formats, and physical design techniques.
  • Interpret EXPLAIN plans and optimize queries using best practices.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request a customized training for this course, please contact us to arrange.
Course Outline:

Greenplum Architecture

  • Parallel processing and symmetric multi-processing
  • Segment roles and cluster configuration
  • Scalability and data movement
  • Greenplum Data Warehouse architecture

Greenplum Table Structures

  • Distributed vs. randomly assigned tables
  • Heap vs. append-only tables
  • Row vs. columnar storage formats
  • Partitioned and clustered tables

Data Distribution and Hashing

  • Hashing logic and distribution keys
  • Skew handling and performance impact
  • Hash maps and row placement strategies

Indexes and Performance Optimization

  • Clustered and non-clustered indexes
  • B-tree and bitmap index use cases
  • Index scan and storage behavior

Physical Database Design

  • Normalization and logical model design
  • User access strategies and distribution analysis
  • Data demographics and indexing decisions

Denormalization Techniques

  • Derived data, summary tables, and pre-joins
  • Columnar tables as vertical partitioning
  • Data marts and materialized views

Advanced SQL and Query Execution

  • Join strategies and redistribution
  • OLAP and window functions
  • Temporary tables, subqueries, and derived tables

EXPLAIN Plans and Query Tuning

  • Reading and interpreting EXPLAIN output
  • Cost analysis and plan optimization
  • Join movement and segment-local operations

Greenplum Utilities and Best Practices

  • ANALYZE and VACUUM
  • Data loading and movement with Nexus
  • Security, permissions, and performance tips

Summary and Next Steps

Sites Published:

United Arab Emirates - Greenplum Architecture and Data Modeling

Qatar - Greenplum Architecture and Data Modeling

Egypt - Greenplum Architecture and Data Modeling

Saudi Arabia - Greenplum Architecture and Data Modeling

South Africa - Greenplum Architecture and Data Modeling

Brasil - Greenplum Architecture and Data Modeling

Canada - Greenplum Architecture and Data Modeling

中国 - Greenplum Architecture and Data Modeling

香港 - Greenplum Architecture and Data Modeling

澳門 - Greenplum Architecture and Data Modeling

台灣 - Greenplum Architecture and Data Modeling

USA - Greenplum Architecture and Data Modeling

Österreich - Greenplum Architecture and Data Modeling

Schweiz - Greenplum Architecture and Data Modeling

Deutschland - Greenplum Architecture and Data Modeling

Czech Republic - Greenplum Architecture and Data Modeling

Denmark - Greenplum Architecture and Data Modeling

Estonia - Greenplum Architecture and Data Modeling

Finland - Greenplum Architecture and Data Modeling

Greece - Greenplum Architecture and Data Modeling

Magyarország - Greenplum Architecture and Data Modeling

Ireland - Greenplum Architecture and Data Modeling

Luxembourg - Greenplum Architecture and Data Modeling

Latvia - Greenplum Architecture and Data Modeling

España - Greenplum Architecture and Data Modeling

Italia - Greenplum Architecture and Data Modeling

Lithuania - Greenplum Architecture and Data Modeling

Nederland - Greenplum Architecture and Data Modeling

Norway - Greenplum Architecture and Data Modeling

Portugal - Greenplum Architecture and Data Modeling

România - Greenplum Architecture and Data Modeling

Sverige - Greenplum Architecture and Data Modeling

Türkiye - Greenplum Architecture and Data Modeling

Malta - Greenplum Architecture and Data Modeling

Belgique - Greenplum Architecture and Data Modeling

France - Greenplum Architecture and Data Modeling

日本 - Greenplum Architecture and Data Modeling

Australia - Greenplum Architecture and Data Modeling

Malaysia - Greenplum Architecture and Data Modeling

New Zealand - Greenplum Architecture and Data Modeling

Philippines - Greenplum Architecture and Data Modeling

Singapore - Greenplum Architecture and Data Modeling

Thailand - Greenplum Architecture and Data Modeling

Vietnam - Greenplum Architecture and Data Modeling

India - Greenplum Architecture and Data Modeling

Argentina - Greenplum Architecture and Data Modeling

Chile - Greenplum Architecture and Data Modeling

Costa Rica - Greenplum Architecture and Data Modeling

Ecuador - Greenplum Architecture and Data Modeling

Guatemala - Greenplum Architecture and Data Modeling

Colombia - Greenplum Architecture and Data Modeling

México - Greenplum Architecture and Data Modeling

Panama - Greenplum Architecture and Data Modeling

Peru - Greenplum Architecture and Data Modeling

Uruguay - Greenplum Architecture and Data Modeling

Venezuela - Greenplum Architecture and Data Modeling

Polska - Greenplum Architecture and Data Modeling

United Kingdom - Greenplum Architecture and Data Modeling

South Korea - Greenplum Architecture and Data Modeling

Pakistan - Greenplum Architecture and Data Modeling

Sri Lanka - Greenplum Architecture and Data Modeling

Bulgaria - Greenplum Architecture and Data Modeling

Bolivia - Greenplum Architecture and Data Modeling

Indonesia - Greenplum Architecture and Data Modeling

Kazakhstan - Greenplum Architecture and Data Modeling

Moldova - Greenplum Architecture and Data Modeling

Morocco - Greenplum Architecture and Data Modeling

Tunisia - Greenplum Architecture and Data Modeling

Kuwait - Greenplum Architecture and Data Modeling

Oman - Greenplum Architecture and Data Modeling

Slovakia - Greenplum Architecture and Data Modeling

Kenya - Greenplum Architecture and Data Modeling

Nigeria - Greenplum Architecture and Data Modeling

Botswana - Greenplum Architecture and Data Modeling

Slovenia - Greenplum Architecture and Data Modeling

Croatia - Greenplum Architecture and Data Modeling

Serbia - Greenplum Architecture and Data Modeling

Bhutan - Greenplum Architecture and Data Modeling

Nepal - Greenplum Architecture and Data Modeling

Uzbekistan - Greenplum Architecture and Data Modeling