Course Code: bspkkafk
Duration: 21 hours
Course Outline:

Module 1. Brief Introduction

  • Apache Kafka as a distributed streaming platform
  • Topics and Logs
  • Distribution
  • Producers and Consumers

Module 2. Kafka installation

  • Download and installation
  • The role of zookeeper
  • Guidelines for zookeeper installation
  • Testing the installation
  • Hardware and OS considerations
  • Disks and Filesystem considerations
  • Thinking about the upgrade

Module 3. Configuring Kafka

  • Essentials parameters for Brokers
  • Full Broker configuration
  • Topic configuration
  • The producer configuration
  • The consumer configuration
  • Considerations about Kafka Connect configuration

Module 4. Kafka Operations

  • Configuring Topics
  • Adding and removing topics
  • Increasing replication factor
  • The right balance
  • Balancing leadership and Balancing Replicas
  • Expanding your cluster
  • Automatically migrating data to new machines
  • Custom partition assignment and migration
  • Limiting Bandwidth Usage during Data Migration
  • Decommissioning brokers
  • How to scale
  • Implementing a Mirror

Module 5. Monitoring

  • Common monitoring metrics for producer/consumer
  • Producer monitoring
  • Consumer Group Metrics
  • Using Topic-level Fetch Metrics

Module 6. Introduction to Kafka Streams

  • The Architecture
  • Stream Partitions and Tasks
  • The Threading Model
  • Understanding Fault Tolerance

Module 7. Writing a Streams Application

  • The Streams DSL
  • Using the Processor API
  • Data Types and Serialization
  • Interactive Queries
  • How to manage memory
  • Running Streams Applications
  • Managing Streams Application Topics
  • Streams Security
  • The Application Reset Tool