Course Outline
Kafka Administration Essentials
- Understanding where Kafka fits within a modern data platform and the typical responsibilities of production operations.
- Core concepts for operators: brokers, topics, partitions, offsets, and consumer groups.
- Replication fundamentals: leaders and followers, in-sync replicas, and availability trade-offs.
- Key operational highlights and common terminology found in runbooks.
KRaft Mode and Cluster Design
- KRaft basics: controllers, metadata quorum, elections, and their operational significance.
- Deployment planning: sizing for throughput, partitions, retention periods, and future growth.
- Node roles and layouts: combined versus dedicated controllers, with considerations for fault domains.
- Lab: inspecting KRaft metadata, validating quorum health, and interpreting controller logs.
Installation, Configuration, and Day-to-Day Operations
- Installation approaches (packages, tarballs, containers) and standardization strategies for enterprise environments.
- Core broker configuration affecting reliability: listeners, replication settings, log directories, and retention policies.
- Safe service operations: startup sequences, graceful shutdowns, and validation checks.
- Lab: deploying a multi-node cluster, verifying broker registration, and confirming baseline produce and consume operations.
Managing Topics, Partitions, and Data Placement
- Topic lifecycle management using the Kafka CLI: creating, describing, updating configurations, and deleting.
- Selecting partitions and replication factors for real-world workloads, including identifying common anti-patterns.
- Reassignments and balancing: determining when to move partitions and verifying progress safely.
- Lab: creating topics, triggering a partition reassignment, simulating a broker outage, and confirming recovery.
Securing Kafka for Production
- TLS for client and inter-broker traffic: managing certificates, trust chains, and validation steps.
- Authentication with SASL: selecting appropriate mechanisms and avoiding misconfiguration.
- Authorization with ACLs: implementing least-privilege patterns for administrators, producers, and consumers.
- Lab: enabling TLS and SASL, validating client connectivity, and applying ACLs for application roles.
Observability, Reliability, and Troubleshooting
- Monitoring essentials: controller health, under-replicated partitions, request latency, and disk and network saturation.
- Logs and metrics: reading broker logs and exposing metrics via the JMX exporter to common observability stacks.
- Operational playbooks: performing rolling restarts, applying safe configuration changes, and handling disk-full and ISR issues.
- Lab: building a minimal alert set, diagnosing a degraded cluster, and restoring healthy replication.
Upgrades and Disaster Recovery Readiness
- Upgrade planning for Kafka: conducting compatibility checks, staging, and defining rollback approaches.
- Backups and recovery expectations: understanding what can be backed up, what cannot, and the basics of configuration recovery.
- Overview of cross-cluster replication and when to utilize MirrorMaker 2 for disaster recovery and migrations.
- Wrap-up: operational checklists, handover artifacts, and next steps for production rollout.
Requirements
- A solid understanding of basic Linux administration (users, services, files, and permissions).
- Experience with TCP/IP networking concepts (DNS, ports, firewalls, and load balancers).
- Basic scripting experience (Bash, PowerShell, or similar) for routine operational tasks.
Audience
- Kafka administrators and platform engineers responsible for operating Kafka clusters.
- Site reliability engineers and DevOps engineers supporting streaming platforms.
- Infrastructure and operations teams deploying new KRaft-based Kafka clusters or migrating from ZooKeeper.
Testimonials (5)
Possibility to perform independent exercises in the training environment.
Tomasz - PKO Zycie Towarzystwo Ubezpieczen S.A.
Course - Kafka for Administrators
To the point, proper pace (bash basics required though)
Krzysztof - Agora SA
Course - Kafka for Administrators
Trainer accepts questions at any time of the session, even if the subject was taught few days past.
GOODLUCK MASHIMBA - Tanzania Revenue Authority
Course - Kafka for Administrators
Nice presentation skill
Md Maruf Hossain - ATOS PGS sp. z o.o.
Course - Kafka for Administrators
Grate skills, examples, very good exercises