Overview
This course will make you proficient in its architecture, installation configuration and performance tuning. Our trainers teach you how to integrate Kafka with Spark, Hadoop and the other distributed processing system—Storm. By taking this Apache Kafka course, you will learn the basics of Apache ZooKeeper and develop the skills to implement Kafka for real-time messaging.
Objectives
At the end of Apache Kafka training course, participants will learn
- The basics of a messaging system, about Kafka and its various components and how it helps in real-time data processing
- How to design and develop a robust messaging system
- The Kafka cluster and how it integrates with other Big Data Frameworks like Hadoop
- Integrating Kafka with Storm
Prerequisites
Participants are expected to know core concepts of Java, in order to attend this course.
Course Outline
- Introduction to Big Data
- Big Data Customer Scenarios
- What is Kafka?
- Need for Kafka
- Core Concepts of Kafka
- Kafka Architecture
- Where is Kafka Used
- Understanding the components of Kafka Cluster
- Installation of Kafka Cluster
- Configuring Kafka Cluster
- Producer of Kafka
- Consumer of Kafka
- Producer and Consumer in Action
- Offset
- Design
- Hardware
- Kafka Monitoring and Issues
- Kafka Performance Tuning
- Reading data from Kafka
- Demo-Twitter Kafka Producer
- Introduction to Scala
- Mixed Paradigm-Functional Programming
- Scala Installation & Configuration
- Scala REPL
- Scala Project Using Eclipse
- Understanding the Hadoop Cluster
- Integrating Kafka with Hadoop Cluster
- Understanding Apache Storm
- Implementing Spouts and Bolts
- Kafka with Storm Spout
- Ecosystem of Spark
- Understanding the Spark Cluster
- Integrating Kafka with Spark