Apache Kafka is an open-source distributed event streaming platform used to collect, store, and integrate data at scale. Many companies use it for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Across industries, Kafka supports event streaming use cases such as processing payments in real time for banks and stock exchanges, continuously capturing and analyzing sensor data from IoT devices, and collecting and reacting to customer interactions and orders in retail, travel bookings, and mobile applications.

In this free-to-download guide, we walk you through some core aspects of Apache Kafka, including a general overview, jobs that use Kafka, key terminology, and the algorithms you need to get started.


ODSC - Open Data Science

Our passion is bringing thousands of the best and brightest data scientists together under one roof for an incredible learning and networking experience.
