Data Day Texas 2017

We’re heading to Texas in January to talk about Kafka and Spark. Check out our session below.

Saturday, January 14

Data Pipelines with Kafka and Spark


Spark and Kafka have emerged as a core part of distributed data processing pipelines. This tutorial will explain how Spark, Kafka and rest of the big data ecosystem fit together in production to create a data platform supporting batch, interactive, and real-time analytical workloads. By examining use cases and architectures, we’ll trace the flow of data from source to output, and explore the options and considerations for each stage of the pipeline.