All Blog Posts

Structured Streaming in Spark

This post gives you a quick overview of the new structured streaming feature in Spark 2.0, illustrating why it’s an exciting addition.

ANDREW RAY

July 28, 2016

Why You Need a Data Strategy

While it would be great for everyone if you could just “buy a Hadoop” and skip straight to “Profit!”, in reality there’s a lot of work involved, and 95% of it is unique to your business. How do you determine the steps of a big data project, and ensure it delivers results early? This post talks about where to start.

EDD WILDER-JAMES

July 21, 2016

Brain Monitoring with Kafka, OpenTSDB, and Grafana

A team of our data scientists recently won 2nd place in Confluent’s Kafka Hackathon. In this post, explore their project—streaming EEG data and visualizing it.

JEFF LAM

July 14, 2016

Building Pipelines to Understand User Behavior

In this post, we cover what’s needed to understand user activity, and we look at some pipeline architectures that support this analysis.

MARK MIMS

June 23, 2016

Kafka Simple Consumer Failure Recovery

This post walks you through a simple failure recovery mechanism, as well as a test harness that allows you to make sure this mechanism works as expected.

DMITRIY FEFERMAN

June 21, 2016

CDO FAQ

This month’s Throwback Thursday feature looks at some frequently asked questions about the role of the Chief Data Officer. An updated report will be forthcoming in September.

JULIE STEELE

June 16, 2016
