Mark Mims

Mark has extensive experience architecting and implementing data science solutions across a variety of industries. His passion is Data Plumbing, where Data Science meets the real world of DevOps and Infrastructure Engineering.

Easily Spinning up Data Platforms

A quick overview of the motivation behind our instant and repeatable data platform tool.

April 4, 2017

Building Pipelines to Understand User Behavior

In this post, we cover what’s needed to understand user activity, and we look at some pipeline architectures that support this analysis.

June 23, 2016

Develop Spark Apps on YARN Using Docker

Rather than get bitten by the idiosyncrasies involved in running Spark on YARN vs. standalone when you go to deploy, here’s a way to set up a development environment for Spark that more closely mimics how it’s used in the wild.

October 15, 2015

Past Events

2025

2017

Apr 2 - 7

Enterprise Data World 2017
Atlanta, GA

Enterprise Data World focuses on data-driven business. Several of us will be there this year, talking about data platforms and enterprise data science. Let us know if you’ll be there, or you can sign up to receive our slides.

Details

2016

Jul 23

Data Day Seattle 2016
Seattle, WA

Join us as CTO John Akred gives a talk on alternative approaches to valuing data within an organization, and Data Scientist Chloe Mawer demonstrates the power of Jupyter notebooks using a real-world train-detection problem. We’ll also present a tutorial on building data pipelines with Kafka and Spark.

Details
Mar 8

Hadoop with the Best

Principal Engineer Mark Mims will be speaking at this online conference, presenting on how to identify user activity from streams.

Details
Jan 16

Data Day Texas
Austin, TX

Join CTO John Akred for a talk on Running Agile Data Science Teams, and VP of Engineering Stephen O’Sullivan for a talk on Choosing an HDFS data storage format (Avro vs. Parquet). Principal Engineer Mark Mims will hold Office Hours.

Details

Mark Mims

Recent Posts

Past Events

Enterprise Data World 2017

Data Day Seattle 2016

Hadoop with the Best

Data Day Texas