All Blog Posts

TensorFlow RNN Tutorial

In this post, we’ll provide a short tutorial for training an RNN for speech recognition; we’re including code snippets throughout, along with an accompanying GitHub repository. The software we’re using is a mix of code borrowed from, and inspired by, existing open source projects.

Spark Summit: Ignition in the Enterprise

We are excited to announce that, for Spark Summit 2017 in San Francisco, Edd Wilder-James will be joining Reynold Xin as co-chair of the Spark Summit program.

Four Data Capabilities for Telecommunications

This post looks at four business analysis capabilities that connect the dots between promising applications of data assets for telecommunications companies.

Introducing a Value-Centered View of Data Maturity

In this post, we introduce our new data maturity model and include a link to the assessment.

The Data Platform Puzzle

Building or rebuilding a data platform can be a daunting task, as most questions that need to be asked have open-ended answers. But that doesn’t mean you have to guess and use your gut.

Models: From the Lab to the Factory

Deploying a model without a rigorous process in place has consequences. We go over techniques for successful deployment and management.

Building Tech Communities

In this interview, Travis talks about how to balance enterprise and open source, as well as what it takes to build a community.

The Value of Exploratory Data Analysis

In this post, we will give a high-level overview of what EDA typically entails and then describe three of the major ways in which EDA is critical to building a model successfully and interpreting its results.

Driving Product Engagement with User Behavior Analytics

In this post, we will look at driving product engagement with behavioral data, as well as building an integrated analytical environment.