All Blog Posts

Open Source Toolkits for Speech Recognition

This article reviews the main free toolkits for speech recognition that use traditional HMM and n-gram language models.

Breaking Down Communication Barriers in Tech

Travis and I discuss breaking down silos, the importance of effectively communicating about cutting-edge technology, and where Anaconda is going next.

Analyzing Caltrain Delays

In this post, we will explore some aspects of the train delay data we’ve been collecting from the Caltrain API.

Getting Started with Deep Learning

One way to give back to the open source community that provides us with tools is to help others evaluate and choose those tools in a way that takes advantage of our experience. We offer this analysis, along with explanations of the various criteria upon which we based our decisions.

The ROI of a Modern Data Strategy

In this post, we look at the three components you can use to determine your data strategy’s ROI.

TensorFlow Image Recognition on a Raspberry Pi

In this post, Matt talks about using TensorFlow to detect true and false positives in our Caltrain work.

Data Opportunities in Insurance

In this post, we explore how data is changing the insurance industry, through the lens of auto insurance underwriting.

Noteworthy Links: Using Data Creatively

Being data-driven means breaking down silos within organizations, promoting communication, and being deliberate about the data you collect and use. Here are five articles that illustrate how modern organizations are tackling this challenge.

Avoiding Common Mistakes with Time Series Analysis

A basic mantra in statistics and data science is that correlation is not causation: just because two things appear to be related doesn’t mean that one causes the other. This is a lesson worth learning.
