Archive for the ‘Architecture’ Category

Crossing the Development to Production Divide

In this post, we’ll give an overview of the obstacles we’ve faced (you may be able to relate) and discuss ways to overcome them.

Handling Small Files in MapR-FS

In this post, we will discuss how dealing with small files differs when you are using MapR-FS rather than a traditional HDFS installation.

Space Shuttle Problems: Long-term Planning Amid Changing Technology

How can you manage your implementation in a way that allows you to take maximum advantage of technology innovation as you go, rather than having to freeze your view of technology to today’s state and design something that will be outdated when it launches? You must start by deciding which pieces are necessary now, and which can wait.

Data Ingestion with Spark and Kafka

In this tutorial, we will walk you through some of the basics of using Kafka and Spark to ingest data.

How to Choose a Data Format

In this post, we provide a framework for choosing a data format, along with some example use cases.

Realize the Business Power of Your Data with DevOps

If you are on the path to being a data-driven company, you have to be on the path to being a development-enabled company.

Data Pipelines in Hadoop

In this post, we’ll look at some real-world examples of managing headaches while moving to Hadoop.

Easily Spinning up Data Platforms

A quick overview of the motivation behind our instant and repeatable data platform tool.


Making Spark and Kafka Data Pipelines Manageable with Tuning

In this post, we’ll walk you through how to use tuning to make your Spark/Kafka pipelines more manageable.