
Analyzing Sentiment in Caltrain Tweets
As a first step to using Twitter activity as one of the data sources for train prediction, we start with a simple question: How do Twitter users currently feel about Caltrain?
Lynette is a proactive and resourceful problem solver and an experienced CRM professional who specializes in customer strategy and solution delivery. With over 10 years of experience in complex solution development and large scale project management, Lynette delivers projects with an uncompromising commitment to quality and client satisfaction.
Cindi is the Head of Data Science at SVDS. She is a naturally collaborative problem-solver able to bridge technical and business concerns using strong communication and facilitation skills. With over fifeteen years experience of research and applications of machine learning and natural language processing across academia and industry, she brings a unique blend of academic and industry experience AI and R&D. She has also collaborated extensively to solve problems by bridging technical and business concerns using strong communication and facilitation skills.
Cindi holds a PhD and MA in Computer Sciences from UT-Austin, and a Bachelor of Science in Computer Science from NCSU. She has dozens of publications in both journals and refereed conferences and is the co-inventor on three patents.
As a first step to using Twitter activity as one of the data sources for train prediction, we start with a simple question: How do Twitter users currently feel about Caltrain?
This post explains why chatbots are rising in popularity with banks, the opportunities and challenges presented, and how data science fits into the puzzle.
This article reviews the main options for free speech recognition toolkits that use traditional HMM and n-gram language models.
The Data in Practice track focuses on modern techniques for efficient execution of your data strategy. Register now!
Join us in Santa Clara as we talk about managing data science in the enterprise, data valuation, and best practices for data visualization.
CTO John Akred, VP of Data Science Jeffrey Yau, and Senior Data Scientist Cindi Thompson will teach a three-hour tutorial in which they will share our methods and observations from three years of effectively deploying data science in enterprise organizations. Attendees will learn how to be an effective member or manager of a data science team.
With more than 7 years of experience in creating large scale applications, Luke loves to help enterprises make decisions with their data. Luke is passionate about user experience, charts and graphs, helping people, and applying data.
Daniel has experience in solving problems and building solutions in a variety of domains from home energy consumption to measuring the shapes of galaxies. For his PhD research, he developed and applied machine learning and statistical techniques to measure the structure and evolution of the expanding universe.
With a background in optical physics and biomedical research, Matt has a broad range of experiences in software development, database engineering, and data analytics. He enjoys working closely with clients to develop straightforward and robust solutions to difficult problems.
The DeepGramAI Hackathon has concluded, check out the project that Data Engineer Matthew Rubashkin worked on.
In this post, we’ll provide a short tutorial for training a RNN for speech recognition; we’re including code snippets throughout, and an accompanying GitHub repository. The software we’re using is a mix of borrowed and inspired code from existing open source projects.
One way to give back to the open source community that provides us with tools is to help others evaluate and choose those tools in a way that takes advantage of our experience. We offer this analysis, along with explanations of the various criteria upon which we based our decisions.
In this post, Matt talks about using TensorFlow to detect true and false positives in our Caltrain work.
In this post, we discuss our Raspberry Pi streaming video analysis software, which we use to better predict Caltrain delays.
In this post we’ll start looking at the nuts and bolts of making our Caltrain work possible: image processing, video analysis, and image recognition.
Larry has decades of experience helping businesses define their needs, architect solutions, builds teams and deliver results. He enjoys exploring new technology and has a passion for getting operations knowledge into engineering solutions.
In this post, we will cover some of the basics of monitoring and alerting as it relates to data pipelines in general, and Kafka and Spark in particular.
In this post, we’ll walk you through how to use tuning to make your Spark/Kafka pipelines more manageable.
An expert at navigating the ever-expanding middle ground between business capability needs and technology enablement, Justin thrives on solving complex problems using right-sized technologies, and architecting platforms that last.
In this post, we’ll cover the difference between data rights versus privacy and security, and you’ll be introduced to some projects that show how the technology industry is evolving to develop scalable solutions around intersecting concerns in data management and authorization.
We detail insights learned while attending the recent Predix Transform conference.
With over 20 years of experience in Fortune 500, Colette is passionate about competing on analytics. She has worked at Bank of America in various SVP/Quantitative Analytics roles, with over a decade in enterprise credit risk, global corporate and investment banking, commercial banking, technology and operations, and consumer banking lines of business. In other places, she has managed product development related to Big Data capabilities in support of decision management and automation.
Colette holds a masters degree in both Industrial Engineering (MSIE from Binghamton University) and Business (MBA from Wake Forest University) along with a certification as a Six Sigma Master Blackbelt. She has a full understanding of the types of business questions that surface in support of profitability growth initiatives, as well as an arsenal of analytics methods.
Let the key aspirations that define the vision for your company be the cornerstones of your approach towards digitization. This ensures that the areas in your digitization plan are the right ones to focus on.
In this post we look at how to visualize data gaps, and engage senior leadership.
In this post, we will discuss what “real” gaps in data look like and how to find them in your organization.
Several of our presenters were interviewed at Strata San Jose. If you missed the conference, check out these interviews below to catch up on some of the topics that were on our minds.
We know that digitization is a disruptive force that can help your company stand out from the competition. It needs to be a priority, but where should you start?
The Data in Practice track focuses on modern techniques for efficient execution of your data strategy. Register now!
Come find us in Austin this December. Principal Data Strategist Colette Glaeser and VP of Strategy Edd Wilder-James will be discussing how to develop a data strategy. Director of Communications Julie Steele will provide insight into how to best leverage your data through visualizations.
The SVDS crew will be in New York this year, talking about data platforms, data strategy, and making the business case for Spark. Come by our talks, or catch us in the hallway track.
Several of us will be at Enterprise Data World 2016 in San Diego. We’d love to say hi, and hear your thoughts.
Jin brings to SVDS over twenty years of experience driving data-fueled partnerships, integrations, business strategies, and analytics across multiple industries including healthcare, e-commerce, media, and software development. She is boundlessly enthusiastic about enabling business with data.
The Data in Practice track focuses on modern techniques for efficient execution of your data strategy. Register now!
Several of us will be in Chicago this year, presenting tutorials on data strategy, data platforms, and how to manage data science in the enterprise. CTO John Akred will also be taking part in a panel about how to strengthen your data strategy skills.