What are Recommender Systems and Why Should I Care?
Introduction Every time you want to acquire a new product or service from a physical store or business, you may go there and ask to speak to a salesperson. You will tell them what exactly it is you want from said product, such as a laptop specifically with an SSD and... Read more
How Well Did Data Scientists Predict the 2018 World Cup? (Hint: Not Very)
This year’s World Cup in Russia was the most watched sporting event in history. GlobalWebIndex reports that up to 3.4 billion people – around half of the world’s population – watched some part of the tournament. As with past World Cups, a global prediction market emerged allowing spectators to... Read more
A Different Use of Time Series to Identify Seasonal Customers
I had previously written about creatively leveraging your data using segmentation to learn about a customer base.  The article is here.  In the article I mentioned utilizing any data that might be relevant.  Trying to identify customers with seasonal usage patterns was one of the variables that I mentioned that sounded interesting. ... Read more
Introduction to Machine Learning for Non-Developers
About Machine Learning We all know that machine learning is about handling data, but it also can be seen as: The art of finding order in data by browsing its inner information. Some background on predictive models There are several types of predictive models. These models usually have several... Read more
Graph algorithms and software prefetching
A lot of data in the real world can be represented as graphs: you have nodes connected through edges. For example, you are a node in a graph where friendships are edges. I recently met with professor Semih Salihoglu, an expert in graph databases and algorithms. We discussed fun problem... Read more
Demystifying Black-Box Models with SHAP Value Analysis
As an Applied Data Scientist at Civis, I implement the latest data science research to solve real-world problems. We recently worked with a global tool manufacturing company to reduce churn among their most loyal customers. A newly proposed tool, called SHAP (SHapley Additive exPlanation) values, allowed us to build a... Read more
Interviewing is a noisy prediction problem
I have done roughly 2,000 interviews in my life. When I started recruiting, I had so much confidence in my ability to assess people. Let me just throw a couple of algorithm questions at a candidate and then I’ll tell you if they are good or not! Over time... Read more
Feature Engineering for Time Series Analysis – ODSC East 2018
What is a time series analysis in data science? How can you construct an effective time series analysis? What does feature engineering have to do with time series analysis? These questions and more were many of the talking points from the talk by Michael Schmidt, PhD, Chief Scientist at... Read more
LIME Can Make You Better at Machine Learning
LIME is a crucial machine learning tool that can tackle one of the biggest issues in machine learning is the issue of interpretability. You can think of interpretability as explaining how and why a model makes predictions. In this age of the super black box model, it may be... Read more
What lies beyond Big Data Analysis?
Beyond big data analysis lies an innovation known as cognitive analysis, which is capable of providing insights with minimum human support. Information accumulating from disparate sources, differing in formats, is known as big data. This data is essential for organizations as it is capable of providing intuitive insights that... Read more
Open Data Science - Your News Source for AI, Machine Learning & more