fbpx
In this post we will describe how to evaluate a predictive model. Why bother creating complex predictive models if 5% of the customers will churn anyway? Because a predictive model will rank our clients based on the probability that they  will abandon the company. It helps answer these two questions: 1.... Read more
Do Resampling Estimates Have Low Correlation to the Truth?
The Answer May Shock You. One criticism that is often leveled against using resampling methods (such as cross-validation) to measure model performance is that there is no correlation between the CV results and the true error rate. Let’s look at this with some simulated data. While this assertion is... Read more
This is a two-part series about using machine learning to hack my taste in music. In this first piece, I applied unsupervised learning techniques and tools on Pandora data to analyze songs that I like. The second part, which will be published soon, is about using supervised on Spotify data to... Read more
It wasn’t an overbooking problem. United Airlines was trying to move four flight crew members to the next airport. They forced passengers to get off the plane with the consequences we saw on the video from last Sunday, but don’t take our word for it. Let’s talk data. An elaborate... Read more
Cognitive Machine Learning (1): Learning to Explain
Above is an image of the Zaamenkomst panel: one of the best remaining exemplars of rock art from the San people of Southern Africa. As soon as you see it, you are inevitably herded, like the eland in the scene, through a series of thoughts. Does it have a meaning?  Why are the eland running?... Read more
The meaning of Artificial Intelligence (A.I) changes depending on whom is speaking. Right now the most prominent instantiation of A.I is the chatbot. Technology’s biggest companies and plucky startups allocate resources to make chatbots more impressive, but they really haven’t broken into the mainstream yet. From the outside the... Read more
The Sentiment Behind The Declaration of Independence
The American political season often conjures numerous references to the country’s origins from either side of the aisle. What better way to join in than by looking at the country’s birth using Data Science, the field that will dictate much of its future. I’ll do this by leveraging a... Read more
Every week we bring you a selection of the best data science articles we find in Cyberspace. We start with high school students writings on AI, lessons learned by one of the leading Machine Learning expert, building bots without programming, an intro into probabilistic programming and take a look... Read more
Mark Higgins ODSC Boston 2015 – Crowd Sourced Data Science Competitions
Mark higginscrowd sourced_data_science_competitions from odsc Data science is increasingly relevant for a wide range of companies. Tools and infrastructure for data science are growing but are extremely fragmented. Hiring the right data scientists and creating an environment to host and analyze the data continue to be challenges. Several startups... Read more
ODSC Interview with Gael Varoquaux
Gaël Varoquaux is an INRIA faculty researcher working on data science for brain imaging in the Neurospin brain research institute (Paris, France). His research focuses on modeling and mining brain activity in relation to cognition. Years before the NSA, he was hoping to make bleeding-edge data processing available across... Read more