Cognitive Machine Learning (1): Learning to Explain
Above is an image of the Zaamenkomst panel: one of the best remaining exemplars of rock art from the San people of Southern Africa. As soon as you see it, you are inevitably herded, like the eland in the scene, through a series of thoughts. Does it have a meaning?  Why are... Read more
Learning the Monty Hall problem
As Wikipedia gives it: Suppose you’re on a game show, and you’re given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what’s behind the doors, opens another... Read more
How the Multinomial Logistic Regression Model Works
In the pool of supervised classification algorithms, logistic regression is usually the first algorithm to try. It is further divided into categories, based purely on the number of target classes. If the logistic regression model is used for addressing the... Read more
When the bootstrap doesn’t work
The bootstrap always works, except sometimes. By ‘works’ here, I mean in the weakest senses that the large-sample bootstrap variance correctly estimates the variance of the statistic, or that the large-scale percentile bootstrap intervals have their nominal coverage. I don’t mean the stronger sense that someone like... Read more
Generating data with random Gaussian noise
I recently needed to generate some data for xx, with some added Gaussian noise. This comes in handy when you want to generate data with an underlying regularity that you want to discover, for example when testing different machine learning algorithms. What I wanted to get... Read more
How to be data-driven without data…
…and then make better use of the data you get. The usefulness of data science begins long before you collect the first data point. It can be used to describe very clearly your questions and your assumptions, and to analyze in a consistent manner what they... Read more
The phenomenon of the “yearly sports game release” is a well-established tradition in the videogame industry. The biggest is, perhaps, the FIFA franchise, the reigning leader in its niche, simulated soccer, for most of its more than twenty-year history. EA Sports released the latest iteration,... Read more
The meaning of Artificial Intelligence (A.I) changes depending on who is speaking. Right now the most prominent instantiation of A.I is the chatbot. Technology’s biggest companies and plucky startups allocate resources to make chatbots more impressive, but they really haven’t broken into the mainstream yet. From... Read more
No one can master every algorithm. However, there is a basic toolbox that every Data Scientist understands and uses. One of the algorithms in this toolbox is Principal Component Analysis – P.C.A for short. It is an unsupervised learning technique used in many different fields to... Read more
Within soccer’s nascent analytics movement, one metric dominates most discussions. It’s called Expected Goals or xG. Models for calculating xG differ, but the underlying concept is the same. In a nutshell, xG takes a shot’s characteristics – distance from goal, angle from goal, root cause, etc.... Read more