You will learn the basic concepts of machine learning – such as Modeling, Model Selection, Loss or Profit, overfitting, and validation – in a non-mathematical way, so that you can ask for data analysis and interpret the results of a model in the context of making business decisions. The concepts behind machine learning are actually quite simple, so expect to take away not just words and acronyms, but rather, a deep understanding. We will work in the context of concrete examples from different domains, including finance and medicine.
What is probability? What is a model? Supervised vs unsupervised learning. Regression and Classification.
Models and Data: Bias, Variance, Noise, Overfitting, and how to solve Overfitting with Regularization and Validation
Risk and Cost, Data snooping, asymmetry in data, experiments.
How good is a model? Profit Curves, ROC curves, and the expected value formalism.
Rahul Dave is a partner at LxPrior, a small Data Science consultancy focussed on interesting or socially important problems. LxPrior offers its clients data analysis services as well as data science training. Rahul trained as an astrophysicist, doing research on dark energy, and worked at the University of Pennsylvania, NASA’s Astrophysics Data System, as well as at Harvard University. As a computational scientist, he has developed time series databases, semantic search engines, and techniques for classifying astronomical objects. He was one of the people behind Harvard’s Data Science course CS109, and Harvard Library’s Data Science Training For Librarians course. He is currently hard at work on the book “Dealing with Data” for Manning Publications.