fbpx
Feature Engineering – David Epstein ODSC Boston 2015
Feature Engineering from odsc One of the most important, yet often overlooked, aspects of predictive modeling is the transformation of data to create model inputs, better known as feature engineering (FE). This talk will go into the theoretical background behind FE, showing how it leverages existing... Read more
Machine Learning for a Pet Insurance Company – TJ Houk & David Jaw ODSC Boston 2015
Machine Learning for a Pet Insurance Company from odsc As an insurance company, we receive a monthly premium from policy holders and in return, we pay claims on veterinary bills. Insurance risk for pet health is relatively uncharted territory; identifying key patterns can affect the company... Read more
Frontiers of Open Data Science Research – Ani Aghababyan ODSC Boston 2015
Frontiers of Open Data Science Research from odsc Keynote Presenter Bio Ani loves writing about herself in third person and has written this all true bio. Ani is a Data Scientist for the Digital Platforms Group in McGraw-Hill Education company. She has a diverse educational background... Read more
The Art of Data Science – Josh Wills ODSC Boston 2015
The Art of Data Science from odsc Keynote Presenter Bio Josh Wills is Cloudera’s Senior Director of Data Science, working with customers and engineers to develop Hadoop-based solutions across a wide-range of industries. He is the founder and VP of the Apache Crunch project for creating... Read more
Jumping to Conclusions – Richard Robehr Bijjani ODSC Boston 2015
Jumping to Conclusions from odsc Data Science is the study of the extraction of knowledge from data. What if we extract partial or inaccurate knowledge? This illusion of knowledge would lead us to make wrong decisions, with sometimes disastrous consequences such as in the case of... Read more
Machine Learning Based Personalization Using Uplift Analytics: Examples and Applications – Victor Lo ODSC Boston 2015
Uplift Modeling Workshop from odsc Traditional randomized experiments allow us to determine the overall causal impact of a treatment program (e.g. marketing, medical, social, education, political). Uplift modeling (also known as true lift, net lift, incremental lift) takes a further step to identify individuals who are... Read more
Data Science 101 – Todd Cioffi ODSC Boston 2015
Data Science 101 from odsc Curious about Data Science? Self-taught on some aspects, but missing the big picture? Well, you’ve got to start somewhere and this session is the place to do it. This session will cover, at a layman’s level, some of the basic concepts... Read more
Can We Automate Predictive Analytics – Thomas Dinsmore ODSC Boston 2015
Can We Automate Predictive Analytics from odsc Recent news about the pending shortage of data scientists prompts speculation about automation: will machines replace human analysts? We propose a model of automation, and briefly review progress in automated machine learning over the past twenty years. Summarizing the... Read more
Learning to Love Bayesian Statistics – Allen Downey ODSC Boston 2015
http://tinyurl.com/lovebayes Bayesian statistical methods provide powerful tools for answering questions and making decisions. For example, the result of Bayesian analysis is a set of values and probabilties that can be fed directly into a cost-benefit analysis, which is not possible with conventional statistics. But there are... Read more
Predictive Modeling Workshop – Max Kuhn ODSC Boston 2015
Predictive Modeling Workshop from odsc The workshop is an overview of creating predictive models using R. An example data set will be used to demonstrate a typical workflow: data splitting, pre-processing, model tuning and evaluation. Several R packages will be shown along with the caret package... Read more