fbpx
Building a Predictive Analytics Solution with Azure ML  – Fidan Boylu & Syed Fahad Allam Shah ODSC Boston 2015
Building a Predictive Analytics Solution with Azure ML from odsc Create and operationalize a predictive model using Microsoft Azure Machine Learning. –Perform the typical steps involved in building a predictive analytics solution such as data ingestion, data cleansing, data exploration, feature engineering, model selection and evaluation... Read more
Beyond Names – Gregor Stewart ODSC Boston 2015
Beyond Names from odsc Finding and classifying the mentions of the things named in text, often called Named Entity Recognition or NER, is a fundamental task in many search and analysis applications. Mature, robust NER technology is available for many languages and domains, from people, places,... Read more
Domain Expertise and Unstructured Date – William Macmillan & Evan Schnidman ODSC Boston 2015
Domain Expertise and Unstructured Data from odsc Data science allows us to turn a dark forest into a world of perpetual twilight by giving us the tools to better understand the data that surrounds us. Unfortunately, in this world of twilight we still need a flashlight... Read more
Practical Mergic – Aaron Schumacher ODSC Boston 2015
http://bit.ly/PracticalMergic Combining data sets can be a huge pain, with possible problems both obvious and insidious. Aaron will present practical approaches for detecting and avoiding potential pitfalls, as well as rigorous and repeatable processes for generating merge tables through reduction to de-duplication. The focus will be... Read more
How I Fight Slavery – Eric Schles ODSC Boston 2015
http://bit.ly/EricSchlesODSCTalk In this talk I will be covering a set to techniques I’ve used to track and find instances of human trafficking on the internet. I’ll be going over web scraping, entity recognition, some techniques for text comparison, and data storage concerns. The tool that I... Read more
Bridging the Gap Between Data and Insight using Open-Source Tools – Nicholas Arcolano ODSC Boston 2015
Bridging the Gap Between Data and Insight using Open-Source Tools from odsc Despite the proliferation of open-source tools for analysis (such as Python and R) and those used for visualization (such as Javascript / D3), there often exist significant gaps between these areas, and those of... Read more
Kaggle The Home of Data Science – Anthony Goldbloom ODSC Boston 2015
Kaggle The Home of Data Science from odsc Keynote Presenter Bio Anthony Goldbloom is the founder and CEO of Kaggle. In 2011 & 2012, Forbes Magazine named Anthony as one of the 30 under 30 in technology, in 2013 the MIT Tech Review named him one... Read more
Intro to Text Mining Using tm, openNLP and topicmodels – Ted Kwartler ODSC Boston 2015
Intro to Text Mining Using tm, openNLP and topicmodels from odsc You will learn how modern customer service organizations use data to understand important customer attributes and how R is used for workforce optimization. Topics include real world examples of how R is used in large... Read more
Learning to Love Bayesian Statistics – Allen Downey ODSC Boston 2015
http://tinyurl.com/lovebayes Bayesian statistical methods provide powerful tools for answering questions and making decisions. For example, the result of Bayesian analysis is a set of values and probabilties that can be fed directly into a cost-benefit analysis, which is not possible with conventional statistics. But there are... Read more
Data Workflows for Iteration, Collaboration, and Reproducibility – David Chudzicki ODSC Boston 2015
http://www.davidchudzicki.com/slides/odsc-2015-workflow/ For other data scientists to improve, build on, or even just trust your analysis, they need to be able to reproduce it. Even if you have shared code and data, reproducing your analysis may be difficult: which code was executed against which data in what... Read more