Building a Predictive Analytics Solution with Azure ML – Fidan Boylu & Syed Fahad Allam Shah ODSC Boston 2015
Create and operationalize a predictive model using Microsoft Azure Machine Learning. Perform the typical steps involved in building a predictive analytics solution, such as data ingestion, data cleansing, data exploration, feature engineering, model selection, and evaluation of model results...
Beyond Names – Gregor Stewart ODSC Boston 2015
Finding and classifying the mentions of the things named in text, often called Named Entity Recognition or NER, is a fundamental task in many search and analysis applications. Mature, robust NER technology is available for many languages and domains, from people, places, and products, to...
How I Fight Slavery – Eric Schles ODSC Boston 2015
In this talk I will cover a set of techniques I’ve used to track and find instances of human trafficking on the internet. I’ll go over web scraping, entity recognition, some techniques for text comparison, and data storage concerns. The tool that I will be explaining...
Domain Expertise and Unstructured Data – William Macmillan & Evan Schnidman ODSC Boston 2015
Data science allows us to turn a dark forest into a world of perpetual twilight by giving us the tools to better understand the data that surrounds us. Unfortunately, in this world of twilight we still need a flashlight to get a...
Practical Mergic – Aaron Schumacher ODSC Boston 2015
Combining data sets can be a huge pain, with possible problems both obvious and insidious. Aaron will present practical approaches for detecting and avoiding potential pitfalls, as well as rigorous and repeatable processes for generating merge tables through reduction to de-duplication. The focus will be on techniques for...
Intro to Text Mining Using tm, openNLP and topicmodels – Ted Kwartler ODSC Boston 2015
You will learn how modern customer service organizations use data to understand important customer attributes and how R is used for workforce optimization. Topics include real-world examples of how R is used in large-scale operations to...
Bridging the Gap Between Data and Insight using Open-Source Tools – Nicholas Arcolano ODSC Boston 2015
Despite the proliferation of open-source tools for analysis (such as Python and R) and for visualization (such as JavaScript / D3), significant gaps often exist between these areas, and those of us trying to...
Kaggle The Home of Data Science – Anthony Goldbloom ODSC Boston 2015
Keynote presenter bio: Anthony Goldbloom is the founder and CEO of Kaggle. In 2011 and 2012, Forbes Magazine named Anthony one of the 30 under 30 in technology; in 2013 the MIT Tech Review named him one of top 35...
Probabilistic Programming in Data Science – Thomas Wiecki ODSC Boston 2015
A large number of metrics exist to evaluate the performance-risk trade-off of a portfolio. Although those metrics have proven to be useful tools in practice, most of them require a large amount of data and implicitly assume returns to be normally distributed. Bayesian modeling is a statistical...
Recurrent Neural Networks for Text Analysis – Alec Radford ODSC Boston 2015
Recurrent Neural Networks hold great promise as general sequence learning algorithms. As such, they are a very promising tool for text analysis. However, outside of very specific use cases such as handwriting recognition and, recently, machine translation, they have not seen...