Predictive Modeling Workshop – Max Kuhn ODSC Boston 2015
Predictive Modeling Workshop from odsc The workshop is an overview of creating predictive models using R. An example data set will be used to demonstrate a typical workflow: data splitting, pre-processing, model tuning and evaluation. Several R packages will be shown along with the caret package which provides a... Read more
Making R Go Faster and Bigger – Jared Lander ODSC Boston 2015
http://bit.ly/JaredLanderPresentation The features of R that make it easy to use–dynamically typed, in-memory analysis, the interpreter engine and REPL–can also slow it down. Fortunately the R Core Team has made dramatic improvements in recent years with better memory management and faster interpretation of code. We look at some of... Read more
Probabilistic Programming in Data Science – Thomas Wiecki ODSC Boston 2015
http://bit.ly/ThomasWieckiPresentation There exist a large number of metrics to evaluate the performance-risk trade-off of a portfolio. Although those metrics have proven to be useful tools in practice, most of them require a large amount of data and implicitly assume returns to be normally distributed. Bayesian modeling is a statistical... Read more
Recurrent Neural Networks for Text Analysis – Alec Radford ODSC Boston 2015
Recurrent Neural Networks for Text Analysis from odsc Recurrent Neural Networks hold great promise as general sequence learning algorithms. As such, they are a very promising tool for text analysis. However, outside of very specific use cases such as handwriting recognition and recently, machine translation, they have not seen... Read more
Data Visualization & UX – Mark Schindler and Bang Wong ODSC Boston 2015
Data Visualization & UX from odsc Data Visualization is about helping people gain knowledge from data. The focus of this workshop is on approaches to turn data into actionable insights, combining heuristics for visual analytics with techniques from user-experience design. Participants will learn how to choose and create data... Read more
High Performance Hardware for Data Analysis – Michael Pittaro ODSC Boston 2015
High Performance Hardware for Data Analysis from odsc Choosing hardware for big data analysis is difficult because of the many options and variables involved. The problem is more complicated when you need a full cluster for big data analytics. This session will cover the basic guidelines and architectural choices... Read more
What Can Graphs Teach Us about Teachers: Using Graphs for High Quality Recommendations – Amit Bhattacharyya ODSC Boston 2015
What Can Graphs Teach Us about Teachers: Using Graphs for High Quality Recommendations from odsc Teachers Pay Teachers is an online marketplace for teachers to buy, sell and share original educational resources. As any marketplace grows, there is an increasing need to provide a customized experience so that the... Read more
Scalable Data Science and Deep Learning with H2O – Arno Candel ODSC Boston 2015
Scalable Data Science and Deep Learning with H2O from odsc The era of Big Data has passed, and the era of sensory overload – that is, the proliferation of sensor data – is upon us. The challenge today is how to create the next generation of business and consumer... Read more
Searching for Meaning in the Deep Web – Andy Terrel ODSC Boston 2015
Searching for Meaning in the Deep Web from odsc The internet is a big place and most people’s interaction with it is regulated by a few companies paid to sell you things. My team has been building tools for the DARPA Memex project to democratize search for all, with... Read more
Keeping Governments Accountable with Open Data Science – Cezary Podkul & Marc Joffe ODSC Boston 2015
Keeping Governments Accountable with Open Data Science from odsc Keeping Governments Accountable with Open Data Science: Extracting and Analyzing Municipal Financial Data from odsc Open data is enabling journalists, watchdog groups and investors to gain more insight than ever before into the finances of state and local governments. Unfortunately,... Read more