fbpx
Predictive Modeling Workshop – Max Kuhn ODSC Boston 2015
Predictive Modeling Workshop from odsc The workshop is an overview of creating predictive models using R. An example data set will be used to demonstrate a typical workflow: data splitting, pre-processing, model tuning and evaluation. Several R packages will be shown along with the caret package... Read more
Making R Go Faster and Bigger – Jared Lander ODSC Boston 2015
http://bit.ly/JaredLanderPresentation The features of R that make it easy to use–dynamically typed, in-memory analysis, the interpreter engine and REPL–can also slow it down. Fortunately the R Core Team has made dramatic improvements in recent years with better memory management and faster interpretation of code. We look... Read more
Probabilistic Programming in Data Science – Thomas Wiecki ODSC Boston 2015
http://bit.ly/ThomasWieckiPresentation There exist a large number of metrics to evaluate the performance-risk trade-off of a portfolio. Although those metrics have proven to be useful tools in practice, most of them require a large amount of data and implicitly assume returns to be normally distributed. Bayesian modeling... Read more
Recurrent Neural Networks for Text Analysis – Alec Radford ODSC Boston 2015
Recurrent Neural Networks for Text Analysis from odsc Recurrent Neural Networks hold great promise as general sequence learning algorithms. As such, they are a very promising tool for text analysis. However, outside of very specific use cases such as handwriting recognition and recently, machine translation, they... Read more
Machine Learning for Suits – Rahul Dave ODSC Boston 2015
Machine Learning for Suits from odsc You will learn the basic concepts of machine learning – such as Modeling, Model Selection, Loss or Profit, overfitting, and validation – in a non-mathematical way, so that you can ask for data analysis and interpret the results of a... Read more
On Demand Analytic and Learning Environments with Jupyter – Kyle Kelley and Andrew Odewahn ODSC Boston 2015
http://bit.ly/Odewahn_KelleyPresentation The Jupyter/IPython project has been building systems to enable collections of users to work on a shared system within their team, lab, and on a wide web audience. There is the multi user server JupyterHub, the temporary notebook system (tmpnb), blossoming Google Drive integration (jupyter-drive),... Read more
What Can Graphs Teach Us about Teachers: Using Graphs for High Quality Recommendations – Amit Bhattacharyya ODSC Boston 2015
What Can Graphs Teach Us about Teachers: Using Graphs for High Quality Recommendations from odsc Teachers Pay Teachers is an online marketplace for teachers to buy, sell and share original educational resources. As any marketplace grows, there is an increasing need to provide a customized experience... Read more
Scalable Data Science and Deep Learning with H2O – Arno Candel ODSC Boston 2015
Scalable Data Science and Deep Learning with H2O from odsc The era of Big Data has passed, and the era of sensory overload – that is, the proliferation of sensor data – is upon us. The challenge today is how to create the next generation of... Read more
Searching for Meaning in the Deep Web – Andy Terrel ODSC Boston 2015
Searching for Meaning in the Deep Web from odsc The internet is a big place and most people’s interaction with it is regulated by a few companies paid to sell you things. My team has been building tools for the DARPA Memex project to democratize search... Read more
Keeping Governments Accountable with Open Data Science – Cezary Podkul & Marc Joffe ODSC Boston 2015
Keeping Governments Accountable with Open Data Science from odsc Keeping Governments Accountable with Open Data Science: Extracting and Analyzing Municipal Financial Data from odsc Open data is enabling journalists, watchdog groups and investors to gain more insight than ever before into the finances of state and... Read more