Scikit-Learn for Easy Machine Learning: the Vision, the Tool, and the Project Scikit-learn for easy machine learning: the vision, the tool, and the project from Gael Varoquaux Scikit-learn is a popular machine learning tool. What can it do for you?Why you you want to use it? What can you... Read more
Lynn Root at ODSC Boston 2015
Metric-Driven Development: See the Forest for the Trees At Spotify, my team struggled to be awesome. We had a very loose understanding of what product/service our squad was responsible for, and even less so of the expectations our internal and external customers had for those services. Other than “does... Read more
Wes McKinney at ODSC Boston 2015
DataFrames: The Extended Cut DataFrames: The Extended Cut from odsc This talk will give an overview of data frame libraries and toolkits across most languages and systems in use for data science and analytics today. We’ll highlight strengths and weaknesses and opportunities for community work. Presenter Bio: Wes McKinney... Read more
API Driven Development: How I Build Things and Why – Kenneth Reitz ODSC Boston 2015
API Driven Development from odsc An exposé on human-centered design, as related to data science and “medium data”. Examples of great API design will be showcased, as well as other end-user facing tools that can enable data scientists to share their observations with the world. Presenter Bio Kenneth Reitz... Read more
Agile Data – Chris Bergh ODSC Boston 2015
Agile Data from odsc To rephrase an old saying: ‘It takes a village to raise an Analyst.’ Data Analysts and Scientists are working in teams delivering insight and analysis on an ongoing basis. So how do you get the team to support experimentation and insight delivery without ending up... Read more
Using Spark, Python, and Parquet for Loading Large Datasets – Douglas Eisenstein ODSC Boston 2015
Spark, Python and Parquet from odsc Have you been in the situation where you’re about to start a new project and ask yourself, what’s the right tool for the job here? I’ve been in that situation many times and thought it might be useful to share with you a... Read more
Data Science at Dow Jones: Monetizing Data, News and Information – Juan Huerta ODSC Boston 2015
Data Science at Dow Jones: Monetizing Data, News and Information from odsc In this presentation I will describe the way Data Science supports the business of information and news at Dow Jones. Specifically, I will describe how we are introducing innovative and advanced large-scale information mining and analytic approaches... Read more
Big Data: Pig, Hive, Hadoop w/MapReduce – Gil Benghiat, Chris Bergh, Eric Estabrooks
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive from odsc The main objective of this workshop is to give the audience hands on experience with several Hadoop technologies and jump start their hadoop journey. In this workshop, you will load data and submit queries using Hadoop!... Read more
Using Open Source Solutions in Sports Business Operations – Matthew Wills ODSC Boston 2015
Using Open Source Solutions in Sports Business Operations from odsc This presentation will overview how the Grizzlies apply the use of R to their sales and marketing business operations. From basic data manipulation, to statistical modeling and enhanced visualization, the Grizzlies utilize R as a tool that efficiently positions... Read more
Using Python with Apache Storm and Kafka – Keith Bourgoin ODSC Boston 2015
http://bit.ly/KeithBourgoinPresentation As Python gains more and more traction in data science, the ability to interact with large scale data processing systems has greatly improved. Instead of being limited to what can fit on one’s laptop or having to wait for a Hadoop job to complete, we can now tap... Read more