fbpx
Using Python with Apache Storm and Kafka – Keith Bourgoin ODSC Boston 2015
http://bit.ly/KeithBourgoinPresentation As Python gains more and more traction in data science, the ability to interact with large scale data processing systems has greatly improved. Instead of being limited to what can fit on one’s laptop or having to wait for a Hadoop job to complete, we... Read more
API Driven Development: How I Build Things and Why – Kenneth Reitz ODSC Boston 2015
API Driven Development from odsc An exposé on human-centered design, as related to data science and “medium data”. Examples of great API design will be showcased, as well as other end-user facing tools that can enable data scientists to share their observations with the world. Presenter... Read more
Using Spark, Python, and Parquet for Loading Large Datasets – Douglas Eisenstein ODSC Boston 2015
Spark, Python and Parquet from odsc Have you been in the situation where you’re about to start a new project and ask yourself, what’s the right tool for the job here? I’ve been in that situation many times and thought it might be useful to share... Read more
Building a Predictive Analytics Solution with Azure ML  – Fidan Boylu & Syed Fahad Allam Shah ODSC Boston 2015
Building a Predictive Analytics Solution with Azure ML from odsc Create and operationalize a predictive model using Microsoft Azure Machine Learning. –Perform the typical steps involved in building a predictive analytics solution such as data ingestion, data cleansing, data exploration, feature engineering, model selection and evaluation... Read more
Beyond Names – Gregor Stewart ODSC Boston 2015
Beyond Names from odsc Finding and classifying the mentions of the things named in text, often called Named Entity Recognition or NER, is a fundamental task in many search and analysis applications. Mature, robust NER technology is available for many languages and domains, from people, places,... Read more
Domain Expertise and Unstructured Date – William Macmillan & Evan Schnidman ODSC Boston 2015
Domain Expertise and Unstructured Data from odsc Data science allows us to turn a dark forest into a world of perpetual twilight by giving us the tools to better understand the data that surrounds us. Unfortunately, in this world of twilight we still need a flashlight... Read more
Practical Mergic – Aaron Schumacher ODSC Boston 2015
http://bit.ly/PracticalMergic Combining data sets can be a huge pain, with possible problems both obvious and insidious. Aaron will present practical approaches for detecting and avoiding potential pitfalls, as well as rigorous and repeatable processes for generating merge tables through reduction to de-duplication. The focus will be... Read more
How I Fight Slavery – Eric Schles ODSC Boston 2015
http://bit.ly/EricSchlesODSCTalk In this talk I will be covering a set to techniques I’ve used to track and find instances of human trafficking on the internet. I’ll be going over web scraping, entity recognition, some techniques for text comparison, and data storage concerns. The tool that I... Read more
Bridging the Gap Between Data and Insight using Open-Source Tools – Nicholas Arcolano ODSC Boston 2015
Bridging the Gap Between Data and Insight using Open-Source Tools from odsc Despite the proliferation of open-source tools for analysis (such as Python and R) and those used for visualization (such as Javascript / D3), there often exist significant gaps between these areas, and those of... Read more
Kaggle The Home of Data Science – Anthony Goldbloom ODSC Boston 2015
Kaggle The Home of Data Science from odsc Keynote Presenter Bio Anthony Goldbloom is the founder and CEO of Kaggle. In 2011 & 2012, Forbes Magazine named Anthony as one of the 30 under 30 in technology, in 2013 the MIT Tech Review named him one... Read more