The Coolest Natural Language Processing Applications
Natural Language Processing (NLP) is one of the most interesting areas of Data Science. From analysis of the political arena, to organizing meetings, and forming the bedrock of the dream of strong A.I, training computers to truly understand the nuances of human language is part of the yet unreached... Read more
Data science is an interdisciplinary endeavor, and it serves the purpose of extracting insight from varying sources of information. Various communities come together at Data Science Conferences to share their knowledge and promote innovation. It is not surprising, then, that the tools showcased by data scientists at ODSC East... Read more
Data Science on Twitter
By: Jason O’Rawe – ODSC data science team contributor Twitter is an indispensable resource for data scientists as well as for the broader data science community. With the right connections, you can use twitter to learn data science, discover new technologies, computational tools and methodologies, and you can contribute... Read more
Amazon Machine Learning: Nice and Easy or Overly Simple?
Can the new Amazon Machine Learning help companies reap the benefits of predictive analytics? Machine Learning as a Service (MLaaS) promises to put data science within the reach of companies. In that context, Amazon Machine Learning is a predictive analytics service with binary/multiclass classification and linear regression features. The service... Read more
Intro to Text mining using R
Abstract: Attendees will learn the foundations of text mining approaches in addition to learn basic text mining scripting functions used in R. The audience will learn what text mining is, then perform primary text mining such as keyword scanning, dendogram and word cloud creation. Later participants will be able... Read more
The recent Republican and Democratic debates drew unprecedented amounts of viewers and the usual lot of controversies and soundbites in the media. Each debate deeply impacted future polls, subsequent fundraising, and the composition of the race. In our polarized media landscape, ensuing political analysis always suffer from political bias.... Read more
Riding on Large Data with Scikit-learn
What’s a Large Data Set? A data set is said to be large when it exceeds 20% of the available RAM for a single machine. Which for your standard MacBook Pro with 8Gb of RAM, corresponds to a meager 2Gb dataset — size that is becoming more and more... Read more
Top 10 Signs of the Textpocalypse – Steve Cohen ODSC Boston 2015
Top 10 Signs of the Textpocalypse from odsc Human-generated text may be the next frontier for big data analysis, but we humans are complicated beasts and the text we generate is messy and complicated in ways that can confound analysis. We’ll describe the top ten mistakes people make when... Read more
Mark Higgins ODSC Boston 2015 – Crowd Sourced Data Science Competitions
Mark higginscrowd sourced_data_science_competitions from odsc Data science is increasingly relevant for a wide range of companies. Tools and infrastructure for data science are growing but are extremely fragmented. Hiring the right data scientists and creating an environment to host and analyze the data continue to be challenges. Several startups... Read more
Ted Dunning at ODSC Boston 2015
Anomaly Detection In a complex world of rapidly changing information, it’s not easy to find something that is rare – but it’s even harder if you do not know what to look for. In this webinar, Ted Dunning and Ellen Friedman, authors of the O’Reilly ebook, Practical Machine Learning:... Read more