Google’s TensorFlow framework spread like wildfire upon its release. The slew of tutorials and extensions made an already robust ecosystem even more so. Recently, Google released one of its own extensions: SyntaxNet, a TensorFlow-based syntactic parser for Natural Language Understanding. SyntaxNet uses neural networks to model... Read more
Identifying Hate Speech
All the beauty of the internet age comes with its fair share of ugliness. Recently, a deluge of articles highlighting the dark side of Twitter has raised concerns for its future. As great as it is to engage with others on a variety of topics, as of late it’s... Read more
Social media has fundamentally changed the way in which we interact with each other, and with the World Wide Web. Our web activities are now inherently social. We can keep in touch with close friends on Facebook without ever needing to pick up a phone or get on a... Read more
The Coolest Natural Language Processing Applications
Natural Language Processing (NLP) is one of the most interesting areas of Data Science. From analyzing the political arena, to organizing meetings, to forming the bedrock of the dream of strong A.I., training computers to truly understand the nuances of human language is part of the yet unreached... Read more
Intro to Text Mining Using R
Abstract: Attendees will learn the foundations of text mining approaches as well as the basic text mining scripting functions used in R. The audience will learn what text mining is, then perform primary text mining such as keyword scanning, dendrogram and word cloud creation. Later participants will be able... Read more
The recent Republican and Democratic debates drew unprecedented numbers of viewers and the usual lot of controversies and soundbites in the media. Each debate deeply impacted future polls, subsequent fundraising, and the composition of the race. In our polarized media landscape, ensuing political analysis always suffers from political bias.... Read more
Top 10 Signs of the Textpocalypse – Steve Cohen ODSC Boston 2015
Human-generated text may be the next frontier for big data analysis, but we humans are complicated beasts and the text we generate is messy and complicated in ways that can confound analysis. We’ll describe the top ten mistakes people make when... Read more
Machine-in-the-loop for Knowledge Discovery – Max Kleiman-Weiner ODSC Boston 2015
I’ll present the new knowledge discovery tools we are building at Diffeo. Unlike traditional search engines that use keywords, Diffeo provides an in-browser knowledge base that accelerates information gathering about people, companies, chemical compounds, cyber events, or other real world entities. I’ll describe... Read more
Beyond Names – Gregor Stewart ODSC Boston 2015
Finding and classifying the mentions of the things named in text, often called Named Entity Recognition or NER, is a fundamental task in many search and analysis applications. Mature, robust NER technology is available for many languages and domains, from people, places, and products, to... Read more
Domain Expertise and Unstructured Data – William Macmillan & Evan Schnidman ODSC Boston 2015
Data science allows us to turn a dark forest into a world of perpetual twilight by giving us the tools to better understand the data that surrounds us. Unfortunately, in this world of twilight we still need a flashlight to get a... Read more