fbpx
Web Scraping News Articles in Python
This article is the second of a series in which I will cover the whole process of developing a machine learning project. If you have not read the first one, I strongly encourage you to do it here. The project involves the creation of a real-time web application that gathers data from several newspapers... Read more
State-of-the-Art Natural Language Understanding at Scale
For many of you in data science, natural language processing is a critical component of your projects. David Talby of Pacific.ai is here to introduce Apache Spark’s new NLP library and outline how it can facilitate your NLP pipeline for higher accuracy and faster results using the same amount... Read more
What Businesses Should Know About Speech Technologies
One of the top workshops at ODSC West last year (2018) was a talk by Omar Tawakol, the founder of Voicea. His company created a voice assistant that transformed meetings by handling lower order tasks like note-taking. Cisco acquired Voicea, most likely to integrate it into Webex as part... Read more
Do Android Composers Dream of Electric Keyboards?
Editor’s Note: If you’re interested in the idea of AI with a dream of electric keyboards, see Joseph’s talk “The Soul of a New AI” at ODSC Europe 2019. My journey in AI begins with grammar. Raised in a mathematical home, I think I was discovering prime numbers when... Read more
Watch: State of the Art Natural Language Understanding at Scale
Natural language understanding is a key component in many data science systems that must understand or reason about text. Common use cases include question answering, paraphrasing or summarization, sentiment analysis, natural language BI, language modeling, and disambiguation. Building such systems usually requires combining three types of software libraries: NLP... Read more
Watch: Understanding Unstructured Data with Language Models
As data scientists, we’ve seen a rapid improvement in the last decades in the tools available for working with structured data (be it tabular data, graph data, sensor data etc.). Yet, the vast majority of our data (Merrill Lynch puts the figure at roughly 90%) is *unstructured*, and lives... Read more
20 Open Datasets for Natural Language Processing
Natural language processing is a significant part of machine learning use cases, but it requires a lot of data and some deftly handled training. In 25 Excellent Machine Learning Open Data Sets, we listed Amazon Reviews and Wikipedia Links for general NLP and the Standford Sentiment Treebank and Twitter... Read more
The Most Exciting Natural Language Processing Research of 2019 So Far
The data revolution isn’t just about numbers, as researchers are teaching machines how to process natural language as data. The evolving capacity that machines have to interpret human speech, whether written or spoken, opens new possibilities for the interactions between computers and people. Below, we have highlighted some of... Read more
Ben Vigoda on the New Era of NLP
Many apps and programs claim to be able to understand you and are at least capable of engaging in superficial interactions. Spend long enough talking to one of these programs, however, and you’ll no doubt see the hallmarks of imperfectly reproduced natural language. That technology has not yet caught... Read more
The Power of Universal Latent Space In Medical Breakthroughs
The cost to develop new medicines has grown tremendously despite our computing and medical advances. Although we have a greater need than ever for massive breakthroughs in medical science, the industry itself is still a slow, highly regimented field. Mason Victors, CTO and CPO for Recursion Pharmaceuticals believes that... Read more