Using Text Features to Predict the Great Stock Market Crash of 1929
Predicting financial crises is notoriously difficult. This is primarily a consequence of the infrequency of such events and the instability of relationships between financial variables. However, it is also related to the contagious nature of financial crises: if one bank expects another to liquidate its holdings... Read more
On The Dangers of Stochastic Parrots: Can Language Models Be Too Big?
In March this year, the ACM FAccT conference published the paper titled above, authored by Emily Bender, Timnit Gebru, Angelina McMillan-Major, and “Shmargaret Shmitchell”. This paper attracted intense controversy, and led to the exit of Gebru and Margaret Mitchell (“Shmargaret Shmitchell” in the paper) from the Google AI Ethics team.... Read more
What Happens When You Run SRL Experiments Over a BERT Based Model? 
Transformers have made more progress in the past few years than NLP made in the previous generation. Standard NLU approaches first learn syntactical and lexical features to explain the structure of a sentence. Earlier NLP models would be trained to understand the basic syntax of language... Read more
Performing IMDb Sentiment Analysis with GloVe Embeddings
The GloVe model came out in 2014, a year after the Word2Vec paper. The GloVe and Word2Vec models are similar in that the embedding generated for a word is determined by the words that occur around it. However, these context words occur with different frequencies. Some of these context... Read more
Build NLP and Conversational AI Apps with Transformers and Large Scale Pre-Trained Language Models
Transformers have taken the AI research and product community by storm. We have seen them advancing multiple fields in AI such as natural language processing (NLP), computer vision, and robotics. In this blog, I will share some background in conversational AI, NLP, and transformer-based large-scale language... Read more
Top Applications of NLP in 2021
Data in the form of text is increasingly commonplace. Businesses have plenty of text-based surveys and emails to plow through, researchers often use social media posts for analysis, and so on. It should be no surprise that NLP is becoming a must-have skillset for data scientists... Read more
The Pile Dataset: EleutherAI’s Massive Project to Help Train NLP Models
Recently, EleutherAI – a small group of researchers devoted to open-source AI research – created The Pile, a massive dataset designed to train NLP models, such as GPT-2 and GPT-3, among others. The dataset is open-source, contains over 800GB of English language data, and is still... Read more
Top 14 NLP Job-Ready Skills for 2021
NLP was one of the hottest skills in 2019 and 2020, for good reason. Companies have a lot of text to work with and many applicants to apply it across the business. We will discuss the top applications of NLP in part II of this two-part... Read more
Learn NLP the Stanford Way — Lesson 1
The AI area of Natural Language Processing, or NLP, through its gigantic language models (yes, GPT-3, I’m watching you), presents what is perceived as a revolution in machines’ capabilities to perform the most diverse language tasks. Due to that, the perception of the public... Read more
The State of Enterprise NLP in 2020
The state of enterprise NLP in 2020. This has been a unique year for public health, professional life, the economy, and just about every other aspect of daily life. While some doors are closing, and others are pivoting their business models, businesses that haven’t taken a... Read more