DO Repeat Yourself: Designing Open-Source Libraries for Modern Machine Learning
Editor’s Note: Patrick is a speaker for ODSC East 2022 this April 19th-21st. Be sure to check out his talk,  Transformers &  Datasets for Research and Production, there! “Don’t repeat yourself”, or DRY, is a well-known principle of software development. The principle originates from “The pragmatic programmer”,... Read more
Model Overload — Which NLP Model Should I Choose?
As I’m writing this, the model library on Huggingface consists of 11,256 models, and by the time you’re reading this, this number will only have increased. With so many models to choose from, it is no wonder that many get overwhelmed and don’t know any more which model... Read more
8 Ways to Perform NLP Better in 2022
A lot goes into NLP. Languages, dialects, unstructured data, and unique business needs all contribute to requiring constant innovation from the field. Going beyond NLP platforms and skills alone, having expertise in novel processes and staying afoot in the latest research are becoming pivotal for effective... Read more
The Evolution of AI Emotion and Sentiment Analysis
Artificial intelligence emotion and sentiment analysis has come a long way over the years and is on track to revolutionize the AIs of the future. Some wonder if it can ever truly understand human emotions, but computer scientists are focusing on training AI to recognize these... Read more
What Can Go Wrong When Creating Data to Enable Multilingual AI 
Editor’s note: Olga is a speaker for ODSC East 2022! Be sure to check out her talk, “Creating Data to Enable Multilingual AI: What Can Go Wrong and Ways to Mitigate It,” there! Artificial intelligence (AI), and conversational AI as one of the fastest-growing sub-domains within... Read more
Training a Medication Named-Entity Recognition Model From Scratch with spaCy
Editor’s note: Ben is a speaker for ODSC East 2022. Be sure to check out his talk, “Neural Named-Entity Recognition pipelines with spaCy,” there! The uptake of Electronic Health Record (EHR) systems grew five-fold between 2008 and 2013 and has only been increasing since. These record... Read more
Building Named Entity Recognition and Relationship Extraction Components with HuggingFace Transformers
Editor’s note: Sujit Pal is a speaker for ODSC East 2022. Be sure to check out his talk, “Transformer Based Approaches to Named Entity Recognition (NER) and Relationship Extraction (RE),” there! Named Entity Recognition (NER) is the process of identifying word or phrase spans in unstructured... Read more
GPT-3, RNNs and All That: A Deep Dive into Language Modeling
As I’ve been working on Chai I’ve been exposed to large language models (LLMs), something I didn’t really know anything about previously. In this article, I’ll summarise everything I have since learned on the subject. We’ll go from the very simple (what researchers were doing 40-ish years... Read more
Is Natural Language Processing Advanced Enough to Tackle Legal Documentation?
Natural language processing (NLP) is one of the most practical AI fields today. This technology is the driving force behind chatbots, smart speakers, and spell-checkers, and it could go further. Many law firms have started to take note of NLP’s potential. The legal industry seems like... Read more
Accelerating Business Growth with Natural Language Processing 
Editor’s Note: Sameer is a speaker for ODSC East 2022. Be sure to check out his talk, “Natural Language Processing in Accelerating Business Growth,” to learn more about NLP applications in business! Natural language Processing (NLP) is a sub-discipline within Artificial Intelligence (AI) that enables the... Read more