Training a Medication Named-Entity Recognition Model From Scratch with spaCy
Editor’s note: Ben is a speaker for ODSC East 2022. Be sure to check out his talk, “Neural Named-Entity Recognition pipelines with spaCy,” there! The uptake of Electronic Health Record (EHR) systems grew five-fold between 2008 and 2013 and has only been increasing since. These record... Read more
Learning Data Science: Is a Mini Bootcamp Right For You?
You know you want to make the jump to data science, but when you search for “data science bootcamps” and “data science courses,” you find tons of results like “I quit my job to join a bootcamp” and “How to become a data scientist in three... Read more
Top 10 AI Companies Coming to the ODSC East AI Expo Hall
With ODSC East right around the corner, coming April 19-21, it’s time to focus on some of the hot topics that you need to know. On everyone’s mind, of course, is artificial intelligence: where it’s going and what the future will look like as the scale... Read more
The Data Scientific Method vs. The Scientific Method
Editor’s note: Patrick Hall is a speaker for ODSC East 2022 this April 19th-21st. Be sure to check out his talk, “A Tutorial on Machine Learning Model Governance,” there! Data science has a science problem. A lot of data science seems much more aligned with cargo... Read more
Building Named Entity Recognition and Relationship Extraction Components with HuggingFace Transformers
Editor’s note: Sujit Pal is a speaker for ODSC East 2022. Be sure to check out his talk, “Transformer Based Approaches to Named Entity Recognition (NER) and Relationship Extraction (RE),” there! Named Entity Recognition (NER) is the process of identifying word or phrase spans in unstructured... Read more
Benchmarking a Computer Vision Deep Learning Pipeline with Distributed Computing
Editor’s Note: Jennifer is a speaker for ODSC East 2022. Be sure to check out her talk, “Creating a Benchmark for a Large-Scale Image Captioning Pipeline,” to learn more about computer vision deep learning there! Computer vision has an essential role in solving some of the... Read more
10 Steps to Become a More Responsible Data Scientist
Responsible AI is a big-tent idea that’s generated a lot of attention over the last few years. For those of us on the front lines (data scientists, machine learning (ML) engineers, researchers, and others), the key question is: how do we build responsible AI in... Read more
Supercharge Your Pandas Code with Apache Spark
Editor’s Note: Itai Yaffe and Daniel Haviv are speakers for ODSC East 2022. Be sure to check out their talk, “A bamboo of Pandas: crossing Pandas’ single-machine barrier with Apache Spark,” there! Pandas is a fast and powerful open-source data analysis and manipulation framework written in... Read more
Optimizing Your Model for Inference with PyTorch Quantization
Editor’s Note: Jerry is a speaker for ODSC East 2022. Be sure to check out his talk, “Quantization in PyTorch,” to learn more about PyTorch quantization! Quantization is a common technique that people use to make their models run faster, with a lower memory footprint and lower... Read more
Introduction to Model Validation Toolkit
Editor’s note: Alex & Matt are speakers for ODSC East 2022. Be sure to check out their talk, “Introducing Model Validation Toolkit,” there! There are a number of tools and concepts that go into assuring a machine learning pipeline. We’ve built solutions for some of the... Read more