Harnessing Machine Learning on Big Data with PySpark on AWS
Editor’s note: Suman Debnath is a speaker for ODSC APAC this August 22-23. Be sure to check out his talk, “Build Classification and Regression Models with Spark on AWS,” there! In the unceasingly dynamic arena of data science, discerning and applying the right instruments can significantly... Read more
Demystifying Machine Learning: Popular ML Libraries and Tools
As a senior data scientist, I often encounter aspiring data scientists eager to learn about machine learning (ML). It’s a fascinating field that can seem daunting at first, but I assure you, with the right mindset and resources, anyone can master it. In this comprehensive guide,... Read more
Decision Trees From Scratch With Python
We already know a single decision tree can work surprisingly well. The idea of constructing a forest from individual trees seems like the natural next step. Today you’ll learn how the Random Forest classifier works and implement it from scratch in Python. This is the sixth of many... Read more
Area Under the Curve and Beyond with Integrated Discrimination Improvement and Net Reclassification
TLDR: AUC is a good starting metric when comparing the performance of two models, but it does not always tell the whole story. NRI looks at the new model's ability to correctly reclassify cancers and benigns and should be used alongside AUC. IDI quantifies improvement of the slopes of... Read more
7 Pitfalls to Avoid While Using Model-Agnostic Interpretation Techniques
Interpretable machine learning techniques are becoming more popular in the data science community as increasingly complex, hard-to-interpret machine learning algorithms are adopted. Model-agnostic interpretation techniques do not depend on the underlying model, but they have the capability to interpret the... Read more
Getting Up to Speed on Real-Time Machine Learning with Spark and SBERT
Editor’s note: Dillon Bostwick and Avinash Sooriyarachchi are speakers for ODSC Europe 2023 this June 14th-15th. Be sure to check out their talk, “Getting Up to Speed on Real-Time Machine Learning,” there! The benefits of real-time machine learning are becoming increasingly apparent. Digital native companies have... Read more
Building a Pizza Delivery Service with a Real-Time Analytics Stack
Editor’s note: Mark Needham is a speaker for ODSC Europe this June. Be sure to check out his talk, “Building a Real-time Analytics Application for a Pizza Delivery Service,” there! Gartner defines Real-Time Analytics as follows: Real-time analytics is the discipline that applies logic and mathematics... Read more
Streaming Machine Learning Without a Data Lake
Editor’s note: Kai Waehner is a speaker for ODSC Europe this June. Be sure to check out his talk, “Apache Kafka for Real-Time Machine Learning Without a Data Lake,” there! The combination of data streaming and machine learning (ML) enables you to build one scalable, reliable,... Read more
Financial Market Challenges and ML-Supported Asset Allocation
Editor’s note: Peter Schwendner, PhD is a speaker for ODSC Europe this June. Be sure to check out his talk, “ML Applications in Asset Allocation and Portfolio Management,” there! The year 2022 presented two significant turnarounds for tech: the first one is the immediate public visibility... Read more
Production Machine Learning for Mission-Critical Applications
Editor’s note: Robert Crowe is a speaker for ODSC Europe this June. Be sure to check out his talk, “Production ML for Mission-Critical Applications,” there! Deploying advanced machine learning technology to serve customers and/or business needs requires a rigorous approach and production-ready systems. This is especially... Read more