Understanding the “Machine Learning Way” to Solve Business Problems through Real-World Scenarios 
Ironically, one of the foremost barriers preventing the exploitation of machine learning in a business is neither the implementation of the algorithm nor the retrieval of the data (the how): the toughest part is to recognize the right occasion to use it (the why)! We need to... Read more
The ODSC Warmup Guide to PyTorch
PyTorch is an open-source framework built for developing machine learning and deep learning models. In particular, this framework provides the stability and support required for building computational models in the development phase and deploying them in the production phase. PyTorch functionalities are extensible with other Python... Read more
Forecasting with Cohort-Based Models
Editor’s note: Nicolai Vicol is a speaker for ODSC West 2021. Check out his talk, “Forecasting with Cohort-Based Models,” there! Cohort-based models are an alternative to time series models for forecasting paid subscriptions. TLDR on Cohort-Based Models: A company offering subscriptions (e.g.... Read more
Batteries-Included Workflow Orchestration Tool: Flyte
Editor’s note: The authors are speaking at ODSC West 2021. Be sure to check out their talk, “Deep Dive into Flyte,” there! Machine learning (ML) has been deployed in industry for a few decades, but the tooling to support researchers and engineers in this... Read more
The ODSC Warmup Guide to Apache Airflow
Apache Airflow is a workflow automation platform that programmatically schedules and monitors the workflows in data pipelines. Airflow makes it simpler to set up and operate an end-to-end data pipeline in the cloud. You can use Airflow to manage and create workflows without worrying about the... Read more
22 Machine Learning Open Datasets for 2021
It’s that time again. We know you’re diligently working on your machine learning skills, and it’s time to find datasets worthy of the challenge. Whether you’re new to the field or looking for some inspiration, here are some great machine learning open datasets for training models.... Read more
6 Trending Python Machine Learning Packages on PyPI
Python is the most popular programming language for data science, and its packages, frameworks, and libraries are pulled by the millions each month. Month-to-month, Python packages reflect growing trends in the field of data science; as NLP is talked about more often, so will we see more packages... Read more
The Warmup Guide to OpenAI Gym
Founded in 2015 by Elon Musk, Sam Altman, and several others, OpenAI is a non-profit company dedicated to building friendly AI that is beneficial for everyone. One of its most well-known products is the OpenAI Gym. Gym is a toolkit for developing and comparing reinforcement learning... Read more
In 2020, SAS and Microsoft announced their partnership with the common goal of inspiring greater trust and confidence in every data-driven decision by driving innovation and proven AI in the cloud. Artificial Intelligence (AI) is changing the way people and organizations improve decision-making and move about... Read more
What are the Types of Missing Data?
Having clean, comprehensive, and consistent data is paramount to developing effective algorithms in machine learning. Without perfect data, you’re exposed to bias and skewed results that will lead to improper decision-making. When it comes to missing data, there are three major types: Missing Completely at Random... Read more