How To Build A Spam Classifier Using Decision Tree
In the realm of Supervised Learning, there are tons of classifiers, including Logistic Regressions (logit 101 and logit 102), LDA, Naive Bayes, SVM, KNN, Random Forest, Neural Networks, and so many more coming each day! The real question that all data scientists... Read more
Introduction to Apache Airflow
Apache Airflow is a tool created by the community to programmatically author, schedule, and monitor workflows. The biggest advantage of Airflow is the fact that it does not limit the scope of pipelines. Airflow can be used for building Machine Learning models, transferring data, or managing the infrastructure. Let’s... Read more
5 Deep Learning Frameworks to Consider for 2020
Enough of flirting with deep learning and deep learning frameworks; it’s time to glide across the room and say, “Hello.” Call it an advanced subfield of machine learning or future to enhanced vision in the field of technology, deep learning won’t stop now! Imbibed in the majority of business... Read more
Level Up: spaCy NLP for the Win
Kimberly is a speaker for ODSC East 2020! Be sure to check out her talk, “Level Up: Fancy NLP with Straightforward Tools,” there! Natural language processing (NLP) is a branch of artificial intelligence in which computers extract information from written or spoken human language. This field has experienced a... Read more
Training and Operationalizing Interpretable Machine Learning Models
AI offers companies the unique opportunity to transform their operations: from AI applications able to predict and schedule equipment’s maintenance, to intelligent R&D applications able to estimate the success of future drugs. However, in order to be able to leverage this opportunity, companies have to learn how to successfully... Read more
Deep Q-Learning Algorithm in Reinforcement Learning
In this article, we will discuss Q-learning in conjunction with neural networks (NNs). This combination has the name deep Q-network (DQN). This article is an excerpt from the book Deep Reinforcement Learning Hands-on, Second Edition by Max Lapan. This book provides you with an introduction to the fundamentals of RL,... Read more
Are All Explainable Models Trustworthy?
Explainable AI or Explainable Data Science is one of the top buzzwords of Data Science at the moment. Models that are explainable are seen as the answer to many of the recently recognized problems with machine learning, such as bias or data leaks... Read more
Understanding Dataset Shift
How to not be fooled by the tricks data plays on you. Dataset shift is a challenging situation where the joint distribution of inputs and outputs differs between the training and test stages—Dataset Shift, The MIT Press. Dataset shifting is one of those topics which is simple, perhaps so simple... Read more
2020 Outlook on AutoML Updates & Latest Recent Advances
The field of automated machine learning or AutoML continues to expand with new products and services being announced at a frenetic pace. As a data scientist, I’m motivated to carefully monitor this technology because it could potentially impact my profession, especially if these tools open up the field of... Read more
Machine Learning for Time Series Data
Most organizations generate time series data. The generation of sales data and financial data is a primary component of every organization’s business. This data is a form of time series data. Time series data consists of any data that carries a temporal component with it. Time series data is data that... Read more