RAPIDS Forest Inference Library: Prediction at 100 Million Rows per Second
Random forests (RF) and gradient-boosted decision trees (GBDTs) have become workhorse models of applied machine learning. XGBoost and LightGBM, popular packages implementing GBDT models, consistently rank among the most commonly used tools by data scientists on the Kaggle platform. We see similar interest in forest-based models in industry, where... Read more
Automating Image Annotation with MAX
This blog post introduces an example use case: automating image annotation with MAX (Model Asset Exchange). To learn more about how our deep learning models are created, containerized, and deployed to production, come join our training at ODSC West 2019: Deploying Deep Learning Models as Microservices. Introduction The Model Asset Exchange... Read more
Three Methods of Data Pre-Processing for Text Classification
Editor’s Note: Nick will be presenting on this idea of data pre-processing during the workshop “Choosing The Right Deep Learning Framework: A Deep Learning Approach,” at ODSC Europe in London this November! As a developer advocate at IBM, I work to empower AI, machine learning, and deep learning developers... Read more
Unpacking YouTube’s Recommender System
Over the past couple of years, YouTube has come under fire for its recommender system, with the media suggesting that it is promoting violent content, or banning LGBT content for violating its terms of service. Seemingly in response to all of this, Google has finally released a paper explaining... Read more
Composable Machine Learning
Even as machine learning (ML) algorithms become more sophisticated and powerful, the way ML teams build ML systems hasn’t changed much. In this article, we’ll explain the need for composable machine learning systems. First, take a look at the old, inefficient way. Once the team figures out the task... Read more
Parallel Plots for Visualizing Relationships with ggplot2 and ggforce
Parallel plots are useful for understanding the connections in a data set. In this post, we will demonstrate how the ggplot2 and ggforce packages can be combined to create parallel set plots, an extension of parallel plots. Parallel set plots depict... Read more
Regression Blog 2: We’re Practically Giving These Regressions Away
When I heard that they would be releasing Pumpkin Spice Spam, I thought of regression. This might seem like a leap, but bear with me. In the U.S. in the last few years, hearing news reports of unusual Pumpkin Spice flavored products means the unofficial end of summer, or at... Read more
What You Need to Know about DeepMind’s BSuite
Imagine this. You know that reinforcement learning has been responsible for some of AI’s most significant advancements. You’re in the exploratory phase of implementing your first project. You’d love a way to evaluate whether your RL agent is appropriate for the task you have, something not always apparent without... Read more
Applications of AI in Cybersecurity
Editor’s Note: See Dustin’s talk “Applications of AI in Cybersecurity” at ODSC West 2019. Security has historically lagged behind the implementation of new technology. With AI/ML transforming how industries and government agencies do business and serve citizens, it is critical that developers build security into our architectures from the... Read more
From Data to Process to Decision
I recently published a paper entitled “Intelligent Decisions: How Businesses Can Improve Processes Using Artificial Intelligence Technologies.” The work focused on the possibility of employing artificial intelligence in the business process management functions of the enterprise. I would like to further explore this concept and investigate the data science... Read more