fbpx
Three Methods of Data Pre-Processing for Text Classification
Editor’s Note: Nick will be presenting on this idea of data pre-processing during the workshop “Choosing The Right Deep Learning Framework: A Deep Learning Approach,” at ODSC Europe in London this November! As a developer advocate at IBM, I work to empower AI, machine learning, and deep learning developers... Read more
From Data to Process to Decision
I recently published a paper entitled ”Intelligent Decisions: How Businesses Can Improve Processes Using Artificial Intelligence Technologies.” The work focused on the possibility of employing artificial intelligence in the business process management functions of the enterprise. I would like to further explore this concept and investigate the data science... Read more
5 DevOps Challenges To Overcome To Gain Productivity
Editor’s Note: Is your business ready to implement DevOps? Learn more at ODSC West on how you can do just that. DevOps brought the development community to the agile era where multiple teams can work in a collaborative environment sharing their skills, knowledge and development responsibilities. As competition is increasing... Read more
Data Science + Design Thinking: a Perfect Blend to Achieve the Best User Experience
  It’s one thing to rely on artificial intelligence, machine learning, and big data to make your product smarter.  And, quite another to build a product that’s so intuitive and easy-to-use that your customer falls in love with it. That’s the beauty of data science + design thinking. It’s... Read more
The Data Scientist’s Holy Grail – Labeled Data Sets
The Holy Grail for data scientists is the ability to obtain labeled data sets for the purpose of training a supervised machine learning algorithm. An algorithm’s ability to “learn” is based on training it using a labeled training set – having known response variable values that correspond to a... Read more
A Practical Approach to Data Ethics
There is a Golden Rule in life. It’s a maxim that appears in various forms around the world: One should never do that to another which one regards as injurious to one’s own self. As a data scientist, I find this principle of reciprocity very appealing! Treat others’ data... Read more
How Tidyverse Guides R Programmers Through Data Science Workflows
Whenever someone asks me how to get into data science using R, I invariably recommend checking out the tidyverse package. Tidyverse is a great launch pad for a language like R because it offers order and consistency. I studied programming language design as a CS undergrad. At the time,... Read more
Setting Your Hypothesis Test Up For Success
I want to go deep with you on exactly how I work with stakeholders ahead of launching a hypothesis test. This step is crucial to make sure that once a test is done running, we’ll actually be able to analyze it. This includes: A well-defined hypothesis A solid test... Read more
Organizing Your Next Data Science Project to Minimize Headaches
Call it the data scientist’s curse, but every practitioner has had a data science project that became unmanageable at some point because of poor organizational choices early on. We’ve all been at our desks at 2 a.m. changing values and re-running our scripts for the 80th time in an... Read more
Three Popular Clustering Methods and When to Use Each
In the mad rush to find new ways of teasing apart labeled data, we often forget about everything we can do with unsupervised learning. Unsupervised machine learning can be very powerful in its own right, and clustering is by far the most common expression of this group of problems.... Read more