Ready Made Plots Make Work Easier
A while back, Simon Jackson and Kara Woo shared some great ideas and graphs on grouped bar charts and density plots (link). Win-Vector LLC’s Nina Zumel just added a graph of this type to the development version of WVPlots. Nina has, as usual, some great documentation here. More and more I am finding when you are in the... Read more
Data Visualization Throughout the Data Science Workflow: Part 1
Scientists, including data scientists, often focus on numerical methods and analyses, glossing over visualization and communication. We want to believe that “the data speaks for itself.” However, data visualization is an essential tool in a data scientist’s toolbox. Data visualization allows you to see patterns that would be invisible... Read more
Introducing GapminderVR: Data Visualization in Virtual Reality
I am a big fan of sites such as Gapminder and Our World in Data. Such data visualization sites are like intellectual pornography. You want to know which countries are doing better? Which continents drink more alcohol? How is alcohol related to GDP? Have people been getting fatter recently, or is that a... Read more
Moving Beyond Data Visualization to Data Applications
One thing we love doing at Exaptive – aside from creating tools that facilitate innovation – is hiring intelligent, creative, and compassionate people to fill our ranks. Frank Evans is one of our data scientists. He was invited to present at the TEDxOU event on January 26, 2018. Frank gave a great talk... Read more
In the 1840s, Dr. Ignaz Semmelweis had a big problem on his hands. He was an obstetrician in the Vienna General Hospital, and healthy young women in the maternity ward were dying at an alarming rate. The problem was well known throughout Vienna. There were two wards, we’ll call... Read more
SEINFELD CHARACTERS – A POST ABOUT NOTHING
This post is dedicated to my mother – Seinfeld’s greatest fan. Seinfeld is a classic TV sitcom. It featured four main characters surrounded by relatively normal, everyday, run-of-the-mill scenarios. In the spirit of Seinfeld, this post will also “be about nothing.” Load Required Libraries library(scales) library(RMySQL)... Read more
Random Forest Classification of Mushrooms
There is a plethora of classification algorithms available to people who have a bit of coding experience and a set of data. A common machine learning method is the random forest, which is a good place to start. This is a use case in R of the randomForest package used on a data... Read more
Jack Kwok is a Software Engineer with 15 years of professional experience. At Insight, he built a Deep Learning solution to automatically detect flooded roads during natural disasters. He is now a Software Engineer at Lyft working with Machine Learning and Deep Learning. Want to learn applied Artificial Intelligence... Read more
Data Visualization – Part 3
What Type of Data Visualization Do You Choose (if any)? Determining whether or not you need a visualization is step one. While it seems silly, this is probably something everyone (including myself) should be doing more often. A lot of times, it seems like a great way to showcase the... Read more
Scratch Viz – Documentation and Usage
Contents: Introduction, Audience, Getting Started, Data, Scratch Blocks, Example Projects. Introduction: “If you have built castles in the air, your work need not be lost; that is where they should be. Now put the foundations under them.” Henry David Thoreau (Source: Why’s (Poignant) Guide to Ruby). This experimental Scratch extension aims to... Read more
Open Data Science - Your News Source for AI, Machine Learning & more