Moving Beyond Data Visualization to Data Applications
One thing we love doing at Exaptive – aside from creating tools that facilitate innovation – is hiring intelligent, creative, and compassionate people to fill our ranks. Frank Evans is one of our data scientists. He was invited to present at the TEDxOU event on January 26, 2018. Frank gave a great talk... Read more
In the 1840’s Dr. Ignaz Semmelweis had a big problem on his hands. He was an obstetrician in the Vienna General Hospital and healthy young women in the maternity ward were dying at an alarming rate. The problem was well known throughout Vienna. There were two wards, we’ll call... Read more
SEINFELD CHARACTERS – A POST ABOUT NOTHING
This post is dedicated to my mother – Seinfeld’s greatest fan. Seinfeld is a classic TV sitcom. It featured four main characters surrounded by relatively normal, everyday, run of the mill scenarios. In the spirit of Seinfeld, this post will also “be about nothing.” Load Required Libraries library(scales) library(RMySQL)... Read more
Random Forest Classification of Mushrooms
There is a plethora of classification algorithms available to people who have a bit of coding experience and a set of data. A common machine learning method is the random forest, which is a good place to start. This is a use case in R of the randomForest package used on a data... Read more
Jack Kwok is a Software Engineer with 15 years of professional experience. At Insight, he built a Deep Learning solution to automatically detect flooded roads during natural disasters. He is now a Software Engineer at Lyft working with Machine Learning and Deep Learning. Want to learn applied Artificial Intelligence... Read more
Data Visualization – Part 3
What Type of Data Visualization Do You Choose (if any)? Determining whether or not you need a visualization is step one. While it seems silly, this is probably something everyone (including myself) should be doing more often. A lot of times, it seems like a great way to showcase the... Read more
Scratch Viz – Documentation and Usage
Contents Introduction Audience Getting Started Data Scratch Blocks Example Projects Introduction If you have built castles in the air, your work need not be lost; that is where they should be. Now put the foundations under them. Henry David Thoreau Source: Why’s (Poignant) Guide to Ruby This experimental Scratch extension aims to... Read more
This blogpost is about topic modeling using data from this blog, opendatascience.com. From this, combined with the most visited articles of the year, we will generate the most popular topics of 2017. Last year, we did something similar with popular articles streamed through twitter using Non-Negative Matrix Factorization to determine topics, article... Read more
Watermain Breaks in the City of Toronto
It has been a while since my last post due to the major transition of moving back to Canada. This post will be a bit shorter than my previous ones but hopefully it will give some insight on practically investigating and analyzing open data that are becoming more popular... Read more
Plotting author statistics for Git repos using Git of Theseus
I spent a few days during the holidays fixing up a bunch of semi-dormant open source projects and I have a couple of blog posts in the pipeline about various updates. First up, I made a number of fixes to Git of Theseus which is a tool (written in Python) that... Read more