Last Saturday, in the UEFA Champions League final (think of it as Europe’s Super Bowl), Spanish giants Real Madrid defeated their Italian counterparts Juventus FC 4-1. It was a thrilling match, that saw both sides staking an equal claim to winning the match in the first half, with Madrid eventually prevailing... Read more
Pandas & Seaborn – A guide to handle & visualize data elegantly
Here at Tryolabs we love Python almost as much as we love machine learning problems. These kind of problems always involve working with large amounts of data which is key to understand before applying any machine learning technique. To understand the data, we need to manipulate it, clean it,... Read more
Regular Expression & Treemaps to Visualize Emergency Department Visits
It’s been a while since my last post on some TB WHO data. A lot has happened since then, including the opportunity to attend the Open Data Science Conference (ODSC) East held in Boston, MA. Over a two day period I had the opportunity to listen to a number of leaders... Read more
In this post we will describe how to evaluate a predictive model. Why bother creating complex predictive models if 5% of the customers will churn anyway? Because a predictive model will rank our clients based on the probability that they  will abandon the company. It helps answer these two questions: 1.... Read more
Drawing a map of distributed data systems
How we created an illustrated guide to help you find your way through the data landscape. Designing Data-Intensive Applications, the book I’ve been working on for four years, is finally finished, and should be available in your favorite bookstore in the next week or two. An incomplete beta (Early... Read more
Choroplethr v3.6.0 is now on CRAN
Choroplethr version 3.6.0 is now on CRAN. This version adds functionality for getting and mapping demographics of US Census Tracts. You can install it from the R console as follows: 1 2 3 install.packages("choroplethr") packageVersion("choroplethr") ‘3.6.0’ To use this functionality you will need an API key from the... Read more
NYC Pre-K Explorer
Shiny Project contributed by Amy Tzu-Yu Chen – Data Science Student in the NYC Data Science Academy Bootcamp Motivation In 2013, Mayor De Blasio campaigned on a promise of universal pre-kindergarten. The program makes access to free pre-kindergarten education available to all NYC families, regardless of child’s abilities and family income. Now,... Read more
Exploring the Relationship between Religion and Demographics in R
Today’s guest post is by Julia Silge. Take a look at her work on (“Mapping US Religion Adherence by County in R“) where she demonstrated how to work with US religion adherence data in R. In this post she explores the relationship between that dataset and US Demographic data. I... Read more
9 ways to Level up your Data Science practice
We love reading articles with tips and best practices, and we agree with a lot of the advice we see out there (#5 on this list is great!). So, we asked the Domino team for advice to pass on to researchers and scientists searching for ways to get to that next level, and... Read more
It wasn’t an overbooking problem. United Airlines was trying to move four flight crew members to the next airport. They forced passengers to get off the plane with the consequences we saw on the video from last Sunday, but don’t take our word for it. Let’s talk data. An elaborate... Read more