Has Progress in America Peaked?
Measuring Physical, Personal, and National Development over the past century I recently came across a compelling work of data visualization published by the New York Times. The author sought to prove a point that performance had peaked in men’s speed skating and more broadly in other Olympic... Read more
Visualizing Machine Learning Datasets with Google’s FACETS
There has been a lot of uproar as to how a large quantity of training data can have a tremendous impact on the results of a machine learning model. However, along with data quantity, it is also the quality that is critical to building a powerful... Read more
Visualizing Bias in Data Using Embedding Projector
Before a machine learning model is deployed, its performance is evaluated. However, apart from the performance aspect, it is also important to know what the model has learnt. This is necessary so as to ensure that the model hasn’t learnt something discriminatory or biased. [Related Article:... Read more
Interpreting the 2020 Puerto Rico Earthquake Swarm with Data Science
Using visualizations, maps, time series and Google Trends data science, the 2020 Puerto Rico earthquake swarm is described. Since late December 2019 until early January 2020, the southwestern region of the island Puerto Rico has been experiencing a series, or swarm, of earthquakes, leaving in its... Read more
Local Regression in Python
I love data visualization make-overs (like this one I wrote a few months ago), but sometimes the tone can be too negative (like this one I wrote a few months ago). Sarah Leo, a data journalist at The Economist, has found the perfect solution: re-making your own... Read more
A New Method of Data Mapping – Dimensionality Reduction + Network Theory
When you visit a new place, probably you will rely on a map to guide you from place to place so you could get yourself oriented and help you find the most interesting places. The same happens with data. When you get a new dataset to... Read more
Parallel Plots for Visualizing Relationships with ggplot2 and ggforce
Parallel plots are useful for understanding the connections in a data set. In this post, we will demonstrate how ggplot2 and ggforce packages can be combined to create Parallel set plots–an extension of parallel plots.  Parallel... Read more
Why We Need Graph Analytics for Real-World Predictions
Editor’s Note: Learn more about graphs and graph analytics at the talk “Reveal Predictive Patterns with Neo4j Graph Algorithms” at ODSC West 2019 on Wednesday, October 30th. As data becomes increasingly interconnected and systems increasingly sophisticated, it’s essential to make use of the rich and evolving... Read more
Interpretable Knowledge Discovery Reinforced by Visual Methods
Editor’s Note: See Boris Kovalerchuk’s talk “Interpretable Knowledge Discovery Reinforced by Visual Methods” at ODSC West 2019. Visual reasoning and discovery have a long history. Chinese and Indians had visual proof of the Pythagorean Theorem in 600 B.C. before it was known to the Greeks. Scientists... Read more
Data Visualization for Academics
One of the reasons I am excited about the rise of data journalism is that journalists are doing amazing things with visualization. At the same time, one of my frustrations with academic research is that the general quality of visualization is so poor. One of the... Read more