WHO Tuberculosis Data & ggplot2
BlogData VisualizationRposted by Eugene Joh August 2, 2017
So it has been a while since my previous post on some data taken from the UNHCR database. This post we’ll bring it back to the topic of infectious diseases (check out my other posts on the SIR model and MRSA typing). For this post, as similar to previous ones, I give... Read more
Feature Engineering with Tidyverse
BlogRRposted by Burak Himmetoglu July 19, 2017
In this blog post, I will discuss feature engineering using the Tidyverse collection of libraries. Feature engineering is crucial for a variety of reasons, and it requires some care to produce any useful outcome. In this post, I will consider a dataset that contains description of crimes in... Read more
Using GRAKN.AI to reason over an R dataset
BlogRposted by Jo Stichbury June 13, 2017
Introduction In this article I will introduce an open-source knowledge graph platform called GRAKN.AI. I’m going to use it to load a simple dataset, and show how to calculate basic statistics such as maximum and mean values. A good question at this point would be: as... Read more
Like I mentioned in my last blog post, I am contributing to a session at userR 2017 this coming July that will focus on discovering and learning about R packages. This is an increasingly important issue for R users as we all decide which of the... Read more
On indexing operators and composition
BlogRRposted by John Mount May 20, 2017
In this article I will discuss array indexing, operators, and composition in depth. If you work through this article you should end up with a very deep understanding of array indexing and the deep interpretation available when we realize indexing is an instance of function composition... Read more
Scraping CRAN with rvest
BlogRToolsData Mining|Rposted by Julia Silge May 17, 2017
I am one of the organizers for a session at userR 2017 this coming July that will focus on discovering and learning about R packages. How do R users find packages that meet their needs? Can we make this process easier? As somebody who is relatively... Read more
RIDE – A New Data Science IDE for Python and R
PythonRTools & Languagesposted by May 9, 2017
The data science world is split into two parts: the (i)Python and the R community. Both groups offer a plethora of tools and libraries enriching our work-life as a data scientist. Interestingly, many of the offerings are complementary, such that professional data scientists should know both... Read more
Choroplethr v3.6.0 is now on CRAN
Data VisualizationModelingRToolsTools & Languagesposted by Ari Lamstein May 9, 2017
Choroplethr version 3.6.0 is now on CRAN. This version adds functionality for getting and mapping demographics of US Census Tracts. You can install it from the R console as follows: 1 2 3 install.packages("choroplethr") packageVersion("choroplethr") ‘3.6.0’ To use this functionality you will need an API... Read more
Exploring the Relationship between Religion and Demographics in R
Data VisualizationModelingRStatisticsTools & Languagesposted by Julia Silge April 25, 2017
Today’s guest post is by Julia Silge. Take a look at her work on (“Mapping US Religion Adherence by County in R“) where she demonstrated how to work with US religion adherence data in R. In this post she explores the relationship between that dataset and US... Read more
Fixing an infelicity in ‘leaps’
RTools & LanguagesRposted by Thomas Lumley April 23, 2017
The ‘leaps’ package for R is ancient – this is its tenth twentieth year on CRAN. It uses old Fortran code by the Australian computational statistician Alan Miller. The Fortran 90 versions are on the web, but Fortran 90 compilation with R wasn’t portable back then, so I... Read more