Amazon will make $41B this Holiday Season! Forecasting Quarterly Revenue

Amazon will make $41...

The holiday shopping season is in full swing! The economy is relatively strong compared to a few years back and so retail sales are probably going to be strong especially for amazon. Other retailers like Target and Wal-Mart are also running amazing black Friday and holiday sales to attract customers. However, amazon has consistently shown […]

How to Search for Census Data from R

How to Search for Ce...

In my course Learn to Map Census Data in R I provide people with a handful of interesting demographics to analyze. This is convenient for teaching, but people often want to search for other demographic statistics. To address that, today I will work through an example of starting with a simple demographic question and using […]

Trump and Clinton Speeches, Step 1: Text Mining

Trump and Clinton Sp...

Editor’s note: Opinions expressed in this post do not necessarily reflect the views of #ODSC, nor do they necessarily reflect the views of Edward’s employer. Part 1 Obtaining Transcripts for Campaign Trail Speeches The political season is long and arduous.  As a former Ohioan I dreaded any election year because it is punctuated with endless negative and inflammatory […]

The Morality of Data Collection

The Morality of Data...

I just returned from Strata + Hadoop World in San Jose, where I gave a talk entitled “Kosher Collection: Best Practices in Data Handling“.  I really had an amazing time at Strata this year and major kudos to the organizers for putting on a great show. The central premise of my talk is that in […]

Win Customer Loyalty with Predictive Analytics

Win Customer Loyalty...

Winning your customer for life is a challenging task for organizations. How can you connect with your customer and how can you ensure that they stay with your organization for a long time? Questions that many organizations face.  Fortunately, with the advance of big data and analytics, it has become a little bit easier for […]

More Data ≠ Rich Data

More Data ≠ Rich D...

In the past 10 years, the focus of data has been on amassing and storing: the more data collected, the better. But while we all became expert data gatherers, what we actually ended up with was a glut of data, a shred of the insights we expected to get, and a very expensive problem. Data […]

In NYC, More Robberies Happen Right When School Gets Out than Any Other Time

In NYC, More Robberi...

By: Ben Wellington, Data Scientist – Two Sigma As noted in my last post comparing NYC neighborhood murder rates to American cities, the City has finally released raw crime data for the first time in its history. But the more I look at data released by other cities, the more the data released by NYC seems simply […]

Mining Twitter for Broken Hearts

Mining Twitter for B...

Twitter is a data treasure-trove for sociologists. The service allows the collection of rich data that can be used as a proxy for many of our humans interactions. It can also be used to study how people break up. Based on Twitter streams of 80,000 users, the researchers were able to show several psychological and […]

Does Piety Pick Presidents?

Does Piety Pick Pres...

Julia Silge, a former astronomer now data scientist, explores a dataset of religion membership throughout the country. This dataset, published by The Association of Statisticians of American Religious Bodies (ASARB), has data on the number of congregations and adherents for religious groups for each county in the United States. Using R packages choroplethr and ggplot2, […]