You weren’t supposed to actually implement it, Google
Last month, I wrote a blog post warning about how, if you follow popular trends in NLP, you can easily accidentally make a classifier that is pretty racist. To demonstrate this, I included the very simple code, as a “cautionary tutorial.” The post got a fair amount of reaction. Much... Read more
Blockchain Technology is Hot Right Now, But Tread Carefully.
In 1968, Dr. Spencer Silver, a chemist at 3M Company, was tasked with creating a super-strong adhesive. Despite his efforts, his chemical compound which only achieved moderate stickiness could not find a problem for his solution. That was until his colleague, Art Fry, noticed he could use the glue... Read more
Chatbots in Banking
The Co-arthers of this post are Yue Cathy Chang an SVDS alumi, and Cindi Thompson,Principal Data Scientist at Silicon Valley Data Science. There article was originally posted on the Silicon Valley Data Science blog. From asking Amazon Alexa for traffic conditions, to receiving helpful tips from Slackbot, to using WeChat to... Read more
12 tips for Designing and Managing an AI-driven product
Here’s a question that will keep future Artificial Intelligence (AI) entrepreneurs up at night: How do you manage a product when the software starts writing itself? We’re not quite there yet, but as we build smarter, more complex software that has elements driven by AI we’re also making less predictable software.... Read more
Datasets for Building a Data Analysis Portfolio
I recently had the pleasure of attending the 2017 Association of Public Data Users (APDU) Conference. My favorite part of the conference was talking to people who work with federal data on a daily basis. Overall I found people to be passionate about their work and eager to share information... Read more
It feels good to be a data geek in 2017. Last year, we asked “Is Big Data Still a Thing?”, observing that since Big Data is largely “plumbing”, it has been subject to enterprise adoption cycles that are much slower than the hype cycle. As a result, it took several... Read more
Testing non-cryptographic random number generators: my results
In software, we use random number generators to emulate “randomness” in games, simulations, probabilistic algorithms and so on. There are many definitions of what it means to be random, but in practice, what we do is run statistical tests on the output of the random number generators. These tests... Read more
Bundle Buddy
When building a complex JavaScript application, it is common to minify code and bundle files together to optimize network requests so the app loads faster. A common pattern for complex and large applications is code splitting. Typically this breaks up the bundles by each route in your application so... Read more
We’re all familiar with terms like first, third, and developing the world when it comes to describing countries in relation to the word. “First-world” refers to the countries are richer, healthier, and more educated, while impoverish nations fall under the label of third-world. In addition, we occasionally hear “second-world”... Read more
Dimensional Modeling and Kimball Data Marts in the Age of Big Data and Hadoop
Is dimensional modeling dead? Before I give you an answer to this question let’s take a step back and first have a look at what we mean by dimensional data modelling. Why do we need to model our data? Contrary to a common misunderstanding, it is not the only... Read more
Open Data Science - Your News Source for AI, Machine Learning & more