Understanding AI Toolkits
This was originally posted on the Silicon Valley Data Science blog. Options for Developing with Deep Learning  Modern artificial intelligence makes many benefits available to business, bringing cognitive abilities to machines at scale. As a field of computer science, AI is moving at an unprecedented rate: the time you must wait... Read more
The Chatbot Landscape, 2017 Edition
To help decision makers and users wade around the vast landscape of bots, this landscape gives a high-level overview of providers and tools.   Why this landscape, now? Since we started building bots more than 2 years ago, the landscape has seen massive interest and change. This makes it hard... Read more
CatBoost: Yandex’s machine learning algorithm is available free of charge
Russia’s Internet giant Yandex has launched CatBoost, an open source machine learning service. The algorithm has already been integrated by the European Organization for Nuclear Research to analyze data from the Large Hadron Collider, the world’s most sophisticated experimental facility. Machine learning helps make decisions by analyzing data and... Read more
Linked Data and Data Science
Feature Engineering with Apache Spark and Optimus
When we talk about Feature Engineering we refer to creating new features from your existing ones to improve model performance. Sometimes this is the case, or sometimes you need to do it because a certain model doesn’t recognize the data as you have it, so these transformations let you... Read more
After a spike in interest in library’s initial release, we can see it’s popularity demonstrated continuous growth, nearly doubling its search interest score in 2017 alone. Compared to deep learning libraries, TensorFlow is a class above the rest. Read more
Descriptive Analysis of MLST Data for MRSA
During one of my summers, I had the opportunity to conduct some research on the prevalence of methicillin-resistant Staphylococcus aureus (MRSA) in vulnerable populations and examining US emergency department data and I thought this would be a pretty interesting topic to expand on for my thesis in light of the increasing concerns... Read more
This post is the first of a two-part series in which we apply NLP techniques to analyze articles about big data, data science, and AI. If you are tired of the hassles of web scraping, then this post might be just for you. I occasionally web scrape news articles from the... Read more
In this post, I’ll tell you how to geolocate your analysis using the Geopy. Geopy is a Python 2 and 3 library, that provides connections to the most popular geocoding services. Why bother to geolocate your data? Because if you use latitude and longitude data, you can visualize your... Read more
Bundle Buddy
When building a complex JavaScript application, it is common to minify code and bundle files together to optimize network requests so the app loads faster. A common pattern for complex and large applications is code splitting. Typically this breaks up the bundles by each route in your application so... Read more