fbpx
Getting Started with Pandas
Pandas is a popular data analysis library built on top of the Python programming language. It assists with common manipulations for data cleaning, joining, sorting, filtering, deduping, and more. First released in 2009, pandas now sits as the epicenter of Python’s vast data science ecosystem and is an essential... Read more
Removing Items From a Set – remove(), pop(), and difference
Python has a rich collection of built-in data structures. These data structures are sometimes called “containers” or “collections” because they contain a collection of individual items. These structures cover a wide variety of common programming situations. In this recipe, we’ll look at how we can update a set by removing or replacing... Read more
An Introduction to AWS Networking – Virtual Private Cloud
Cloud computing is one of the major trends in computing today and has been for many years. Public cloud providers have transformed the start-up industry and what it means to launch a service from scratch. We no longer need to build our own infrastructure; we can pay public cloud... Read more
Creating if/elseif/else Variables in Python/Pandas
Frequencies and Chaining in Python-Pandas
A few years ago, in a Q&A session following a presentation I gave on data analysis (DA) to a group of college recruits for my then consulting company, I was asked to name what I considered the most important analytic technique. Though a surprise to the audience, my answer,... Read more
The goal of a data analysis pipeline in Python is to allow you to transform data from one state to another through a set of repeatable, and ideally scalable, steps. Problems for which I have used data analysis pipelines in Python include: Processing financial / stock market data, including... Read more
Guide to R and Python in a Single Jupyter Notebook
Why pick one when you can use both at the same time? R is primarily used for statistical analysis, while Python provides a more general approach to data science. R and Python are object-oriented towards data science for programming language. Learning both is an ideal solution. Python is a... Read more
15+ Free and Paid Resources to Learn Python
Python is one of the most significant skills you’ll need not only in data science but in any position in the development/tech world. It’s used in a variety of fields and has a reputation for being easy to learn and simple to use. Here are 15+ free and paid... Read more
Scikit Optimize: Bayesian Hyperparameter Optimization in Python
So you want to optimize hyperparameters of your machine learning model and you are thinking whether Scikit Optimize is the right tool for you? You are in the right place. In this article I will: show you an example of using skopt on a real problem, evaluate this library based on... Read more
Implementing a Kernel Principal Component Analysis in Python
In this article, we discuss implementing a kernel Principal Component Analysis in Python, with a few examples.  Many machine learning algorithms make assumptions about the linear separability of the input data. The perceptron even requires perfectly linearly separable training data to converge. Other algorithms that we have covered so... Read more