Correlation Analysis 101 in Python
Hello everyone! Do you realize it’s spring already? I’m almost ready to celebrate the holiday of flowers, but first: another data analysis practice for you today that will make your life easier (or at least more interesting, hopefully). Do you ever receive questions like: – Does... Read more
How is Cross-Site Scripting Different From Other Related Attacks?
Cross-site scripting is one of the most common web attacks, ranking seventh in “The OWASP Top 10”, the industry-recognized list for spreading awareness about the most common web security vulnerabilities. There are also many lesser-known vulnerabilities that are often confused with... Read more
An Introduction to Social Network Analysis with NetworkX: Two Factions of a Karate Club
Networks (a.k.a. graphs) are one of the most interesting areas of data science and have seen an explosion of interest in recent years. The ability to model the relationships between data points is powerful. This article introduces some basic concepts in network science and... Read more
Ditching Excel for Python – Lessons Learned from a Legacy Industry
I’ve been in a reflective mood lately, probably because it’s the end of a very eventful year. Change is all around us and technology is the enabler. On that note, I’ve been thinking a lot about my own experiences with technology-driven change in the reinsurance industry,... Read more
Data Science Notebooks | 2020 Review
2020 was a roller coaster, but the data science community is going strong. Interest in the data science domain has grown in the past year yet again. We dug into the data to learn more about the current state of a vital part of the data... Read more
How to Load Big Data from Snowflake Into Python
We at Saturn Cloud are dedicated to fast and scalable data science with Python. Often this looks like querying data that resides in cloud storage or a data warehouse, then performing analysis, feature engineering, and machine learning with Python. Snowflake is a scalable cloud data warehouse... Read more
Getting Started with Pandas
Pandas is a popular data analysis library built on top of the Python programming language, and getting started with it is easy. It assists with common manipulations for data cleaning, joining, sorting, filtering, deduping, and more. First released in 2009, pandas now sits as... Read more
Removing Items From a Set – remove(), pop(), and difference
Python has a rich collection of built-in data structures. These data structures are sometimes called “containers” or “collections” because they contain a collection of individual items. These structures cover a wide variety of common programming situations. In this recipe, we’ll look at how we can update a set by... Read more
Retrieving Webpages Through Python Programming
This article discusses retrieving web pages through Python programming. The internet and the World Wide Web (WWW) are probably the most prominent sources of information today. Most of that information is retrievable through HTTP. HTTP was originally invented to share pages of hypertext (hence the name Hypertext Transfer Protocol),... Read more
Using a Smoother in SQL
A problem found throughout the world of data is how to distinguish signal from noise. When dealing with data that comes in a sequence, such as time-series data (the most familiar example but by no means the only example), a frequent method of dealing with the... Read more