

Top 5 Reasons to Use Seaborn for Data Visualizations
Blogs from ODSC SpeakersData VisualizationModelingseabornposted by ODSC Community November 17, 2021 ODSC Community

Editor’s note: Ted Petrou will be giving a presentation on Ai+ Training on Seaborn this December 1st and 8th. Be sure to check his sessions, “Data Visualization with Seaborn,” then!
The Seaborn data visualization library in Python provides a simple and intuitive interface for making beautiful plots directly from a Pandas DataFrame. When users arrange their data in tidy form, the Seaborn plotting functions perform the heavy lifting by grouping, splitting, aggregating, and plotting data, often with a single line of code. In this article, I will provide my top five reasons for using the Seaborn library to create data visualizations with Python.
Reason # 1 — Intuitive API — plotting with a single line of code
The Seaborn library’s API is intuitive, fairly easy to use, and quite uniform. Many visualizations can be created with a single line of code that takes the following form, where sns
is the Seaborn library, plotting_func
is a specific plotting function, df
is the pandas DataFrame where the data is stored, x
is the string name of the column holding the horizontal values, and y
is the string name of the column holding the vertical values.
sns.plotting_func(data=df, x=x, y=y)
Of course there are many optional parameters to create the exact plot you desire, but nearly all plotting functions follow this format and will produce a basic plot with this line. Below, we read in a simple dataset, which each row containing information about a single employee.
import pandas as pd
import seaborn as sns
emp = pd.read_csv('employee.csv')
emp.head()
A count of the number of employees by department can be done with the histplot
function.
sns.histplot(data=emp, y='dept');
Reason #2 — Instantly change styles to get more aesthetically pleasing plots
Seaborn has several built in styles that can be set to change the default plotting aesthetics. Here, we change to the ‘whitegrid’ style.
sns.set_theme(style='whitegrid',
rc={'figure.dpi': 144, 'figure.figsize': (6, 3)},
font_scale=0.7)
sns.histplot(data=emp, y='dept');
Reason #3 — Automatically group and aggregate tidy data
Perhaps its most powerful feature, Seaborn automatically groups and aggregates data in tidy form. Below, the data is grouped by sex, and again by race. Within each one of these groups, the average salary is found and plotted as a bar graph.
sns.barplot(data=emp, x='sex', y='salary', hue='race');
Reason #4 — Seaborn helps you think about data in tidy format
Seaborn is easiest to use when your data is structured in [tidy form], which is a format that makes most data analyses easier. With many other data visualization libraries, you’ll have to transform your data first into its final form and then make the plot. Below, we use pandas to group and aggregate the data so that it is able to be reproduce the above plot. With Seaborn, we can skip this step due to our data being in tidy form and because of Reason #3.
emp.pivot_table(index='sex', columns='race',
values='salary', aggfunc='mean').round(0)
Reason #5 — Many options to customize the plot
Seaborn plotting functions provide several options to customize your visualization. Some of these options for creating violin plots are used below.
sns.violinplot(data=emp.query('salary < 150_000'), x='salary',
y='race',
split=True, hue='sex', width=0.9,
order=['Asian', 'Black', 'Hispanic',
'Native American', 'White'],
hue_order=['Female', 'Male'])
When to use Seaborn
Seaborn is a good choice when you want to produce static, two-dimensional data visualizations with tidy data stored in a pandas DataFrame. It is not suitable for animation or three-dimensional visualizations. If you like using an intuitive API, easily setting new styles, automatically grouping and aggregating tidy data, and customizing your plots, then Seaborn can be a great tool to create the visualizations you desire.
Data Visualization with Seaborn — Live Training December 1st and 8th
If you are interested in an in-depth tutorial on nearly everything there is to know about creating data visualizations with Seaborn, take my live training on December 1st and 8th.
About the Author:
Teddy Petrou is the author of Pandas Cookbook, a highly rated text on performing real-world data analysis with Pandas. He is also the author of the books Exercise Python and Master Data Analysis with Python.
He is the founder of Dunder Data, a company that teaches the fundamentals of data science and machine learning. He really enjoys discovering best practices on how to use and teach data analysis with Python.