Hey guys, welcome back to my R-tips newsletter. Today I’m introducing GWalkR
: An R package for Exploratory Data Analysis in 1 line of code. Just like Tableau. But Costs $0 (100% free). Let’s go!
Here’s what you’re learning today:
GWalkR
is and how it makes Exploratory Data Analysis in R easierGet the Code (In the R-Tip 083 Folder)
Inside the workshop I’ll share how I built a Machine Learning Powered Production Shiny App with ChatGPT
(extends this data analysis to an insane production app):
What: ChatGPT for Data Scientists
When: Wednesday August 14th, 2pm EST
How It Will Help You: Whether you are new to data science or are an expert, ChatGPT is changing the game. There’s a ton of hype. But how can ChatGPT actually help you become a better data scientist and help you stand out in your career? I’ll show you inside my free chatgpt for data scientists workshop.
Price: Does Free sound good?
How To Join: Register Here
This article is part of R-Tips Weekly, a weekly video tutorial that shows you step-by-step how to do common R coding tasks. Pretty cool, right?
Here are the links to get set up.
I have an 11-minute video that walks you through setting up GWalkR
in R and running your first exploratory data analysis with it.
GWalkR
is a Tableau alternative that is 100% freely available in R. It includes 95% of the drag-n-drop features for fast EDA that Tableau has. And you can use it right in R. Github: https://github.com/Kanaries/GWalkR
For Python users, the pygwalker
library is the equivalent tool in Python. Github: https://github.com/Kanaries/pygwalker
Both GWalkR
and pygwalker
made by Kanaries, which offers a paid version that includes more features like cloud hosting, sharing, and AI.
I can replace roughly 95% of Tableau with the free version of GWalkR
.
GWalkR
is great forYou’ll need to use the paid version if you want to:
You’ll want to weigh your analytics needs. If you’re just doing analysis for yourself like I do 90% of the time. Then sharing isn’t a big deal. I’ll just make an RMarkdown with the final plots, analysis, and report when I need to share.
In this section, I’ll share how to make 4 common data visualiations (plots):
It takes about 10 seconds to get GWalkR
set up so you can start doing drag-n-drop exploratory data analysis (just like Tableau) inside of R. All the tutorial code and data sets shown are available in the R-Tips Newsletter folder for R-Tip 083.
Get the Code and Datasets (In the R-Tip 083 Folder)
The first step is to set up GWalkR
. Run this code to install GWalkR
, load the key libraries, and read in the first data set (MPG Data) that will explore together.
Get the Code (In the R-Tip 083 Folder)
This will produce the GWalkR in the Viewer Pane inside RStudio:
Get the Code (In the R-Tip 083 Folder)
Now you’re ready to explore and analyze the first data set.
Let’s get our feet wet with some of the basic features of GWalkR
. We’ll explore the “mpg” data set in the data folder of R-Tip 083.
A bar plot is the most basic plot that is an aggregation (sum, average, etc) applied to 1 numeric feature. The bars are formed by segmenting by 1 categorical feature.
Get the Code and Data Set (In the R-Tip 083 Folder)
To make a bar plot, we need to:
A scatter plot is an un-aggregated plot that will help us detect trends between 2 numeric features.
Get the Code and Data Set (In the R-Tip 083 Folder)
Now that you have a feel for how it works, creating a scatter plot is pretty easy:
A box plot applies Jon Tukey’s method for displaying the distribution of data using median, 1st and 3rd quartiles, and outliers. It’s great for detecting general trends and exposing outliers.
How to recreate this plot:
Now let’s work with a time series dataset. Run this code:
Get the Code and Data Set (In the R-Tip 083 Folder)
That will produce this GWalkR session in the Viewer pane:
A time series plot is a useful way to visualize trends in time series data (contains a date or time stamp).
To recreate this plot:
All of the code you saw today is available in R-Tips Newsletter folder for R-Tip 083
Get the Code (In the R-Tip 083 Folder)
The GWalkR
package makes it easy to explore data. In fact, I’ve used it to replace 95% of my Tableau work. But there’s more to becoming a data scientist.
If you would like to grow your Business Data Science skills with R, then please read on…
I’ve helped 6,107+ students learn data science for business from an elite business consultant’s perspective.
I’ve worked with Fortune 500 companies like S&P Global, Apple, MRM McCann, and more.
And I built a training program that gets my students life-changing data science careers (don’t believe me? see my testimonials here):
Here’s the system that has gotten aspiring data scientists, career transitioners, and life long learners data science jobs and promotions…
Join My 5-Course R-Track Program Now!
(And Become The Data Scientist You Were Meant To Be…)
P.S. – Samantha landed her NEW Data Science R Developer job at CVS Health (Fortune 500). This could be you.