In this blog, we’ll learn How to Create Scatter Plot in PowerBI. Firstly, we must understand what it is and when we should use it.
Introduction to Scatter plot in PowerBI
Scatter Plot uses dots to visualize two different numerical values. It is used to specify the relationship between variables (X-axis and Y-axis). The relationship can either be positive, negative or no relationship at all.
Dataset description
In this blog, you will learn how to use a supermarket sales dataset that includes the invoice ID, branch, city, customer type, gender, product line, unit price, quantity, tax, total amount, sale date and time, payment option, cost of goods sold (COGS), gross margin percentage, gross income, and buy rate.
You can access the dataset here.
Description: In this dataset, there following features/columns:
Invoice ID | Unique Identifier for each Invoice |
Branch | There are four branch Site |
City | City where transactions occurred |
Customer Type | Type of Customer – Member or Normal |
Gender | Gender – Male, Female |
Product Line | Type of Product |
Unit Price | Price of a Single Unit of Product |
Quantity | Total Quantity of Product Sell in a Single Purchase |
Tax 5% | 5% of Tax on Cost of Goods Sold |
Total | Total Amount of Items without Tax |
Date | Date of Sale |
Time | Time of Sale |
Payment | Mode of Payment |
COGS (Cost of Goods Sold) | Total Amount of Items with Tax |
Gross Margin Percentage | Total Profit in Percentage |
Gross Income | Total Profit |
Rating | Raing of the purchase in range of 1-10 |
Sample of Data:
Procedure to create a scatter plot in PowerBI
Step1: Open your PowerBI Desktop on your Device
Step2: Click on Get Data button in Home Ribbon and select Text/CSV.
Step3: Open Prompt box will be open to select your text or csv file.
Step4: After selection of dataset, It will pop up a window for Load OR Transform Data. Click on Load to use data on PowerBI.
Step5: In the following picture, data has been loaded successfully.
Step1: Select Scatter Plot from Visualizations Section (Right Hand Side).
Step2: Here, we’ll put the following data (from supermarket_sales) in different fields:
Values | Payment |
X Axis | Quantity (Sum of Quantity) |
Y Axis | Rating (Sum of Rating) |
Legend | Customer type |
Play Axis | Date |
After that you have to click on format your visual. Go to the General click on turn on visual border and the shadow. In the Visual border make the rounded corner of 15 px.
Insights from the scatter plot
The scatter plot clearly shows that there are different types of customers based on how they pay and how they rate the business. Members usually have better reviews and more payment options than Normal customers. Notably, transactions that are paid for with an e-wallet usually have better ratings, while payments made with cash have a wider range of ratings, some of which are lower. This shows that people who use e-wallets tend to rate better, while people who use cash have more varied experiences.
Conclusion
With a scatter plot, we can visualize the relationship between two numerical variables by plotting data points on a two-dimensional graph. In a scatter plot, the x-axis represents the values of one variable, while the y-axis represents the values of another variable. A play axis allows dynamic visualization with time or sequential ranges, animating the progression of data points to show changes and trends.
If you like this blog, you can share it with your friends or colleague. You can connect with me on social media profiles like LinkedIn, Twitter, and Instagram.
- 👏 Like for this article and subscribe to our newsletter
- 📰 View more content on my DataSpoof website
- 🔔 Follow Me: LinkedIn| Youtube | Instagram | Twitter