“Case study on Retail store using SQL”

Uday Tayal
2 min readMar 20, 2023

--

Here, through this case study I will try to train your mind how to have in depth analysis of the data.

Humble advice : The main feature of a good Analyst is he always spends a good amount of time in just watching and analyzing the dataset in his mind and try to paint a major picture of the different features of the dataset.

“Half the battle is won when their is a picture of the dataset in your mind palace” -Uday Tayal

Overview of the features available in the dataset

  • This business case has information of 100k orders from 2016 to 2018 made at a particular retail store in Brazil. Its features allows viewing an order from multiple dimensions: from order status, price, payment and freight performance to customer location, product attributes and finally reviews written by customers.

Data is available in 8 csv files:

1. customers.csv

2. geolocation.csv

3. order_items.csv

4. payments.csv

5. reviews.csv

6. orders.csv

7. products.csv

8. sellers.csv

Schema of the dataset

1. Usual exploratory analysis steps like checking the structure & characteristics of the dataset

1. Data type of columns in a table

2. Time period for which the data is given

3. Cities and States of customers ordered during the given period

SELECT

COUNT(DISTINCT(seller_state)) Unique_states,COUNT(DISTINCT(seller_city)) Unique_cities

FROM `1geo.sellers`;

--

--