“Case study on Retail store using SQL”
In this case study, I will try to show you how to analyze a dataset in depth.
Humble advice: a good analyst always spends a good amount of time simply studying the dataset and building a mental picture of its different features.
“Half the battle is won when there is a picture of the dataset in your mind palace” -Uday Tayal
Overview of the features available in the dataset
- This business case has information on 100k orders placed from 2016 to 2018 at a particular retail store in Brazil. Its features allow viewing an order from multiple dimensions: order status, price, payment and freight performance, customer location, product attributes, and finally reviews written by customers.
Data is available in 8 CSV files:
1. customers.csv
2. geolocation.csv
3. order_items.csv
4. payments.csv
5. reviews.csv
6. orders.csv
7. products.csv
8. sellers.csv
1. Usual exploratory analysis steps, such as checking the structure and characteristics of the dataset
   1. Data type of columns in a table
   2. Time period for which the data is given
   3. Cities and states of customers who ordered during the given period
-- Count the distinct states and cities present in the sellers table
SELECT
  COUNT(DISTINCT seller_state) AS Unique_states,
  COUNT(DISTINCT seller_city) AS Unique_cities
FROM `1geo.sellers`;
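The other steps in the list can be sketched along the same lines. The queries below are only a minimal sketch. They assume BigQuery (suggested by the backtick-quoted table name above) and that the CSV files were loaded into the same `1geo` dataset as tables named `customers` and `orders`. The column names `order_purchase_timestamp`, `customer_state` and `customer_city` are assumptions too, so adjust them to your own schema.

```sql
-- Step 1: data type of each column in a table.
-- Assumes a table named `customers` exists in the `1geo` dataset.
SELECT
  column_name,
  data_type
FROM `1geo.INFORMATION_SCHEMA.COLUMNS`
WHERE table_name = 'customers';

-- Step 2: time period covered by the data.
-- Assumes `orders` has a timestamp column named order_purchase_timestamp.
SELECT
  MIN(order_purchase_timestamp) AS first_order,
  MAX(order_purchase_timestamp) AS last_order
FROM `1geo.orders`;

-- Step 3, customer side: distinct states and cities of customers.
-- Assumes `customers` has customer_state and customer_city columns.
SELECT
  COUNT(DISTINCT customer_state) AS unique_states,
  COUNT(DISTINCT customer_city) AS unique_cities
FROM `1geo.customers`;
```

The last query counts every row in `customers`. Restricting it to customers who actually placed an order in the period would need a join to `orders`, which is left out here to keep the sketch short.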