A data analysis project focused on Zepto operational data. We use SQL for cleaning, organizing, and analyzing the dataset to derive key business insights for the quick-commerce industry.
| Path | Focus |
|---|---|
datasets/zepto_v2.csv |
Raw quick-commerce inventory data. |
sql/01_zepto_data_preparation.sql |
Data Cleaning: Schema setup, data quality checks, and unit conversion. |
sql/02_zepto_analysis_queries.sql |
EDA: Calculating key metrics, value ranking, and strategic insights. |
LICENSE |
Project license. |
This project addresses crucial business questions by segmenting the data and calculating specific metrics:
- Unit Conversion: Crucial transformation to convert all price fields (paise to rupees) for accurate financial analysis.
- Data Integrity: Identifying and correcting/removing invalid entries (e.g., zero prices) and checking for data quality issues.
- Value Assessment: Calculated Price per Gram to standardize product value and identify best-value items.
- Revenue & Discounts: Estimated total revenue per category and determined the highest average discount percentage offered.
- Inventory Segmentation: Used
CASEstatements to categorize products (e.g., Low, Medium, Bulk) for optimized inventory planning.
This project showcases core proficiency in the data analysis workflow using SQL:
- Data Preparation & Transformation: Standardizing data integrity, including crucial unit conversion (paise to rupees).
- Foundational Querying: Writing efficient SQL using
GROUP BYand Aggregate Functions to summarize large datasets. - Business Metric Calculation: Calculating key performance indicators (KPIs) like estimated revenue, price per gram efficiency, and discount averages.
- Conditional Logic: Employing
CASEstatements for data segmentation and custom reporting.
To replicate this analysis:
- Setup: Import the
datasets/zepto_v2.csvfile into your SQL database environment (e.g., MySQL, PostgreSQL). - Run Scripts: Execute the SQL scripts in numerical order:
- Start with
sql/01_zepto_data_preparation.sqlto clean and transform the data. - Then run
sql/02_zepto_analysis_queries.sqlto generate the insights.
- Start with