The Superstore dataset is a rich and comprehensive dataset containing information on the sales, orders, and customers of a fictional superstore. The dataset provides a detailed view of the business operations of the superstore, including information on the products sold, the customers who purchase them, and the sales performance across different regions and categories. This dataset is a valuable resource for data analysts, business intelligence professionals, and anyone interested in understanding the dynamics of retail sales and customer behavior. With about 9,994 rows of data spanning four years (2014-2017), the Superstore dataset provides a wealth of information to analyze and explore.
In this report, I perform python-based exploratory data analysis to gain insights about the fictional superstore.
This python dashboard provides insights on:
- What is the total sales and profit of the superstore?
- What is the total sales and profit by category?
- Show the top 5 states that made the highest sales
- Show the bottom 5 states that made the highest sales
- What is the total sales by region and category?
- What is the most popular ship mode class?
- Calculate the average profit by region
- Data Cleaning and Preprocessing
- Before performing analysis, we need to clean and preprocess the data. We will check if there are duplicates and remove any duplicate records and handle any missing values in the dataset.
- Descriptive Statistics
- To better understand the dataset, we will calculate some descriptive statistics such as mean, median, standard deviation, minimum, and maximum values for the numerical attributes in the dataset.
- Data Visualization
- To visualize the data, we will create various charts and plots such as bar charts and donut charts. These visualizations will help us to identify patterns and trends in the data.
- Insights/Conclusions
- Based on the analysis and visualizations, we can draw some insights and conclusions about the Superstore dataset.
- Recommendations
- Finally, we will provide some recommendations based on the insights/conclusions we have drawn from the analysis. These recommendations will help the superstore to improve its sales and profitability.
- The
Total Salesis$2,297,200.86and theTotal Profitis$286,397.02 Technologyhas the highest total sales andFurniturehas the highest total profit, whileFurniturehas the next highest total sales andTechnologyhas the next highest total profit. Also,Office suppliesmade the lowest in both total sales and total profit.Californiamade the highest total sales of$457,688, followed byNew Yorkthat made$310,876total sales.North Dakotahas the lowest total sales of$919.91, followed byWest VirginiaandMainethat made$1209.82and$1270.53total sales respectively.East Regionmade the highest total sales inTechnologycategory, andWest Regionmade the highest sales in bothFurnitureandOffice Suppliescategory. We can also see that theSouth Regionmade the lowest sales in all categories.Standard Classhas the highest ship mode rate of59.7%, followed bySecond Class,first ClassandSame Dayrespectively.West Regionhas the highest average profit.
- Increase focus categories: Based on the analysis, it is clear that some categories are more popular and profitable than others. The superstore should focus on these categories to increase their sales and profitability.
- Improve sales and distribution in underperforming regions: The analysis also reveals that some regions have lower sales and profitability compared to others. The superstore should invest in these regions by improving their sales and distribution strategies to increase their revenue.
- Monitor and optimize discounts: While offering discounts can be an effective way to increase sales, the analysis shows that it can also significantly impact the profitability of the superstore. The superstore should monitor and optimize their discount strategies to ensure that they are not impacting their profitability too much.
