This project focuses on customer segmentation to increase revenue through more targeted marketing strategies. Using clustering algorithms, we divided Amazon's customers into meaningful groups based on their purchasing behaviors. The primary goal is to develop strategies that enhance customer engagement and drive higher revenue.
- Project Description
- Key Features
- Dataset
- Methodology
- Results and Analysis
- Business Recommendations
- How to Run the Project
- License
- Contact
Customer segmentation involves dividing customers into groups based on similar characteristics or behaviors, such as purchase frequency, spending amount, and product preferences. This project aims to:
- Increase customer engagement.
- Identify high-value customers.
- Develop data-driven marketing strategies.
- Automated Segmentation: Use clustering algorithms to automatically segment customers.
- Revenue Enhancement: Segment-based strategies to boost Average Order Value (AOV) and purchase frequency.
- Customer Retention: Strategies to retain high-value customers.
The dataset contains Amazon's sales transaction data from 2020 and 2021, with a total of 286,392 records across 35 columns. It includes:
- Transaction Details: Order ID, date, quantity, payment method, and total value.
- Customer Information: Customer ID, age, gender, region, and contact information.
- Product Details: SKU, category, and discounts.
- Geographical and Temporal Data: City, state, zip code, and month of purchase.
- Data Preprocessing: Validating missing values, handling outliers, and ensuring correct data types.
- Clustering: Using the K-Means algorithm, which achieved the best silhouette score.
- Model Evaluation: Metrics such as Silhouette Score, Davies-Bouldin Index, and Calinski-Harabasz Score were used to evaluate clustering quality.
This project identified four distinct customer clusters with unique characteristics:
- Cluster 0: Loyal customers with low purchase frequency.
- Cluster 1: The largest group with high volume and moderate AOV.
- Cluster 2: Tech-focused customers benefiting from high discounts.
- Cluster 3: Premium customers with high-value orders.
Simulations suggest potential revenue increases of up to 92% with targeted strategies.
- Increase Purchase Frequency: Offer small discounts (2%-5%) to boost order volume.
- Upselling: Promote premium products and bundling offers.
- Customer Retention: Introduce loyalty programs for dormant customers and VIP programs for high-value customers.
- Targeted Marketing: Use personalized campaigns based on cluster characteristics.
- Clone the repository:
git clone https://github.com/Arsan24/Customer-Segmentation.git
This project is licensed under the MIT License. See the LICENSE file for details.
For any inquiries or further information, feel free to contact:
- Email: arsan.nuzhairil@gmail.com
- GitHub Profile: Arsan24
- LinkedIn: Nuzhairil Arsanurrahman