Customer churn is a critical business issue, especially in subscription-based industries. This project analyzes customer churn data using Python (Pandas, Matplotlib, Seaborn), and Excel to identify patterns and gain insights.
By performing data preprocessing, exploratory data analysis (EDA), and visualization, this project aims to help businesses reduce customer churn through actionable insights.
The main objectives of this analysis are:
β Data Cleaning & Preprocessing β Handling missing values, correcting data types, and ensuring consistency.
β Exploratory Data Analysis (EDA) β Finding trends and key factors influencing customer churn.
β Data Visualization β Creating graphs and charts for better interpretation.
β Business Insights β Understanding customer behavior and possible retention strategies.
Follow these steps to set up the project on your local machine:
1οΈβ£ Clone the Repository: (Download the project to your local system)
git clone https://github.com/sanjeevsingh74/Customer_Churn_Analysis.git
2οΈβ£ Navigate to the Project Directory:
bash
Copy
Edit
cd Customer_Churn_Analysis
3οΈβ£ Set Up a Virtual Environment (Optional but Recommended):
bash
Copy
Edit
python -m venv env
source env/bin/activate # On Windows, use `env\Scripts\activate`
4οΈβ£ Install Required Dependencies: (Make sure all required Python libraries are installed)
bash
Copy
Edit
pip install -r requirements.txt
5οΈβ£ Download the Dataset:
The dataset is available on Kaggle - Telco Customer Churn.
Place the WA_Fn-UseC_-Telco-Customer-Churn.csv file into the project folder.
6οΈβ£ Run the Jupyter Notebook:
bash
Copy
Edit
jupyter notebook Customer_Churn_Analysis.ipynb
π Usage Examples
β Running the Analysis: Open Customer_Churn_Analysis.ipynb and execute all the cells step by step.
β Customizing Visualizations: Modify matplotlib and seaborn parameters to adjust plots.
β Using SQL for Further Analysis: Run SQL queries to extract insights from structured data.
π» Technologies Used
Technology Purpose
Python Data analysis & scripting
Pandas Data manipulation
Matplotlib & Seaborn Data visualization
SQL Querying structured data
Excel Data organization & preprocessing
π Key Insights from Analysis
πΉ High Churn Rate: Customers with month-to-month contracts and no online security services are more likely to leave.
πΉ Billing Factor: Churn rate is higher among customers with paperless billing.
πΉ Senior Citizens: Older customers have a slightly higher churn percentage compared to younger customers.
πΉ Tenure Impact: Long-term customers are less likely to churn than new ones.
π Credits & Acknowledgments
π Dataset Source: Kaggle - Telco Customer Churn
π Inspiration: Understanding customer retention and churn reduction strategies.
π Author: Sanjeev Singh
π€ Contributions & Feedback
π‘ Want to improve this project?
Feel free to fork the repository and submit a pull request!
Open an issue if you have any suggestions or find a bug.
π§ Contact: LinkedIn
π If you found this project useful, donβt forget to β star the repository!
yaml
Copy
Edit
---
### β
**Steps to Add This to Your GitHub Repository:**
1οΈβ£ Open your **Customer_Churn_Analysis** repository on GitHub.
2οΈβ£ Click on **Add file β Create new file**.
3οΈβ£ Name the file **README.md**.
4οΈβ£ Paste the above content into the file.
5οΈβ£ Click **Commit changes**. ---