This project analyses patient data to assess the impact of smoking on organ health. The project covers the full analytic workflow—data cleaning with Power Query, building a data model, creating advanced DAX measures, and designing an interactive dashboard with KPIs, charts, and slicers. It demonstrates how to turn clinical data into visual insights for identifying high-risk patient groups and supporting preventive healthcare decisions.
- Author: Isaac Emmanuel
- About: This synthetic dataset is designed to analyze the impact of lifestyle factors, particularly smoking, on patient organ health. It contains individual patient records with demographic, behavioral, and clinical indicators, including smoking history, BMI, blood pressure risk, cholesterol, and family history.
- Analyze the patient cohort by key demographics (Age Group, Gender) and behavioral profiles (Smoking Status).
- Determine the core relationships between smoking metrics (Years_of_Smoking, Cigarettes_Per_Day) and the primary health outcome (Organ_Condition).
- Uncover risk factor trends by quantifying the correlation of additional variables (BMI, BP_Risk, Cholesterol, Alcohol_Consumption) with organ health.
- Identify high-risk patient segments by synthesizing demographic, lifestyle, and clinical data to pinpoint groups most susceptible to organ damage.
- Synthesize actionable insights into an interactive dashboard for data-driven decision-making in preventive healthcare and patient education.
| Organ | Healthy | Damaged |
|---|---|---|
| Heart | ![]() |
![]() |
| Body | ![]() |
![]() |
| Kidney | ![]() |
![]() |
| Liver | ![]() |
![]() |
| Lungs | ![]() |
![]() |
- Smoking (both current and former status) is the most significant behavioral factor correlated with organ damage, especially for the Lungs and Heart.
- The impact is systemic. While lung damage is expected, the data shows the Liver (37.7% damaged) and Kidney (37.0%) are actually the most affected organs, underscoring that smoking causes whole-body harm beyond the respiratory system.
- Organ damage is not just about current status but cumulative lifetime exposure.
- The "Avg Years of Smoking" and "Cigarettes Per Day" metrics consistently rise with age in damaged organ groups, showing a dose-response relationship.
- This intensification is behavioral. There is a strong positive correlation (0.73) between years smoked and daily intake, meaning long-term smokers tend to increase their habit, compounding risk instead of plateauing.
- Male patients show a higher prevalence of current and former smoking status across damaged organ analyses, indicating a gender disparity in high-risk smoking behavior within this dataset.
- Over 54% of the patient population has a history of smoking (Current + Former). This means smoking-related risks are a majority concern, affecting core population health metrics rather than a small, isolated group.
- Health damage from smoking is not static. Older age groups demonstrate the culmination of this progression, with higher smoking duration, higher daily intake, and greater metabolic risk. This creates a clear trajectory where the health consequences of smoking accumulate decisively over decades.
- Excel
- Microsoft Power BI









