Psychiatric Disorder Detection using Machine Learning

A full-stack web application for mental health screening based on the DASS-42 (Depression Anxiety Stress Scales) questionnaire. This project demonstrates production-ready ML deployment with proper evaluation metrics, ethical considerations, and modern web technologies.

⚠️ Important Disclaimer: This tool is for educational and informational purposes only. It is NOT a medical diagnosis. If you are experiencing mental health concerns, please consult a qualified mental health professional.

🎯 Problem Motivation

Mental health disorders affect approximately 1 in 4 people globally, yet many cases remain undiagnosed due to stigma, lack of awareness, or limited access to professional evaluation. This project aims to provide an accessible screening tool that can help individuals become aware of potential mental health concerns-not to replace professional diagnosis, but to encourage those who may benefit from professional help to seek it.

📊 Dataset Description

Source: OpenPsychometrics DASS Dataset (also available on Kaggle)

Attribute	Value
Samples	39,775
Features	42 questions (30 selected via RFE)
Collection Period	2017-2019
Response Scale	1-4 (frequency of symptoms)
Target Classes	4 (None, Mild, Moderate, Severe)

The DASS-42 is a validated psychological instrument measuring three related negative emotional states:

Depression: Dysphoria, hopelessness, devaluation of life, self-deprecation, lack of interest
Anxiety: Autonomic arousal, skeletal muscle effects, situational anxiety, subjective experience
Stress: Difficulty relaxing, nervous arousal, easily upset/agitated, irritability

🔬 ML Methodology

Feature Selection

Used Recursive Feature Elimination (RFE) with Random Forest to select the 30 most predictive questions from the original 42, ensuring a balance between model performance and questionnaire length.

Data Preprocessing

StandardScaler applied to normalize feature distributions
Median imputation for missing values
Quartile-based severity classification

Models Trained & Compared

Model	Accuracy	F1-Score	ROC-AUC
Logistic Regression ★	92.0%	0.920	0.992
SVM (RBF kernel)	91.9%	0.919	0.992
Random Forest	89.4%	0.894	0.986
Gradient Boosting	89.2%	0.893	0.987

★ Selected as best model based on weighted F1-score

Evaluation Strategy

Stratified Train/Test Split (80/20) to maintain class distribution
5-Fold Cross-Validation for robust performance estimation
Multiple Metrics (see below) to avoid accuracy-only evaluation

📈 Why Accuracy Alone is Insufficient

In mental health screening, class imbalance and cost asymmetry make accuracy a misleading metric:

Issue	Impact
Class Imbalance	Most samples are "None" or "Mild" - a model predicting only these classes could achieve high accuracy while missing severe cases
False Negatives are Costly	Missing a severe case (predicting "None" when actual is "Severe") could delay critical intervention
False Positives Cause Anxiety	Over-predicting severity could cause unnecessary worry

This is why we report:

Precision: Of those predicted as a class, how many actually are?
Recall: Of actual cases, how many did we correctly identify?
F1-Score: Harmonic mean balancing precision and recall
ROC-AUC: Model's ability to distinguish between classes

⚖️ Ethical Considerations

Potential Harms

False Reassurance: Predicting "None" for someone who is struggling could delay help-seeking
Stigmatization: Predicting "Severe" could cause distress or be misused
Cultural Bias: DASS was developed in Western contexts; responses may vary cross-culturally

Mitigations Implemented

Clear Disclaimers: Prominent warnings that this is not a diagnosis
Probability Display: Showing confidence prevents over-reliance on single prediction
Actionable Recommendations: Encouraging professional consultation regardless of result
No Data Storage: We do not store individual responses

This Tool Is NOT:

A replacement for professional mental health evaluation
Suitable for clinical decision-making
Validated for crisis situations (please contact emergency services if in crisis)

🏗️ System Architecture

┌─────────────────┐     HTTP/JSON      ┌─────────────────┐
│                 │  ──────────────▶   │                 │
│   Next.js       │                    │   FastAPI       │
│   Frontend      │  ◀──────────────   │   Backend       │
│   (Vercel)      │     Prediction     │   (HF Space)    │
│                 │                    │                 │
└─────────────────┘                    └────────┬────────┘
                                                │
                                                ▼
                                       ┌─────────────────┐
                                       │   Trained ML    │
                                       │   Model         │
                                       │   (joblib)      │
                                       └─────────────────┘

🚀 Quick Start

Prerequisites

Python 3.10+
Node.js 18+
npm or yarn

1. Train the Model (Google Colab)

We recommend using Google Colab for training to leverage free GPU resources:

Open ml/train_colab.ipynb in Google Colab
Run all cells to:
- Download the DASS dataset from Kaggle
- Train and compare 4 classifiers (Logistic Regression, Random Forest, SVM, Gradient Boosting)
- Apply StandardScaler for feature normalization
- Generate evaluation metrics and confusion matrices
- Save the best model to your Drive or download directly
Copy these files to backend/models/:
- psychiatric_model.joblib (trained model)
- scaler.joblib (fitted StandardScaler)
- feature_names.json (feature configuration)

Current Best Model: Logistic Regression with 92.0% accuracy and 99.2% ROC-AUC

2. Start the Backend

cd backend
pip install -r requirements.txt
uvicorn app.main:app --reload --port 8000

API docs will be available at http://localhost:8000/docs

3. Start the Frontend

cd frontend
npm install
npm run dev

Open http://localhost:3000 in your browser.

📦 Project Structure

.
├── backend/                 # FastAPI backend
│   ├── app/
│   │   ├── main.py         # API endpoints
│   │   ├── schemas.py      # Pydantic models
│   │   └── model.py        # ML model loading
│   ├── models/             # Saved ML models
│   ├── Dockerfile
│   └── requirements.txt
├── frontend/               # Next.js frontend
│   ├── app/
│   │   └── page.tsx       # Questionnaire UI
│   └── package.json
├── ml/                     # ML training
│   ├── train.py           # Training script
│   ├── config.py          # Configuration
│   └── outputs/           # Training artifacts
├── data/                   # Dataset documentation
└── README.md

🌐 Deployment

Backend on Hugging Face Spaces (Recommended)

Create a new Space on Hugging Face
Select Docker as the Space SDK
Upload the contents of the backend directory to the Space
The Space will build automatically using the provided Dockerfile
Note the Space URL (e.g., https://your-username-space-name.hf.space) for frontend configuration

Frontend on Vercel

Import project to Vercel
Set root directory to frontend
Add environment variable: NEXT_PUBLIC_API_URL=https://your-backend-url.run.app
Deploy

🔮 Future Improvements

Add longitudinal tracking (with proper consent and security)
Implement SHAP values for prediction explanations
Add multi-language support
Create progressive web app (PWA) for offline access
Conduct formal validation study
Add comparison with other validated screening tools

📚 References

Lovibond, S.H. & Lovibond, P.F. (1995). Manual for the Depression Anxiety Stress Scales. Sydney: Psychology Foundation.
OpenPsychometrics DASS Dataset
DASS Official Website

📄 License

This project is for educational purposes. The DASS questionnaire is in the public domain.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
backend		backend
data		data
frontend		frontend
ml		ml
.gitignore		.gitignore
MODEL_CARD.md		MODEL_CARD.md
PDD_report.pdf		PDD_report.pdf
README.md		README.md
dev.bat		dev.bat
shared_config.json		shared_config.json
training_code_old.ipynb		training_code_old.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Psychiatric Disorder Detection using Machine Learning

🎯 Problem Motivation

📊 Dataset Description

🔬 ML Methodology

Feature Selection

Data Preprocessing

Models Trained & Compared

Evaluation Strategy

📈 Why Accuracy Alone is Insufficient

⚖️ Ethical Considerations

Potential Harms

Mitigations Implemented

This Tool Is NOT:

🏗️ System Architecture

🚀 Quick Start

Prerequisites

1. Train the Model (Google Colab)

2. Start the Backend

3. Start the Frontend

📦 Project Structure

🌐 Deployment

Backend on Hugging Face Spaces (Recommended)

Frontend on Vercel

🔮 Future Improvements

📚 References

📄 License

About

Uh oh!

Releases

Packages

Languages

FazlulKarimC/Detection-of-Psychiatric-Disorder-using-ML

Folders and files

Latest commit

History

Repository files navigation

Psychiatric Disorder Detection using Machine Learning

🎯 Problem Motivation

📊 Dataset Description

🔬 ML Methodology

Feature Selection

Data Preprocessing

Models Trained & Compared

Evaluation Strategy

📈 Why Accuracy Alone is Insufficient

⚖️ Ethical Considerations

Potential Harms

Mitigations Implemented

This Tool Is NOT:

🏗️ System Architecture

🚀 Quick Start

Prerequisites

1. Train the Model (Google Colab)

2. Start the Backend

3. Start the Frontend

📦 Project Structure

🌐 Deployment

Backend on Hugging Face Spaces (Recommended)

Frontend on Vercel

🔮 Future Improvements

📚 References

📄 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages