Diabetes Triage Service

ML service for predicting diabetes progression risk and surfacing triage guidance (HIGH / LOW risk + nurse call‑to‑action) to help prioritize patient follow-ups in a virtual diabetes clinic.

Overview

This service provides a REST API that predicts disease progression scores based on patient vitals and lab results. Higher scores indicate greater risk of disease progression, helping triage nurses prioritize follow-up calls.

Quick Start

Using Docker (Recommended)

Pull and run the latest release:

# Pull the image
docker pull ghcr.io/theorjiugovictor/diabetes-triage-service:latest

# Run the service
docker run -p 8000:8000 ghcr.io/theorjiugovictor/diabetes-triage-service:latest

The API will be available at http://localhost:8000

Local Development

Clone the repository

git clone https://github.com/theorjiugovictor/diabetes-triage-service.git
cd diabetes-triage-service

Install dependencies
```
pip install -r requirements.txt
```
Train the model
```
python src/train.py
```

Run the API server

uvicorn src.api:app --host 0.0.0.0 --port 8000

Run tests
```
pytest tests/ -v
```

API Usage

Health Check

curl http://localhost:8000/health

Response:

{
   "status": "ok",
   "model_version": "0.2.0",
   "risk_threshold": 140.0
}

Predict Progression Score

curl -X POST http://localhost:8000/predict \
  -H "Content-Type: application/json" \
  -d '{
    "age": 0.02,
    "sex": -0.044,
    "bmi": 0.06,
    "bp": -0.03,
    "s1": -0.02,
    "s2": 0.03,
    "s3": -0.02,
    "s4": 0.02,
    "s5": 0.02,
    "s6": -0.001
  }'

Response:

{
   "prediction": 152.5,
   "model_version": "0.2.0",
   "high_risk": true,
   "risk_threshold": 140.0,
   "risk_level": "HIGH",
   "nurse_call_to_action": "Prioritize nurse follow-up within 24h; review labs; consider physician escalation."
}

Interactive API Documentation

Visit http://localhost:8000/docs for interactive Swagger UI documentation.

Features

Input Schema

All features are normalized values from patient records:

age: Age (normalized)
sex: Sex (normalized)
bmi: Body mass index (normalized)
bp: Average blood pressure (normalized)
s1: Total serum cholesterol (normalized)
s2: Low-density lipoproteins (normalized)
s3: High-density lipoproteins (normalized)
s4: Total cholesterol / HDL ratio (normalized)
s5: Log of serum triglycerides (normalized)
s6: Blood sugar level (normalized)

Response Format

Field	Description
`prediction`	Continuous progression score (higher = worse expected progression over next year)
`model_version`	Version string loaded from `models/version.txt`
`high_risk`	Boolean flag: `prediction >= risk_threshold`
`risk_threshold`	Threshold (float) loaded from `models/risk_threshold.txt` (fallback 140.0)
`risk_level`	Categorical label (`HIGH` / `LOW`) derived from `high_risk`
`nurse_call_to_action`	Recommended next step for the triage nurse / care team

Threshold & Triage Logic

Model outputs a regression score (not a probability, not a diagnosis).
A configurable threshold (default 140.0) binarizes the score into HIGH or LOW risk.
The API augments the response with a CTA string to guide next actions.

Update the threshold by editing (or creating) models/risk_threshold.txt with a single numeric value, then restarting the service.

Input Standardization

Inputs are standardized (z‑scores) as provided by the scikit‑learn diabetes dataset (mean ≈ 0, std ≈ 1). These are not raw clinical units. If you plan to submit raw values in the future, refactor the training pipeline to persist a StandardScaler and apply it inside the model pipeline at inference.

Model Details

Version 0.2.0 (Current)

Algorithm: Ridge Regression with polynomial features (degree 2)
Pipeline: PolynomialFeatures -> StandardScaler -> Ridge
Enhancements: Added triage thresholding & CTA fields in API
Training/Test Split: 80/20
Random Seed: 42 (reproducible)

Version 0.1.0 (Baseline)

Algorithm: Linear Regression
Preprocessing: Standard Scaling
Training/Test Split: 80/20
Random Seed: 42

See CHANGELOG.md for detailed metrics and improvements across versions.

Development Workflow

Creating a New Release

Update MODEL_VERSION in src/train.py
Make improvements and update CHANGELOG.md
Commit changes
Create and push a tag:
```
git tag v0.2.0
git push origin v0.2.0
```

The GitHub Actions workflow will automatically:

Train the model
Run tests
Build Docker image
Run smoke tests
Push to GitHub Container Registry
Create a GitHub Release

CI/CD Pipeline

PR/Push: Linting, tests, training smoke test, artifact upload
Tag (v)*: Full build, container tests, registry push, release creation

Project Structure

diabetes-triage-service/
├── .github/workflows/     # GitHub Actions workflows
├── src/
│   ├── train.py          # Training script
│   ├── api.py            # FastAPI service
│   └── utils.py          # Utility functions
├── tests/                # Test suite
├── models/               # Trained models (generated)
├── Dockerfile           # Container definition
├── requirements.txt     # Python dependencies
├── README.md           # This file
└── CHANGELOG.md        # Version history

Reproducibility

Python version: 3.13 (ensure dependency versions support this; see requirements.txt)
Dependencies pinned in requirements.txt
Random seeds set for deterministic training
All training data from scikit-learn (stable dataset)

License

MIT

Contributing

Fork the repository
Create a feature branch
Make changes and add tests
Submit a pull request

The CI pipeline will validate your changes automatically.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diabetes Triage Service

Overview

Quick Start

Using Docker (Recommended)

Local Development

API Usage

Health Check

Predict Progression Score

Interactive API Documentation

Features

Input Schema

Response Format

Threshold & Triage Logic

Input Standardization

Model Details

Version 0.2.0 (Current)

Version 0.1.0 (Baseline)

Development Workflow

Creating a New Release

CI/CD Pipeline

Project Structure

Reproducibility

License

Contributing

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
models		models
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
README.md		README.md
requirements.txt		requirements.txt

theorjiugovictor/diabetes-triage-service

Folders and files

Latest commit

History

Repository files navigation

Diabetes Triage Service

Overview

Quick Start

Using Docker (Recommended)

Local Development

API Usage

Health Check

Predict Progression Score

Interactive API Documentation

Features

Input Schema

Response Format

Threshold & Triage Logic

Input Standardization

Model Details

Version 0.2.0 (Current)

Version 0.1.0 (Baseline)

Development Workflow

Creating a New Release

CI/CD Pipeline

Project Structure

Reproducibility

License

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages