Slow SQL Detection with OpenTelemetry

A sample application demonstrating how to detect and monitor slow SQL queries using OpenTelemetry.

This sample accompanies the blog post: How to Turn Slow Queries into Actionable Reliability Metrics with OpenTelemetry

Overview

This project demonstrates a practical implementation of slow SQL detection by:

Emitting database spans with OpenTelemetry semantic conventions from Go services
Distilling metrics from spans using the OpenTelemetry Collector's spanmetricsconnector
Detecting anomalies using PromQL-based adaptive thresholds
Visualizing results in Grafana dashboards, with docker-otel-lgtm

Prerequisites

Docker

Quick Start

Bring up all the services with docker-compose, wait for the load generator to create some traffic, and then go to http://localhost:3001 to take a look at the dashboards.

# Start all services
docker-compose up -d

# Check service status
docker-compose ps

# Stop all services
docker-compose down

Services

The sample app comes with a a load generator, so there is no need for you to create requests, the access to the ports are there just for developing the app. You should be most interested in going to the Grafana UI at http://localhost:3001

Service	Port	Description
`album-user-api`	8080	User-facing API with album queries
`album-admin-api`	8081	Admin API with statistics and analytics
`postgres`	5432	PostgreSQL database
`otelcol`	4317/4318	OpenTelemetry Collector (gRPC/HTTP)
`lgtm`	3001	Grafana with Loki, Tempo, and Mimir
`traffic`	-	Traffic generator for demo purposes

Viewing Dashboards

Open Grafana at http://localhost:3001
Login with admin / admin
Navigate to Dashboards → Slow SQL Dashboards

Four dashboards are included:

v1: Basic slow query metrics
v2: Query impact analysis (latency × call rate)
v3: Anomaly detection with adaptive thresholds
Unified: Single dashboard combining all three views below (By Duration, Traffic Weighted, Anomaly Detection)

Anomaly Detection

The sample includes rules for the PromQL Anomaly Detection Framework that:

Calculate adaptive upper/lower bounds based on historical patterns
Detect queries that deviate significantly from their baseline
Support both "adaptive" (short-term) and "robust" (long-term) strategies

Rules are located in prometheus-rules/ and automatically loaded by the LGTM stack.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
db		db
grafana/provisioning		grafana/provisioning
images		images
prometheus-rules		prometheus-rules
.gitignore		.gitignore
ANOMALY_DETECTION_SETUP.md		ANOMALY_DETECTION_SETUP.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum
otelcol.yaml		otelcol.yaml
prometheus-custom.yaml		prometheus-custom.yaml
slowsql-anomalies-dashboard.json		slowsql-anomalies-dashboard.json
slowsql-dashboard-unified.json		slowsql-dashboard-unified.json
slowsql-dashboard-v1.json		slowsql-dashboard-v1.json
slowsql-dashboard-v2.json		slowsql-dashboard-v2.json
slowsql-dashboard-v3.json		slowsql-dashboard-v3.json
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Slow SQL Detection with OpenTelemetry

Overview

Prerequisites

Quick Start

Services

Viewing Dashboards

Anomaly Detection

License

About

Uh oh!

Languages

License

causely-oss/slow-query-lab

Folders and files

Latest commit

History

Repository files navigation

Slow SQL Detection with OpenTelemetry

Overview

Prerequisites

Quick Start

Services

Viewing Dashboards

Anomaly Detection

License

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Languages