Cap max_tokens and n to prevent local API DoS by arditbe · Pull Request #3637 · nomic-ai/gpt4all

arditbe · 2025-12-06T02:21:09Z

My Changes

Added server-side upper bounds for max_tokens and n in BaseCompletionRequest::parseImpl.
Requests exceeding these limits are now rejected with HTTP 400 via InvalidRequestError, preventing memory and CPU exhaustion on /v1/completions and /v1/chat/completions.

Issue ticket number and link

Fixes #3635

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.
I have added thorough documentation for my code.
I have tagged PR with relevant project labels.
If this PR addresses a bug, I have provided both a screenshot/video of the original bug and the working solution.

Demo

N/A (validation change)

Steps to Reproduce

Start GPT4All with API server enabled.
Send a request with very large max_tokens or n.
Observe the server now responds with 400 instead of consuming excessive resources.

Notes

Limits used: max_tokens <= 4096, n <= 8. Happy to adjust per maintainer preference.
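
For reviewers who want the intent of the check at a glance, here is a minimal standalone sketch of the bounds described above. It is not the code in this PR: `checkCompletionLimits`, `ValidationError`, and the constant names are hypothetical stand-ins, and the real change lives inside BaseCompletionRequest::parseImpl and raises InvalidRequestError. Only the two limit values come from this PR. Assumes C++17 for `std::optional`.

```cpp
#include <optional>
#include <string>

// Limit values taken from the PR notes; the constant names are hypothetical.
constexpr int kMaxTokensLimit = 4096;
constexpr int kMaxCompletions = 8;

// Illustrative stand-in for the server's InvalidRequestError. The real type
// in the codebase may carry different fields.
struct ValidationError {
    int         httpStatus;
    std::string message;
};

// Returns an error when either value exceeds its upper bound, or
// std::nullopt when the request may proceed. Only upper bounds are checked
// here, because those are the only limits this PR describes.
std::optional<ValidationError> checkCompletionLimits(int maxTokens, int n)
{
    if (maxTokens > kMaxTokensLimit) {
        return ValidationError{
            400, "'max_tokens' must be at most " + std::to_string(kMaxTokensLimit)};
    }
    if (n > kMaxCompletions) {
        return ValidationError{
            400, "'n' must be at most " + std::to_string(kMaxCompletions)};
    }
    return std::nullopt;
}
```

A caller would run this check before any generation work begins and, if an error is returned, reply with its status and message instead of starting inference. Rejecting early is what keeps an oversized request from allocating memory or occupying the CPU.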

This PR adds server-side upper bounds for max_tokens and n in BaseCompletionRequest::parseImpl. Requests exceeding limits now return 400 via InvalidRequestError, preventing memory/CPU exhaustion on /v1/completions and /v1/chat/completions. Fixes nomic-ai#3635.

Signed-off-by: ardit <88629825+arditbe@users.noreply.github.com>

arditbe · 2025-12-06T02:23:53Z

Hi! This PR adds server-side caps for max_tokens and n in BaseCompletionRequest to prevent local API DoS on /v1/completions and /v1/chat/completions. Fixes #3635. Happy to adjust limits or add tests if you prefer.

arditbe added 2 commits December 6, 2025 03:12

Refactor parseImpl method for improved readability

db17d93

Signed-off-by: ardit <88629825+arditbe@users.noreply.github.com>

arditbe wants to merge 2 commits into nomic-ai:main from arditbe:fix-dos-max-tokens-n
