Replies: 1 comment
I had gpt-oss-20b "working" with local Ollama, but the AI chat was just spitting out text whose formatting was unusable. I haven't tried a paid online service.
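For anyone debugging this, here is a minimal sketch of querying the local Ollama server directly and printing the raw output, to check whether the broken formatting comes from the model itself or from the chat front end's rendering. It assumes Ollama is serving on its default port and that the model was pulled under the `gpt-oss:20b` tag listed in the Ollama library; the prompt is just an example.

```python
# Minimal sketch: ask a local Ollama server for one completion and print
# the raw text, bypassing any chat-UI markdown rendering.
# Assumptions: Ollama is serving on its default port (11434) and the model
# was pulled as "gpt-oss:20b" (the tag in the Ollama model library).
import json
import urllib.request

payload = {
    "model": "gpt-oss:20b",
    "messages": [{"role": "user", "content": "Explain MoE models in two sentences."}],
    "stream": False,  # return one JSON object instead of a token stream
}

req = urllib.request.Request(
    "http://localhost:11434/api/chat",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    body = json.load(resp)

# If this raw markdown looks fine but the chat window does not, the problem
# is the UI's rendering rather than the model's output.
print(body["message"]["content"])
```

If the raw text is already mangled here, the issue is more likely in the model or its chat template than in the chat UI.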
Just read the news and am wondering if anyone has tested the gpt-oss-20b and/or gpt-oss-120b models (OpenAI's new open-weight GPT-OSS LLMs) via Ollama?
https://openai.com/index/introducing-gpt-oss/
https://openai.com/index/gpt-oss-model-card/
FYI, I understand that the gpt-oss-20b model requires 16GB of (GPU/APU) memory, while the gpt-oss-120b model requires 80GB.
gpt-oss-20b — for lower-latency, local, or specialized use cases (21B parameters, 3.6B active)
gpt-oss-120b — for production, general-purpose, high-reasoning use cases; fits on a single H100 GPU (117B parameters, 5.1B active)
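For anyone who wants to try it, here is a minimal sketch of pulling the 20b model through Ollama's local REST API and running a quick smoke test. It assumes Ollama is installed and serving on its default port, and that `gpt-oss:20b` is the tag from the Ollama model library; the prompt is arbitrary.

```python
# Minimal sketch: pull gpt-oss-20b via a local Ollama server's REST API,
# then run one non-streaming completion as a smoke test.
# Assumptions: Ollama is installed and serving on its default port (11434);
# "gpt-oss:20b" is the tag from the Ollama model library.
import json
import urllib.request

OLLAMA = "http://localhost:11434"

def post(path: str, payload: dict) -> dict:
    """POST a JSON payload to the Ollama API and decode the JSON reply."""
    req = urllib.request.Request(
        OLLAMA + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# Equivalent to `ollama pull gpt-oss:20b`; with stream=False the server
# replies once the (multi-gigabyte) download has finished.
post("/api/pull", {"model": "gpt-oss:20b", "stream": False})

# One completion to confirm the model loads and answers.
reply = post("/api/generate", {
    "model": "gpt-oss:20b",
    "prompt": "In one sentence, what is gpt-oss-20b?",
    "stream": False,
})
print(reply["response"])
```

The same two steps can of course be done from the shell with `ollama pull gpt-oss:20b` and `ollama run gpt-oss:20b`; the API route is just easier to script.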
PS: These new LLMs are apparently a game changer for offline reasoning AI tasks. Check out this quick explanation/review by Matt Wolfe: