[ Classic | Neo ]
Stable Diffusion WebUI Forge is a platform on top of the original Stable Diffusion WebUI by AUTOMATIC1111, to make development easier, optimize resource management, speed up inference, and study experimental features.
The name "Forge" is inspired by "Minecraft Forge". This project aims to become the Forge of Stable Diffusion WebUI.
- lllyasviel
(paraphrased)
"Neo" mainly serves as an continuation for the "latest" version of Forge, which was built on Gradio 4.40.0 before lllyasviel became too busy... Additionally, this fork is focused on optimization and usability, with the main goal of being the lightest WebUI without any bloatwares.
Most base features of the original Automatic1111 Webui should still function
- Support Flux.2-Klein
4B,9Btxt2img,img2img,inpaint
- Support Z-Image
z-image,z-image-turbo
- Support Wan 2.2
txt2img,img2img,txt2vid,img2vid- use
Refinerto achieve High Noise / Low Noise switching- enable
Refinerin Settings/Refiner
- enable
Important
To export a video, you need to have FFmpeg installed
- Support Qwen-Image / Qwen-Image-Edit
txt2img/img2img,inpaint
Note
Since the layers between Qwen-Image and Qwen-Image-Edit are exactly the same, to be properly detected as an Edit model, the model needs to include "qwen" and "edit" in its path, either the file name or folder name.
- Support Flux Kontext
img2img,inpaint
Note
Since the layers between Flux-Dev, Flux-Krea, and Flux-Kontext are exactly the same, to be properly detected as a Kontext model, the model needs to include "kontext" in its path, either the file name or folder name.
- Support Multi-Image Inputs for Qwen-Image-Edit and Flux-Kontext
- Support Nunchaku (
SVDQ) Modelsflux-dev,flux-krea,flux-kontext,qwen-image,qwen-image-edit,z-image-turbo- support LoRA for
FluxandQwen - see Commandline
- Support Lumina-Image-2.0
Neta-Lumina,NetaYume-Lumina
- Support Chroma1-HD
Tip
Check out Download Models for where to get each model and the accompanying modules
Tip
Check out Inference References for how to use each model and the recommended parameters
- Rewrite Preset System
- now actually remembers the checkpoint/module selection and parameters for each preset
- Support uv package manager
- requires manually installing uv
- drastically speed up installation
- see Commandline
- Support SageAttention, FlashAttention,
fp16_accumulation,torch._scaled_mm- see Commandline
- Implement Triton Kernel for
matmulintorch.int8- speed up
bf16models - enable by selecting
int8in theDiffusion in Low Bits
- speed up
- Implement Seed Variance Enhancer
- improve seed-to-seed variance for distilled models
- Implement RescaleCFG
- reduce burnt colors; mainly for
v-predcheckpoints - enable in Settings/UI Alternatives
- reduce burnt colors; mainly for
- Implement MaHiRo
- alternative CFG calculation; improve prompt adherence
- enable in Settings/UI Alternatives
- Implement Epsilon Scaling
- enable in Settings/Stable Diffusion
- Support loading upscalers in
halfprecision- speed up; reduce quality
- enable in Settings/Upscaling
- Support running tile composition on GPU
- enable in Settings/Upscaling
- Update
spandrel- support new Upscaler architectures
- Add support for
.avif,.heif, and.jxlimage formats
- SD2
- SD3
- Forge Spaces
- Hypernetworks
- CLIP Interrogator
- Deepbooru Interrogator
- Textual Inversion Training
- Most built-in Extensions
- Some built-in Scripts
- Some Samplers
- Sampler in RadioGroup
- Unix
.shlaunch scripts- You can still use this WebUI by simply copying a launch script from other working WebUI
- [Comfy] Rewrite the Backend (
memory_management.py,ModelPatcher,attention.py, etc.) - No longer
gitcloneany repository on fresh install - Fix memory leak when switching checkpoints
- Speed up launch time
- Improve timer logs
- Remove unused
cmd_args - Remove unused
args_parser - Remove unused
shared_options - Remove legacy codes
- Fix some typos
- Fix automatic
Tiled VAEfallback - Pad conditioning for SDXL
- Remove redundant upscaler codes
- put every upscaler inside the
ESRGANfolder
- put every upscaler inside the
- Improve
ForgeCanvas- brush adjustments
- customization
- deobfuscate
- eraser
- hotkeys
- Optimize upscaler logics
- Optimize certain operations in
Spandrel - Improve memory management
- Improve color correction
- Update the implementation for
uni_pcandLCMsamplers - Update the implementation of LoRAs
- Revamp settings
- improve formatting
- update descriptions
- Check for Extension updates in parallel
- Move
embeddingsfolder intomodelsfolder - ControlNet Rewrite
- change Units to
gr.Tab - remove multi-inputs, as they are "misleading"
- change Units to
- Disable Refiner by default
- enable again in Settings/Refiner
- No longer install
bitsandbytesby default- see Commandline
- Lint & Format
- Update
Pillow- faster image processing
- Update
protobuf- faster
insightfaceloading
- faster
- Update to latest PyTorch
torch==2.10.0+cu130
Note
If your GPU does not support the latest PyTorch, manually install older version of PyTorch
- No longer install
open-cliptwice - Update some packages to newer versions
- Update recommended Python to
3.13.12 - many more... ™️
These flags can be added after the
set COMMANDLINE_ARGS=line in thewebui-user.bat(separate each flag with space)
Tip
Use python launch.py --help to see all available flags
--xformers: Install thexformerspackage to speed up generation--port: Specify a server port to use- defaults to
7860
- defaults to
--api: Enable API access
-
Add the following flags to slightly improve the model loading; in certain situations, they may cause
OutOfMemoryerrors instead...--cuda-malloc--cuda-stream--pin-shared-memory
-
--uv: Replace thepython -m pipcalls withuv pipto massively speed up package installation- requires uv to be installed first (see Installation)
-
--uv-symlink: Same as above; but additionally pass--link-mode symlinkto the commands- significantly reduces installation size (
~7 GBto~100 MB)
- significantly reduces installation size (
Important
Using symlink means it will directly access the packages from the cache folders; refrain from clearing the cache when setting this option
--model-ref: Points to a centralmodelsfolder that contains all your models- said folder should contain subfolders like
Stable-diffusion,Lora,VAE,ESRGAN, etc.
- said folder should contain subfolders like
Important
This simply replaces the models folder, rather than adding on top of it
-
--forge-ref-a1111-home: Point to an Automatic1111 installation to load itsmodelsfolders- i.e.
Stable-diffusion,text_encoder
- i.e.
-
--forge-ref-comfy-home: Point to a ComfyUI installation to load itsmodelsfolders- i.e.
diffusion_models,clip
- i.e.
--sage: Install thesageattentionpackage to speed up generation- will also attempt to install
tritonautomatically
- will also attempt to install
--flash: Install theflash_attnpackage to speed up generation--nunchaku: Install thenunchakupackage to inference SVDQ models--bnb: Install thebitsandbytespackage to do low-bits (nf4) inference--onnxruntime-gpu: Install theonnxruntimewith the latest GPU support
--fast-fp8: Use thetorch._scaled_mmfunction when the model type isfloat8_e4m3fn--fast-fp16: Enable theallow_fp16_accumulationoption--autotune: Enable thetorch.backends.cudnn.benchmarkoption- this is slower in my experience...
-
Install git
-
Clone the Repo
git clone https://github.com/Haoming02/sd-webui-forge-classic sd-webui-forge-neo --branch neo
-
Setup Python
Recommended Method
- Install uv
- Set up venv
cd sd-webui-forge-neo uv venv venv --python 3.13 --seed - Add the
--uvflag towebui-user.bat
Deprecated Method
- Install Python 3.13.12
- Remember to enable
Add Python to PATH
- Remember to enable
- (Optional) Configure Commandline
- Launch the WebUI via
webui-user.bat - During the first launch, it will automatically install all the requirements
- Once the installation is finished, the WebUI will start in a browser automatically
Tip
Check out Extra Installations for how to install git, uv, and FFmpeg
Important
The --xformers, --flash, and --sage args are only responsible for installing the packages, not whether its respective attention is used (this also means you can remove them once the packages are successfully installed)
Caution
Do not just blindly install all of them
Nowadays the native PyTorch scaled_dot_product_attention is usually as fast, and also more stable
Forge Neo tries to import the packages and automatically choose the first available attention function in the following order:
SageAttentionFlashAttentionxformersPyTorchBasic
Note
To skip a specific attention, add the respective disable arg such as --disable-sage
- Issues about removed features will simply be ignored
- Issues regarding installation will be ignored if it's obviously user-error
- Issues caused by StabilityMatrix will also be ignored
- only open an Issue if you can reproduce it on a clean install following the official Installation guide
- Linux, macOS, AMD, Intel will not be officially supported, as I cannot verify nor maintain them...
Tip
Check out the Wiki~
Special thanks to AUTOMATIC1111, lllyasviel, and comfyanonymous, kijai, city96,
along with the rest of the contributors,
for their invaluable efforts in the open-source image generation community
Buy me a Coffee~ ☕
