The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Go to file
2023-03-14 21:25:52 +00:00
.ci Fix small issue with build. 2023-03-13 15:09:11 -04:00
.github/workflows Remove omegaconf dependency and some ci changes. 2023-03-13 14:49:18 -04:00
comfy Make --cpu have priority over everything else. 2023-03-13 21:30:01 -04:00
comfy_extras Put image upscaling nodes in image/upscaling category. 2023-03-11 18:10:36 -05:00
custom_nodes Fix a few issues with the custom_nodes PR. 2023-02-17 11:19:49 -05:00
input Add a LoadImage node to load images for img2img. 2023-01-22 15:07:18 -05:00
models Take some code from chainner to implement ESRGAN and other upscale models. 2023-03-11 13:09:28 -05:00
notebooks Add WD VAE to colab. 2023-03-11 22:08:00 -05:00
output Initial commit. 2023-01-16 22:37:14 -05:00
script_examples Switch the default workflow to the CheckpointLoaderSimple node. 2023-03-05 03:00:28 -05:00
web Explain why animation frame used 2023-03-14 21:25:52 +00:00
.gitignore add loras to ignore 2023-03-03 15:19:56 +00:00
comfyui_screenshot.png Initial commit. 2023-01-16 22:37:14 -05:00
execution.py Updated to reuse session id if available 2023-03-07 13:24:15 +00:00
LICENSE Initial commit. 2023-01-16 22:37:14 -05:00
main.py Add pytorch attention support to VAE. 2023-03-13 12:45:54 -04:00
nodes.py Put image upscaling nodes in image/upscaling category. 2023-03-11 18:10:36 -05:00
README.md Move colab link to the installing section. 2023-03-13 17:50:48 -04:00
requirements.txt Remove omegaconf dependency and some ci changes. 2023-03-13 14:49:18 -04:00
server.py Xformers is now properly disabled when --cpu used. 2023-03-12 15:44:16 -04:00

ComfyUI

A powerful and modular stable diffusion GUI.

ComfyUI Screenshot

This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out:

ComfyUI Examples

Features

  • Nodes/graph/flowchart interface to experiment and create complex Stable Diffusion workflows without needing to code anything.
  • Fully supports SD1.x and SD2.x
  • Asynchronous Queue system
  • Many optimizations: Only re-executes the parts of the workflow that changes between executions.
  • Command line option: --lowvram to make it work on GPUs with less than 3GB vram (enabled automatically on GPUs with low vram)
  • Works even if you don't have a GPU with: --cpu (slow)
  • Can load both ckpt and safetensors models/checkpoints. Standalone VAEs and CLIP models.
  • Embeddings/Textual inversion
  • Loras (regular and locon)
  • Loading full workflows (with seeds) from generated PNG files.
  • Saving/Loading workflows as Json files.
  • Nodes interface can be used to create complex workflows like one for Hires fix or much more advanced ones.
  • Area Composition
  • Inpainting with both regular and inpainting models.
  • ControlNet and T2I-Adapter
  • Upscale Models (ESRGAN, ESRGAN variants, SwinIR, Swin2SR, etc...)
  • Starts up very fast.
  • Works fully offline: will never download anything.

Workflow examples can be found on the Examples page

Installing

Windows

There is a portable standalone build for Windows that should work for running on Nvidia GPUs or for running on your CPU only on the releases page.

Just download, extract and run. Make sure you put your Stable Diffusion checkpoints/models (the huge ckpt/safetensors files) in: ComfyUI\models\checkpoints

Colab Notebook

To run it on colab or paperspace you can use my Colab Notebook here: Link to open with google colab

Manual Install (Windows, Linux)

Git clone this repo.

Put your SD checkpoints (the huge ckpt/safetensors files) in: models/checkpoints

Put your VAE in: models/vae

At the time of writing this pytorch has issues with python versions higher than 3.10 so make sure your python/pip versions are 3.10.

AMD (Linux only)

AMD users can install rocm and pytorch with pip if you don't have it already installed, this is the command to install the stable version:

pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/rocm5.2

I highly recommend you use the nightly/unstable pytorch builds though because they work a lot better for me (run this in the ComfyUI folder so it picks up the requirements.txt):

pip install --upgrade --pre torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/nightly/rocm5.4.2 -r requirements.txt

NVIDIA

Nvidia users should install torch using this command:

pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu117

Nvidia users should also install Xformers for a speed boost but can still run the software without it.

pip install xformers

Troubleshooting

If you get the "Torch not compiled with CUDA enabled" error, uninstall torch with:

pip uninstall torch

And install it again with the command above.

Dependencies

Install the dependencies by opening your terminal inside the ComfyUI folder and:

pip install -r requirements.txt

After this you should have everything installed and can proceed to running ComfyUI.

I already have another UI for Stable Diffusion installed do I really have to install all of these dependencies?

You don't. If you have another UI installed and working with it's own python venv you can use that venv to run ComfyUI. You can open up your favorite terminal and activate it:

source path_to_other_sd_gui/venv/bin/activate

or on Windows:

With Powershell: "path_to_other_sd_gui\venv\Scripts\Activate.ps1"

With cmd.exe: "path_to_other_sd_gui\venv\Scripts\activate.bat"

And then you can use that terminal to run Comfyui without installing any dependencies. Note that the venv folder might be called something else depending on the SD UI.

Running

python main.py

For AMD 6700, 6600 and maybe others

Try running it with this command if you have issues:

HSA_OVERRIDE_GFX_VERSION=10.3.0 python main.py

Notes

Only parts of the graph that have an output with all the correct inputs will be executed.

Only parts of the graph that change from each execution to the next will be executed, if you submit the same graph twice only the first will be executed. If you change the last part of the graph only the part you changed and the part that depends on it will be executed.

Dragging a generated png on the webpage or loading one will give you the full workflow including seeds that were used to create it.

You can use () to change emphasis of a word or phrase like: (good code:1.2) or (bad code:0.8). The default emphasis for () is 1.1. To use () characters in your actual prompt escape them like \( or \).

You can use {day|night}, for wildcard/dynamic prompts. With this syntax "{wild|card|test}" will be randomly replaced by either "wild", "card" or "test" by the frontend every time you queue the prompt. To use {} characters in your actual prompt escape them like: \{ or \}.

To use a textual inversion concepts/embeddings in a text prompt put them in the models/embeddings directory and use them in the CLIPTextEncode node like this (you can omit the .pt extension):

embedding:embedding_filename.pt

Fedora

To get python 3.10 on fedora: dnf install python3.10

Then you can:

python3.10 -m ensurepip

This will let you use: pip3.10 to install all the dependencies.

How to increase generation speed?

The fp16 model configs in the CheckpointLoader can be used to load them in fp16 mode, depending on your GPU this will increase your gen speed by a significant amount.

You can also set this command line setting to disable the upcasting to fp32 in some cross attention operations which will increase your speed. Note that this will very likely give you black images on SD2.x models.

--dont-upcast-attention

Support and dev channel

Matrix space: #comfyui_space:matrix.org (it's like discord but open source).

QA

Why did you make this?

I wanted to learn how Stable Diffusion worked in detail. I also wanted something clean and powerful that would let me experiment with SD without restrictions.

Who is this for?

This is for anyone that wants to make complex workflows with SD or that wants to learn more how SD works. The interface follows closely how SD works and the code should be much more simple to understand than other SD UIs.