GitHub - nimadez/mental-diffusion: Fast Stable Diffusion CLI Gradio

Mental Diffusion

Fast Stable Diffusion CLI
Powered by Hugging Face & Diffusers
Designed for Linux

MDX	0.9.4
Python	3.11 - 3.12
Torch	cu121
Diffusers	0.30.0
Gradio	4.37.2

MDX is tested-stable and maintained for low-end hardware. If quality is what you want, you'll need to use online services or pick up heavier models like Flux.

Features

SD, SDXL
Load VAE and LoRA weights
Txt2Img, Img2Img, Inpaint (auto-pipeline)
TAESD latents preview (image and animation)
Batch image generation, multiple images per prompt
Read/write PNG metadata, auto-rename files
CPU, GPU, Low VRAM mode (load SDXL with 4 GB)
Lightweight and fast, rewritten in 300 lines
Proxy, offline mode, minimal downloads
Gradio user-interface (mdx-ui.py)
Real-ESRGAN x2 and x4 script (mdx-upscale.py)

Installation

3.5 GB python packages (5.5 GB extracted)

Compatible with most diffusers-based python venvs

Make sure you have a swap partition or swap file

git clone https://github.com/nimadez/mental-diffusion
cd mental-diffusion

# Automatic installation:
sudo apt install python3-pip python3-venv
sh install-venv.sh

# Manual installation:
python3 -m venv ~/.venv/mdx
source ~/.venv/mdx/bin/activate
pip install torch torchvision --extra-index-url https://download.pytorch.org/whl/cu121
pip install -r ./requirements.txt
deactivate

Install Gradio and Zenity for the user interface:

~/.venv/mdx/bin/python3 -m pip install gradio==4.37.2
sudo apt install zenity

* Zenity: If you don't have a GNOME desktop, you can't use the file dialog to select .safetensors files; you have to enter the Checkpoint, LoRA, and VAE paths manually.

Arguments

~/.venv/mdx/bin/python3 mdx.py --help

--type        -t    str     sd, xl (def: custom)
--checkpoint  -c    str     /checkpoint.safetensors (def: custom)
--scheduler   -sc   str     ddim, ddpm, euler, eulera, lcm, lms, pndm (def: custom)
--prompt      -p    str     positive prompt
--negative    -n    str     negative prompt
--width       -w    int     divisible by 8 (def: custom)
--height      -h    int     divisible by 8 (def: custom)
--seed        -s    int     -1 randomize (def: -1)
--steps       -st   int     1 to 100  (def: 24)
--guidance    -g    float   0 - 20.0  (def: 8.0)
--strength    -sr   float   0 - 1.0 (def: 1.0)
--lorascale   -ls   float   0 - 1.0 (def: 1.0)
--image       -i    str     /image.png
--mask        -m    str     /mask.png
--vae         -v    str     /vae.safetensors
--lora        -l    str     /lora.safetensors
--filename    -f    str     filename prefix without .png extension, add {seed} to be replaced (def: img_{seed})
--output      -o    str     image and preview output directory (def: custom)
--number      -no   int     number of images to generate per prompt (def: 1)
--batch       -b    int     number of repeats to run in batch, --seed -1 to randomize
--preview     -pv           stepping is slower with preview enabled (def: no preview)
--lowvram     -lv           slower if you have enough VRAM, automatic on 4GB cards (def: no lowvram)
--metadata    -meta str     /image.png, extract metadata from png

[automatic pipeline]
Txt2Img: no --image and no --mask
Img2Img: --image and no --mask
Inpaint: --image and --mask
ERROR:   no --image and --mask

Default:    mdx -p "prompt" -st 28 -g 7.5
SD:         mdx -t sd -c /checkpoint.safetensors -w 512 -h 512
SDXL:       mdx -t xl -c /checkpoint.safetensors -w 768 -h 768
Img2Img:    mdx -i /image.png -sr 0.5
Inpaint:    mdx -i /image.png -m ./mask.png
VAE:        mdx -v /vae.safetensors
LoRA:       mdx -l /lora.safetensors -ls 0.5
Filename:   mdx -f img_test_{seed}
Output:     mdx -o /home/user/.mdx
Number:     mdx -no 4
Batch:      mdx -b 10
Preview:    mdx -pv
Low VRAM:   mdx -lv
Metadata:   mdx -meta ./image.png
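
The [automatic pipeline] rules and a few of the argument constraints listed above can be sketched in Python. This is an illustration only, not MDX's actual code: the function names `select_pipeline`, `check_size`, and `expand_filename` are hypothetical, and the `.png` suffix is inferred from the `--filename` help text.

```python
from typing import Optional


def select_pipeline(image: Optional[str], mask: Optional[str]) -> str:
    """Pick a mode from the presence of --image and --mask."""
    if mask and not image:
        # Mirrors the ERROR row: a mask needs an image to apply to.
        raise ValueError("--mask requires --image")
    if image and mask:
        return "inpaint"
    if image:
        return "img2img"
    return "txt2img"


def check_size(width: int, height: int) -> None:
    """Reject --width / --height values that are not divisible by 8."""
    for name, value in (("width", width), ("height", height)):
        if value <= 0 or value % 8 != 0:
            raise ValueError(f"{name} must be a positive multiple of 8, got {value}")


def expand_filename(prefix: str, seed: int) -> str:
    """Replace the {seed} placeholder and append the .png extension."""
    return prefix.replace("{seed}", str(seed)) + ".png"
```

For example, `select_pipeline("/image.png", None)` returns `"img2img"`, and `expand_filename("img_{seed}", 42)` returns `"img_42.png"`.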

mdx-upscale --help
mdx-upscale -i ./image.png
mdx-upscale -i ./image.png -m x2
mdx-upscale -i ./image.png -m x4 -o ~/Downloads

(The screenshot is optimized for viewer patience: 4 GB VRAM, Kernel 6.9.7 bpo)

User Interface

~/.venv/mdx/bin/python3 src/mdx-ui.py
sh mdx-ui
sh mdx-ui-dev   # development mode (auto reload)
open http://localhost:8011

Direct Inference

Import the MDX class to run inference from JSON data:

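import argparse
import json

from mdx import MDX

# "data" is a JSON string; its keys become attributes of the args namespace
data = json.loads(data)
data["prompt"] = "new prompt"

# Seed the namespace with the JSON fields instead of command-line values
parser = argparse.ArgumentParser()
args = parser.parse_args(namespace=argparse.Namespace(**data))

MDX().main(args)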
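
The `parse_args(namespace=...)` call above relies on standard `argparse` behavior: attributes already present on the namespace are kept, and parser defaults only fill in what is missing. Here is a minimal sketch of that mechanism using only the standard library. The two `add_argument` definitions are stand-ins for illustration, not MDX's real parser.

```python
import argparse
import json

# Stand-in parser: these two arguments are illustrative only.
parser = argparse.ArgumentParser()
parser.add_argument("--prompt", "-p", type=str, default="")
parser.add_argument("--steps", "-st", type=int, default=24)

# JSON data as it might arrive from another program.
data = json.loads('{"prompt": "a lighthouse", "width": 512}')
data["prompt"] = "new prompt"

# args=[] ignores sys.argv, so only the namespace and parser defaults apply.
args = parser.parse_args(args=[], namespace=argparse.Namespace(**data))

print(args.prompt)  # taken from the JSON data, not the parser default
print(args.steps)   # parser default, because the JSON had no "steps" key
print(args.width)   # keys unknown to the parser stay on the namespace
```

With an empty parser, as in the snippet above, nothing fills in missing fields, so the JSON data presumably has to carry every field that `MDX().main` reads.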

Inference can be interrupted by creating a file named ".interrupt" in the --output directory (e.g. as mdx-ui.py does).
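
A minimal sketch of requesting an interrupt from another Python process, assuming only what is stated above (an empty file named ".interrupt" in the output directory). The helper name `request_interrupt` is hypothetical, and whether MDX removes the file afterwards is not stated here.

```python
from pathlib import Path


def request_interrupt(output_dir: str) -> Path:
    """Create an empty ".interrupt" file in the given output directory."""
    flag = Path(output_dir).expanduser() / ".interrupt"
    flag.touch()
    return flag
```

For example, `request_interrupt("~/.mdx")` targets the output directory used in the examples in this README.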

Tips & Tricks

* Enable OFFLINE if you have already downloaded the huggingface cache
* Enable SAVE_ANIM to save the preview animation to {output}/filename.webp
* The model may be slow to load on first launch, but reloading SDXL with 4 GB of VRAM takes only about a second

Preview, cancel, and repeat faster:
mdx -p "prompt" -g 8.0 -st 30 -pv
mdx -p "prompt" -g 8.0 -st 30 -s 827362763262387

Content-aware upscaling: (ImageMagick)
mdx -p "prompt" -st 20 -w 512 -h 512 -f image
magick convert ~/.mdx/image.png -resize 200% ~/.mdx/image_up.png
mdx -p "prompt" -st 20 -i ~/.mdx/image_up.png -sr 0.5

Generate 40 images in less time:
mdx -p "prompt" -b 10 -no 4

Extract images from WebP animation: (ImageMagick)
magick convert image.webp image_%03d.jpg

Explore output directory in a browser across the LAN:
cd ~/.mdx && python3 -m http.server 8000
open http://192.168.x.x:8000
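
The same idea can be sketched as a small Python helper, for cases where the server should be started from a script rather than the shell. This uses only the standard library. The name `make_server` is hypothetical, and binding to `0.0.0.0` is an assumption made so that other machines on the LAN can connect.

```python
import functools
import http.server
from pathlib import Path


def make_server(directory: str, host: str = "0.0.0.0", port: int = 8000):
    """Build an HTTP server that serves files from `directory`."""
    root = str(Path(directory).expanduser())
    # Bind the handler to the directory instead of changing the working dir.
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=root
    )
    return http.server.ThreadingHTTPServer((host, port), handler)
```

Calling `make_server("~/.mdx").serve_forever()` would then serve the output directory until interrupted with Ctrl+C.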

Download huggingface cache in a specific path:
mkdir ~/.hfcache && ln -s ~/.hfcache ~/.cache/huggingface
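
One caveat with the shell one-liner: if ~/.cache/huggingface already exists as a real directory, `ln -s` creates the link inside it instead of replacing it. A hedged Python sketch of the same step that checks for this case first follows. The function name `link_hf_cache` is hypothetical.

```python
from pathlib import Path


def link_hf_cache(target: str, link: str = "~/.cache/huggingface") -> Path:
    """Create `target` and make `link` a symlink pointing to it."""
    target_path = Path(target).expanduser()
    link_path = Path(link).expanduser()
    target_path.mkdir(parents=True, exist_ok=True)

    if link_path.is_symlink():
        # Already a symlink: leave it alone rather than guess the intent.
        return link_path
    if link_path.exists():
        # A real directory or file is in the way; do not link inside it.
        raise FileExistsError(f"{link_path} exists and is not a symlink")

    link_path.parent.mkdir(parents=True, exist_ok=True)
    link_path.symlink_to(target_path, target_is_directory=True)
    return link_path
```

For example, `link_hf_cache("~/.hfcache")` mirrors the command above. If the function raises `FileExistsError`, the existing cache directory has to be moved or removed by hand first.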

Previous Experiments

Legacy command-line interface and server (diffusers)

ComfyUI bridge for VS Code extension

History

↑ Add Gradio user-interface
↑ Rewritten in 300 lines
↑ Port to Linux
↑ Back to Diffusers
↑ Port to Code (webui)
↑ Change to ComfyUI API (webui)
↑ Created for personal use on Windows OS (diffusers)

"AI will bring us back to the age of terminals."

