Generative Image & Video Models

236 items · default last 14 days

v0.24.1

ComfyUI Releases (GitHub) 17h

v0.24.0

ComfyUI Releases (GitHub) yest

Ideogram 4.0 Day-0 Support in ComfyUI: Open Weights and Structured Control

ComfyUI Blog yest

May Wrapped

ComfyUI Blog Jun 1

Bringing Native Support for 3D Gaussian Splats into ComfyUI with TripoSplat

ComfyUI Blog Jun 1

v0.23.0

ComfyUI Releases (GitHub) Jun 1

Krea 2 Image is now available via Partner Nodes

ComfyUI Blog May 27

v0.22.3

ComfyUI Releases (GitHub) May 27

Z Image Turbo LoRA training experimentation.

r/StableDiffusion 1h

Lightricks to split into two companies as it cuts another 75 jobs

r/StableDiffusion 1h

Made a custom sampler (Akium), now available for both Forge and ComfyUI

r/StableDiffusion 1h

Flux.2 Klein Spectral Graft - a node for adding/removing object, clothes swapping, face swapping and more

r/StableDiffusion 2h

I generated 10 megapixels in a single shot with Ideogram 4.0… and it looks insane

r/StableDiffusion 2h

Ideogram generated a Gemini Watermark without being prompted to

r/StableDiffusion 2h

I didn't expect ideogram to be so good

r/StableDiffusion 3h

ComfyUI-PiD update: more backbones, workflows, and better low-VRAM support

r/StableDiffusion 7h

JoyAI-Echo video model released on HF

r/StableDiffusion 9h

hildegard - tiled upscaling and refining based on flux 2 klein

r/StableDiffusion 11h

Announcing Comfy Desktop: One App for every Comfy, rolling out 100% by Monday June 8

r/StableDiffusion 11h

CyberRealistic Z Image is an amazing checkpoint

r/StableDiffusion 14h

Testing Lens and Ideogram 4.0 with a bunch of my prompts

r/StableDiffusion 15h

Z-Image is unbelievably good at anime (Prompts Given)

r/StableDiffusion 17h

OK Ideogram 4.0 is Pretty Fun Actually!

r/StableDiffusion 18h

JoyAI-Echo - Large Scale LTX-2.3 finetune Model - Much better motions!

r/StableDiffusion yest

Fine-tuned SDXL model with LoRA to generate Tribal Indian art

r/StableDiffusion yest

ComfyUI node to compare multiple samplers and schedulers at once

r/StableDiffusion yest

Could you tell me why my post was removed? Which rule did it violate? Please specify. :(

r/StableDiffusion yest

Gotta call it, Cosmos3 Super need its "Anima moment"

r/StableDiffusion yest

Apparently Martin Scorsese uses Flux

r/StableDiffusion yest

On Ideogram 4 safety: Make sure it's not coming from the LLM, I used a local LLM and got 0 rejections on normal prompts

r/StableDiffusion yest

What do people use to keep likeness other than custom training loras and IPAdapters?

r/StableDiffusion yest

Ideogram safety filter is removed by using ExtendIntermediateSigmas node (a comfy native node) . use it before passing sigmas.

r/StableDiffusion yest

Sorry, not sorry (Ideogram jailbroken in 1 easy step)

r/StableDiffusion yest

People giving you crap because you prefer A1111 WebUI over Comfy, so you ask for a simple T2I workflow and they go "Here's a simple workflow" and then they hit you with this

r/StableDiffusion yest

Some Anime styles baked directly in the Anima model (style tags included)

r/StableDiffusion yest

Ideogram looks promising /s

r/StableDiffusion yest

Multiple characters Anima generations are so good. There is some bleeding but its only gonna get better

r/StableDiffusion yest

Ideogram 4.0 Just Open Sourced!

r/StableDiffusion yest

What would an open-source AI animation pipeline need to make solo anime pilots possible?

r/StableDiffusion yest

Adding audio to an existing video?

r/StableDiffusion yest

This is pleasant. SDXL/DMD-2 images, SEEDVR2, LTX-2.3, pieced together with Shotcut. Overall the whole thing took a couple days, just tweaking moments in Comfy, getting about 90 images together, cutting it down, ended up running 30 through LTX on a 3060 12GB/64GB - might get some vocals~

r/StableDiffusion yest

Flux klein9n misunderstands behind subject

r/StableDiffusion yest

Krea 2 will be open sourced soon

r/StableDiffusion Jun 3

Benchmarking local Stable Diffusion 1.5 generations on iPhone 17 - only 3 seconds per image

r/StableDiffusion Jun 3

Where No Man Has Gone Before: Lens - Flux.2 Klein 9b - Wan 2.2

r/StableDiffusion Jun 3

AutoCachedPreview

r/StableDiffusion Jun 3

JoyAI-Echo - Large Scale LTX-2.3 finetune for long form (5min) coherent stories.

r/StableDiffusion Jun 3

Why do people like flux2 klein edit so much?

r/StableDiffusion Jun 3

UPDATE NexusBTA v0.2.22 is out Ui with pre made Comfy Workflows

r/StableDiffusion Jun 2

Some Cosmic Fantasy Generations with Anima (Prompts Included)

r/StableDiffusion Jun 2

Geometrically consistent 360-degree scenes from single panoramas

r/StableDiffusion Jun 2

I compared 62 samplers and 16 schedulers for Z-Image Turbo and rated the image quality so you don't have to 😉

r/StableDiffusion Jun 2

Anima testing for complex scene

r/StableDiffusion Jun 2

v0.22.2

ComfyUI Releases (GitHub) May 22

Anyone test Bernini yet? What VRAM/RAM is already working?

r/StableDiffusion Jun 2

You can now make Mac generate high quality songs - ported Khala Music Ai to Apple Silicon

r/StableDiffusion Jun 2

Fizgig Klein 9b Lora Studio v1.2.4 - update targeting 16gb Card users

r/StableDiffusion Jun 2

Wan 2.2 with Audio works really well! Worflow included

r/StableDiffusion Jun 2

Beginner prompting Guide for LTX 2.3 : tips and tricks

r/StableDiffusion Jun 2

Bernini video test video edit

r/StableDiffusion Jun 2

Pallaidium: Omnimodal AI Movie Studio integrated in Blender

r/StableDiffusion Jun 2

GitHub - orion4d/Orion4D_anaglyph: Orion4D Anaglyph** is a high-performance custom node designed to transform 2D images into stereoscopic (3D) renders via a depth map. It offers total control over parallax, convergence, and depth processing to ensure optimal visual comfort

r/StableDiffusion Jun 2

PSA: If you HAVENT switched from AI Toolkit to One Trainer...

r/StableDiffusion Jun 2

Comfyui v0.23.0 Support NVIDIA PixelDiT and PiD (CORE-201) by @kijai in #14103

r/StableDiffusion Jun 2

v0.22.1

ComfyUI Releases (GitHub) May 21

CivitAI Adult Content Models for ComfyUI

r/StableDiffusion Jun 1

Cosmos3-Super-Image2Video running locally on a single RTX PRO 6000 96GB

r/StableDiffusion Jun 1

I've been trying to replicate this Anima V1.0 image all day but I can't manage it. Maybe I need a special workflow or something?

r/StableDiffusion Jun 1

Time Travel with LTX 2.3

r/StableDiffusion Jun 1

Cosmos3 Nano testing with vllm-omni

r/StableDiffusion Jun 1

I compared 62 samplers and 16 schedulers for WAN 2.1 image generation and rated the image quality so you don't have to 😬

r/StableDiffusion Jun 1

Local AI News You Missed - May 2026

r/StableDiffusion Jun 1

FLUX.2 Klein 9B Schematic LoRA - Depth, Normal, Pose, and Segmentation

r/StableDiffusion Jun 1

Anima with dark style anime lora is pretty good. Tried with some Sailor girls.

r/StableDiffusion Jun 1

Bernini released. Unified Video generation and editing model. Built on Wan-2.2

r/StableDiffusion Jun 1

The Cosmos omnimodel family of models - 3 variants Edge(4B) , Nano(16B) , Super (64B)

r/StableDiffusion Jun 1

An AI-generated short film I spent weeks creating.

r/StableDiffusion Jun 1

Nvidia releases Cosmos3-Super-Image2Video . 64B parametres

r/StableDiffusion Jun 1

Nvidia releasesCosmos3-Super-Text2Image model . 64 billion paramteres

r/StableDiffusion Jun 1

FLUX.2-klein-base-9B ControlLight LoRA Release for changing lighting of a photo

r/StableDiffusion Jun 1

Stable Audio 3.0 Day-0 Support in ComfyUI：From Sound Effects to Longer, More Musical Tracks

ComfyUI Blog May 21

lora dataset images and captions

r/StableDiffusion Jun 1

ai clean up photo request

r/StableDiffusion Jun 1

Perceptual LoRA Toolkit now supports Z-Image Turbo

r/StableDiffusion May 31

Best anime model for multiple characters or Lora?

r/StableDiffusion May 31

need desperate help

r/StableDiffusion May 31

PIT NVIDIA vs SeedVR2

r/StableDiffusion May 31

I can't get Flux 2 klein 9 base to work

r/StableDiffusion May 31

Is there a WAN 2.2 version of VACE?

r/StableDiffusion May 31

Enterprise Solutions

Stability AI News May 31

Meet Stable Audio 3.0, the model family built for artistic experimentation with open-weight models

Stability AI News May 31

Introducing Brand Studio: The creative production platform powered by your brand

Stability AI News May 31

Stability AI Joins the Tech Coalition

Stability AI News May 31

Warner Music Group and Stability AI Join Forces To Build The Next Generation Of Responsible AI Tools For Music Creation

Stability AI News May 31

Universal Music Group and Stability AI Announce Strategic Alliance to Co-Develop Professional AI Music Creation Tools

Stability AI News May 31

Stability AI and EA Partner to Empower Artists, Designers, and Developers to Reimagine Game Development

Stability AI News May 31

Stability AI Brings Image Services to Amazon Bedrock, Delivering End-to-End Creative Control with Enterprise-Grade Infrastructure

Stability AI News May 31

Stability AI’s Annual Integrity Transparency Report

Stability AI News May 31

Stability AI Introduces Stable Audio 2.5, the First Audio Model Built for Enterprise Sound Production at Scale

Stability AI News May 31

Stability AI and NVIDIA Bring Faster Performance and Simplified Enterprise Deployment with the Stable Diffusion 3.5 NIM

Stability AI News May 31

Introducing Stability AI Solutions: Generative AI Solutions to Accelerate Enterprise Creative Production

Stability AI News May 31

Damn Anima Base is cooking! What's your favorite lora?

r/StableDiffusion May 31

is it impossible to train lora on Microsoft lens?..

r/StableDiffusion May 31

ComfyUI_HYWorld2 update. Quality improvement + World Stereo Light models!

r/StableDiffusion May 31

Does anyone else can't stand ComfyUI and prefers classic Automatic/Forge UI or it's just me?

r/StableDiffusion May 31

Guidance on building 2D image to 3D image Diffusion model [D]

r/StableDiffusion May 31

v0.22.0

ComfyUI Releases (GitHub) May 20

Tried capturing that classic SF3 Bengus/Akiman/Ikeno art style in ComfyUI. Anyone else miss this vibe?

r/StableDiffusion May 31

Flux Identity Adjuster V2

r/StableDiffusion May 31

Renting a GPU for use with a service like a runpod has become prohibitively expensive. The last time I rented one was about 3 months ago. The price for a 4090 was $5 per day for 25. The hourly rate for a 5090 was higher than for an A100 about 3 months ago.

r/StableDiffusion May 31

How can I force Z-Image to create full-body portraits?

r/StableDiffusion May 31

Do you think AMD/ROCm has a future where it's viable to use with ComfyUI, etc., and can be real competition against Nvidia? Or will Nvidia/CUDA simply remain the only compatible option in the short to medium term? Would it be better to buy Nvidia now, wait for the RTX 6000 series, or give up on AMD?

r/StableDiffusion May 31

What image model should I use as somebody who likes the aesthetic of Midjourney and diverse outputs? 16 GB VRAM, 64 GB RAM

r/StableDiffusion May 31

Bonsai Image 4B, a pair of low-bit diffusion transformer deployments built from FLUX.2 Klein 4B .

r/StableDiffusion May 31

Python Grid push for 1536x768 - can throw together simple storyboard rough draft, springboard for ideas, imho - simple script in comments. These images can hit 12000x8000 at 100MB+ scaled down for this post.

r/StableDiffusion May 31

Anima prompt skill systempromt

r/StableDiffusion May 31

Help Needed: How to create this type of art in Stable Diffusion? (Models, LoRA & settings)

r/StableDiffusion May 30

New Open-Source Models Now in ComfyUI: VOID, BiRefNet & Gemma 4

ComfyUI Blog May 14

v0.21.1

ComfyUI Releases (GitHub) May 13

Tripo 3.1 in ComfyUI: production-ready, high-detail 3D asset generation.

ComfyUI Blog May 11

v0.21.0

ComfyUI Releases (GitHub) May 11

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding