Generative Image & Video Models

236 items · default last 14 days

v0.24.1 ComfyUI Releases (GitHub) 17h
v0.24.0 ComfyUI Releases (GitHub) yest
Ideogram 4.0 Day-0 Support in ComfyUI: Open Weights and Structured Control ComfyUI Blog yest
May Wrapped ComfyUI Blog Jun 1
Bringing Native Support for 3D Gaussian Splats into ComfyUI with TripoSplat ComfyUI Blog Jun 1
v0.23.0 ComfyUI Releases (GitHub) Jun 1
Krea 2 Image is now available via Partner Nodes ComfyUI Blog May 27
v0.22.3 ComfyUI Releases (GitHub) May 27
Z Image Turbo LoRA training experimentation. r/StableDiffusion 1h
Lightricks to split into two companies as it cuts another 75 jobs r/StableDiffusion 1h
Made a custom sampler (Akium), now available for both Forge and ComfyUI r/StableDiffusion 1h
Flux.2 Klein Spectral Graft - a node for adding/removing object, clothes swapping, face swapping and more r/StableDiffusion 2h
I generated 10 megapixels in a single shot with Ideogram 4.0… and it looks insane r/StableDiffusion 2h
Ideogram generated a Gemini Watermark without being prompted to r/StableDiffusion 2h
I didn't expect ideogram to be so good r/StableDiffusion 3h
ComfyUI-PiD update: more backbones, workflows, and better low-VRAM support r/StableDiffusion 7h
JoyAI-Echo video model released on HF r/StableDiffusion 9h
hildegard - tiled upscaling and refining based on flux 2 klein r/StableDiffusion 11h
Announcing Comfy Desktop: One App for every Comfy, rolling out 100% by Monday June 8 r/StableDiffusion 11h
CyberRealistic Z Image is an amazing checkpoint r/StableDiffusion 14h
Testing Lens and Ideogram 4.0 with a bunch of my prompts r/StableDiffusion 15h
Z-Image is unbelievably good at anime (Prompts Given) r/StableDiffusion 17h
OK Ideogram 4.0 is Pretty Fun Actually! r/StableDiffusion 18h
JoyAI-Echo - Large Scale LTX-2.3 finetune Model - Much better motions! r/StableDiffusion yest
Fine-tuned SDXL model with LoRA to generate Tribal Indian art r/StableDiffusion yest
ComfyUI node to compare multiple samplers and schedulers at once r/StableDiffusion yest
Could you tell me why my post was removed? Which rule did it violate? Please specify. :( r/StableDiffusion yest
Gotta call it, Cosmos3 Super need its "Anima moment" r/StableDiffusion yest
Apparently Martin Scorsese uses Flux r/StableDiffusion yest
On Ideogram 4 safety: Make sure it's not coming from the LLM, I used a local LLM and got 0 rejections on normal prompts r/StableDiffusion yest
What do people use to keep likeness other than custom training loras and IPAdapters? r/StableDiffusion yest
Ideogram safety filter is removed by using ExtendIntermediateSigmas node (a comfy native node). Use it before passing sigmas. r/StableDiffusion yest
Sorry, not sorry (Ideogram jailbroken in 1 easy step) r/StableDiffusion yest
People giving you crap because you prefer A1111 WebUI over Comfy, so you ask for a simple T2I workflow and they go "Here's a simple workflow" and then they hit you with this r/StableDiffusion yest
Some Anime styles baked directly in the Anima model (style tags included) r/StableDiffusion yest
Ideogram looks promising /s r/StableDiffusion yest
Multiple characters Anima generations are so good. There is some bleeding but its only gonna get better r/StableDiffusion yest
Ideogram 4.0 Just Open Sourced! r/StableDiffusion yest
What would an open-source AI animation pipeline need to make solo anime pilots possible? r/StableDiffusion yest
Adding audio to an existing video? r/StableDiffusion yest
This is pleasant. SDXL/DMD-2 images, SEEDVR2, LTX-2.3, pieced together with Shotcut. Overall the whole thing took a couple days, just tweaking moments in Comfy, getting about 90 images together, cutting it down, ended up running 30 through LTX on a 3060 12GB/64GB - might get some vocals~ r/StableDiffusion yest
Flux klein9n misunderstands behind subject r/StableDiffusion yest
Krea 2 will be open sourced soon r/StableDiffusion Jun 3
Benchmarking local Stable Diffusion 1.5 generations on iPhone 17 - only 3 seconds per image r/StableDiffusion Jun 3
Where No Man Has Gone Before: Lens - Flux.2 Klein 9b - Wan 2.2 r/StableDiffusion Jun 3
AutoCachedPreview r/StableDiffusion Jun 3
JoyAI-Echo - Large Scale LTX-2.3 finetune for long form (5min) coherent stories. r/StableDiffusion Jun 3
Why do people like flux2 klein edit so much? r/StableDiffusion Jun 3
UPDATE NexusBTA v0.2.22 is out Ui with pre made Comfy Workflows r/StableDiffusion Jun 2
Some Cosmic Fantasy Generations with Anima (Prompts Included) r/StableDiffusion Jun 2
Geometrically consistent 360-degree scenes from single panoramas r/StableDiffusion Jun 2
I compared 62 samplers and 16 schedulers for Z-Image Turbo and rated the image quality so you don't have to 😉 r/StableDiffusion Jun 2
Anima testing for complex scene r/StableDiffusion Jun 2
v0.22.2 ComfyUI Releases (GitHub) May 22
Anyone test Bernini yet? What VRAM/RAM is already working? r/StableDiffusion Jun 2
You can now make Mac generate high quality songs - ported Khala Music Ai to Apple Silicon r/StableDiffusion Jun 2
Fizgig Klein 9b Lora Studio v1.2.4 - update targeting 16gb Card users r/StableDiffusion Jun 2
Wan 2.2 with Audio works really well! Workflow included r/StableDiffusion Jun 2
Beginner prompting Guide for LTX 2.3 : tips and tricks r/StableDiffusion Jun 2
Bernini video test video edit r/StableDiffusion Jun 2
Pallaidium: Omnimodal AI Movie Studio integrated in Blender r/StableDiffusion Jun 2
GitHub - orion4d/Orion4D_anaglyph: Orion4D Anaglyph is a high-performance custom node designed to transform 2D images into stereoscopic (3D) renders via a depth map. It offers total control over parallax, convergence, and depth processing to ensure optimal visual comfort r/StableDiffusion Jun 2
PSA: If you HAVENT switched from AI Toolkit to One Trainer... r/StableDiffusion Jun 2
Comfyui v0.23.0 Support NVIDIA PixelDiT and PiD (CORE-201) by @kijai in #14103 r/StableDiffusion Jun 2
v0.22.1 ComfyUI Releases (GitHub) May 21
CivitAI Adult Content Models for ComfyUI r/StableDiffusion Jun 1
Cosmos3-Super-Image2Video running locally on a single RTX PRO 6000 96GB r/StableDiffusion Jun 1
I've been trying to replicate this Anima V1.0 image all day but I can't manage it. Maybe I need a special workflow or something? r/StableDiffusion Jun 1
Time Travel with LTX 2.3 r/StableDiffusion Jun 1
Cosmos3 Nano testing with vllm-omni r/StableDiffusion Jun 1
I compared 62 samplers and 16 schedulers for WAN 2.1 image generation and rated the image quality so you don't have to 😬 r/StableDiffusion Jun 1
Local AI News You Missed - May 2026 r/StableDiffusion Jun 1
FLUX.2 Klein 9B Schematic LoRA - Depth, Normal, Pose, and Segmentation r/StableDiffusion Jun 1
Anima with dark style anime lora is pretty good. Tried with some Sailor girls. r/StableDiffusion Jun 1
Bernini released. Unified Video generation and editing model. Built on Wan-2.2 r/StableDiffusion Jun 1
The Cosmos omnimodel family of models - 3 variants: Edge (4B), Nano (16B), Super (64B) r/StableDiffusion Jun 1
An AI-generated short film I spent weeks creating. r/StableDiffusion Jun 1
Nvidia releases Cosmos3-Super-Image2Video. 64B parameters r/StableDiffusion Jun 1
Nvidia releases Cosmos3-Super-Text2Image model. 64 billion parameters r/StableDiffusion Jun 1
FLUX.2-klein-base-9B ControlLight LoRA Release for changing lighting of a photo r/StableDiffusion Jun 1
Stable Audio 3.0 Day-0 Support in ComfyUI: From Sound Effects to Longer, More Musical Tracks ComfyUI Blog May 21
lora dataset images and captions r/StableDiffusion Jun 1
ai clean up photo request r/StableDiffusion Jun 1
Perceptual LoRA Toolkit now supports Z-Image Turbo r/StableDiffusion May 31
Best anime model for multiple characters or Lora? r/StableDiffusion May 31
need desperate help r/StableDiffusion May 31
PIT NVIDIA vs SeedVR2 r/StableDiffusion May 31
I can't get Flux 2 klein 9 base to work r/StableDiffusion May 31
Is there a WAN 2.2 version of VACE? r/StableDiffusion May 31
Enterprise Solutions Stability AI News May 31
Meet Stable Audio 3.0, the model family built for artistic experimentation with open-weight models Stability AI News May 31
Introducing Brand Studio: The creative production platform powered by your brand Stability AI News May 31
Stability AI Joins the Tech Coalition Stability AI News May 31
Warner Music Group and Stability AI Join Forces To Build The Next Generation Of Responsible AI Tools For Music Creation Stability AI News May 31
Universal Music Group and Stability AI Announce Strategic Alliance to Co-Develop Professional AI Music Creation Tools Stability AI News May 31
Stability AI and EA Partner to Empower Artists, Designers, and Developers to Reimagine Game Development Stability AI News May 31
Stability AI Brings Image Services to Amazon Bedrock, Delivering End-to-End Creative Control with Enterprise-Grade Infrastructure Stability AI News May 31
Stability AI’s Annual Integrity Transparency Report Stability AI News May 31
Stability AI Introduces Stable Audio 2.5, the First Audio Model Built for Enterprise Sound Production at Scale Stability AI News May 31
Stability AI and NVIDIA Bring Faster Performance and Simplified Enterprise Deployment with the Stable Diffusion 3.5 NIM Stability AI News May 31
Introducing Stability AI Solutions: Generative AI Solutions to Accelerate Enterprise Creative Production Stability AI News May 31
Damn Anima Base is cooking! What's your favorite lora? r/StableDiffusion May 31
is it impossible to train lora on Microsoft lens?.. r/StableDiffusion May 31
ComfyUI_HYWorld2 update. Quality improvement + World Stereo Light models! r/StableDiffusion May 31
Does anyone else can't stand ComfyUI and prefers classic Automatic/Forge UI or it's just me? r/StableDiffusion May 31
Guidance on building 2D image to 3D image Diffusion model [D] r/StableDiffusion May 31
v0.22.0 ComfyUI Releases (GitHub) May 20
Tried capturing that classic SF3 Bengus/Akiman/Ikeno art style in ComfyUI. Anyone else miss this vibe? r/StableDiffusion May 31
Flux Identity Adjuster V2 r/StableDiffusion May 31
Renting a GPU for use with a service like a runpod has become prohibitively expensive. The last time I rented one was about 3 months ago. The price for a 4090 was $5 per day for 25. The hourly rate for a 5090 was higher than for an A100 about 3 months ago. r/StableDiffusion May 31
How can I force Z-Image to create full-body portraits? r/StableDiffusion May 31
Do you think AMD/ROCm has a future where it's viable to use with ComfyUI, etc., and can be real competition against Nvidia? Or will Nvidia/CUDA simply remain the only compatible option in the short to medium term? Would it be better to buy Nvidia now, wait for the RTX 6000 series, or give up on AMD? r/StableDiffusion May 31
What image model should I use as somebody who likes the aesthetic of Midjourney and diverse outputs? 16 GB VRAM, 64 GB RAM r/StableDiffusion May 31
Bonsai Image 4B, a pair of low-bit diffusion transformer deployments built from FLUX.2 Klein 4B . r/StableDiffusion May 31
Python Grid push for 1536x768 - can throw together simple storyboard rough draft, springboard for ideas, imho - simple script in comments. These images can hit 12000x8000 at 100MB+ scaled down for this post. r/StableDiffusion May 31
Anima prompt skill systempromt r/StableDiffusion May 31
Help Needed: How to create this type of art in Stable Diffusion? (Models, LoRA & settings) r/StableDiffusion May 30
New Open-Source Models Now in ComfyUI: VOID, BiRefNet & Gemma 4 ComfyUI Blog May 14
v0.21.1 ComfyUI Releases (GitHub) May 13
Tripo 3.1 in ComfyUI: production-ready, high-detail 3D asset generation. ComfyUI Blog May 11
v0.21.0 ComfyUI Releases (GitHub) May 11
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding arXiv cs.CV (Computer Vision) 8h
NIV: Neural Axis Variations for Variable Font Generation arXiv cs.CV (Computer Vision) 8h
Personal AI Agent for Camera Roll VQA arXiv cs.CV (Computer Vision) 8h
Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation arXiv cs.CV (Computer Vision) 8h
TopoPult-SSL: Gland-Mask-Free Cross-Device Meibomian Gland Segmentation via Self-Distilled Weak Clinical Priors arXiv cs.CV (Computer Vision) 8h
LightVesselNet: An Ultra-Lightweight Sub-100K Parameter Network for Retinal Blood Vessel Segmentation arXiv cs.CV (Computer Vision) 8h
Recovering Physically Plausible Human-Object Interactions from Monocular Videos arXiv cs.CV (Computer Vision) 8h
Biomazon: A Multimodal Dataset for 3D Forest Structure and Biomass Modeling in the Amazon Basin arXiv cs.CV (Computer Vision) 8h
Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography arXiv cs.CV (Computer Vision) 8h
Deep Learning-assisted AMD Staging based on OCT and OCT Angiography arXiv cs.CV (Computer Vision) 8h
UniPixie: Unified and Probabilistic 3D Physics Learning via Flow Matching arXiv cs.CV (Computer Vision) 8h
Would you still call this Dax? Novel Visual References in VLMs and Humans arXiv cs.CV (Computer Vision) 8h
Disentangled Fine-Grained Prototype Learning for Incomplete Image-Tabular Classification arXiv cs.CV (Computer Vision) 8h
Horse Eye Blink Detection and Classification for Equine Affective State Assessment arXiv cs.CV (Computer Vision) 8h
ORACLE-CT: Anatomy-Aware Support Pooling for CT Classification arXiv cs.CV (Computer Vision) 8h
How to prompt Grok Imagine Video 1.5 Replicate Blog May 21
Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation arXiv cs.CV (Computer Vision) yest
Weakly Supervised Incremental Segmentation via Semantic Anchors and Spatial Arbitration arXiv cs.CV (Computer Vision) yest
Intra-Modal Neighbors Never Lie: Rectifying Inter-Modal Noisy Correspondence via Graph-Based Intra-Modal Reasoning arXiv cs.CV (Computer Vision) yest
Optimal Transport Flow Matching by Design arXiv cs.CV (Computer Vision) yest
When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection arXiv cs.CV (Computer Vision) yest
Reflection Separation from a Single Image via Joint Latent Diffusion arXiv cs.CV (Computer Vision) yest
Pinpoint: Grounded Worldwide Image Geolocation via Cross-Source Retrieval and Reranking arXiv cs.CV (Computer Vision) yest
End-to-End Text Line Detection and Ordering arXiv cs.CV (Computer Vision) yest
GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs arXiv cs.CV (Computer Vision) yest
Spatial Artifact Coherence Determines Codec Robustness in Patch-Based rPPG arXiv cs.CV (Computer Vision) yest
Overview of the EReL@MIR 2025 Multimodal Document Retrieval Challenge (Track 1) arXiv cs.CV (Computer Vision) yest
Prospective Dynamic 3D MRI Reconstruction via Latent-Space Motion Tracking from Single Measurement arXiv cs.CV (Computer Vision) yest
SBP-Net: Learning Thin Structure Reconstruction with Sliding-Box Projections arXiv cs.CV (Computer Vision) yest
UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation arXiv cs.CV (Computer Vision) yest
StandardE2E: A Unified Framework for End-to-End Autonomous Driving Datasets arXiv cs.CV (Computer Vision) yest
fal and AWS: Building for the Next Phase of Generative Media fal.ai Blog May 19
Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think. VentureBeat — AI May 19
COD10K-C: Benchmarking Robustness of Camouflaged Object Detection Under Natural Image Corruptions arXiv cs.CV (Computer Vision) Jun 3
AVTrack: Audio-Visual Tracking in Human-centric Complex Scenes arXiv cs.CV (Computer Vision) Jun 3
Consistent Yet Wrong: Evidence Insensitivity in Spatial Vision-Language Models arXiv cs.CV (Computer Vision) Jun 3
Plan2Map: A Multimodal Benchmark for Document-Grounded Geospatial Boundary Reconstruction from Planning Records arXiv cs.CV (Computer Vision) Jun 3
MetaWorld: Scaling Multi-Agent Video World Model from Single-view Video Data arXiv cs.CV (Computer Vision) Jun 3
From Local Training to Large-Scale Mapping: A Comparative Assessment of Machine Learning and Deep Learning for Transferable Satellite-Derived Bathymetry arXiv cs.CV (Computer Vision) Jun 3
GeoDrive-Bench: Benchmarking Region-Specific Multimodal Reasoning in Autonomous Driving arXiv cs.CV (Computer Vision) Jun 3
Diagnosis of Human Object Interaction Detectors for Real World Educational Applications arXiv cs.CV (Computer Vision) Jun 3
Cosmos 3: Omnimodal World Models for Physical AI arXiv cs.CV (Computer Vision) Jun 3
Automated Report-Derived Oncology VQA Benchmark for Evaluating Vision-Language Models on 3D Medical Imaging arXiv cs.CV (Computer Vision) Jun 3
Principled Reflection Separation via Nonlinear Superposition and Feature Interaction arXiv cs.CV (Computer Vision) Jun 3
Pathway-Structured Privileged Distillation for Deployable Computational Pathology arXiv cs.CV (Computer Vision) Jun 3
Tiny Collaborative Inference for Occlusion-Robust Object Detection arXiv cs.CV (Computer Vision) Jun 3
Any2Poster: Any-Source Poster Generation Across Modalities and Domains arXiv cs.CV (Computer Vision) Jun 3
Pixel Cube: Diffusion-based Portrait Video Relighting Through Realistic Lighting Reproduction arXiv cs.CV (Computer Vision) Jun 3
v0.20.3 ComfyUI Releases (GitHub) May 8
"Reality is so boring" – Krea Podcast with Boldtron, Serialcut, and Remembering_orion Krea AI News Jun 2
DefocusTrackerAI -- A Generalized Framework for the Automatic Detection of Defocused Particle Images arXiv cs.CV (Computer Vision) Jun 2
Improved Belief-Attention in Vision Task arXiv cs.CV (Computer Vision) Jun 2
Flow-Based Generative Modeling for Optimizing Sampling Policies in Compressed Sensing Applications arXiv cs.CV (Computer Vision) Jun 2
Planktonzilla: Multimodal dataset and models for understanding plankton ecosystems arXiv cs.CV (Computer Vision) Jun 2
Structured Visual Evidence Decomposition for Evidence-Grounded Multimodal Screening of Obstructive Sleep Apnea-Hypopnea Syndrome arXiv cs.CV (Computer Vision) Jun 2
Aligning Cellular Sheaves with Classifier Attention for Interpretable Weakly-Supervised Pathology Localization arXiv cs.CV (Computer Vision) Jun 2
Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry arXiv cs.CV (Computer Vision) Jun 2
Bridging the 2D-3D Gap: A Hierarchical Semantic-Geometric Map for Vision Language Navigation arXiv cs.CV (Computer Vision) Jun 2
Diversity Over Frequency: Rethinking Tool Use in Visual Chain-of-Thought Agents arXiv cs.CV (Computer Vision) Jun 2
Segmentation-Guided Spatial Indexing for Generalizable and Explainable Deepfake Detection arXiv cs.CV (Computer Vision) Jun 2
CoilDrop-MRI: Self-supervised physics-guided MRI reconstruction with coil dropout arXiv cs.CV (Computer Vision) Jun 2
CoCoVideo: The High-Quality Commercial-Model-Based Contrastive Benchmark for AI-Generated Video Detection arXiv cs.CV (Computer Vision) Jun 2
Visual-Noise Guided In-Context Distillation for Multimodal Large Language Model Unlearning arXiv cs.CV (Computer Vision) Jun 2
VDSB-GWSyn: Diffusion Schrödinger Bridge for Controllable and Anatomically Feasible Guidewire Synthesis in Coronary Angiography arXiv cs.CV (Computer Vision) Jun 2
General Covariant Action Modeling: Constructing Generalized Manifolds via Spatio-Temporal Decoupling arXiv cs.CV (Computer Vision) Jun 2
Lightweight SAR Ship Detection via Contrastive Distillation arXiv cs.CV (Computer Vision) Jun 1
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer arXiv cs.CV (Computer Vision) Jun 1
DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution arXiv cs.CV (Computer Vision) Jun 1
Mitigating Content Shift and Hallucination in GenAI Image Editing via Structural Refinement arXiv cs.CV (Computer Vision) Jun 1
Dex2HOI: Dexterous Bimanual Two-Object Interaction Generation arXiv cs.CV (Computer Vision) Jun 1
Clustering Guided Domain-Specific Pretrained Foundation Model Very High-Resolution Arctic Remote Sensing arXiv cs.CV (Computer Vision) Jun 1
A Novel Global Context-aware Deep Neural Network for Enhanced Brain Tumor Segmentation using Magnetic Resonance Images arXiv cs.CV (Computer Vision) Jun 1
OmniMem: Scalable and Adaptive Memory Retrieval for Long Video Generation arXiv cs.CV (Computer Vision) Jun 1
On-Device Generative AI for GDPR-Compliant Visual Monitoring: Natural Language Alerts from Local Object Detection arXiv cs.CV (Computer Vision) Jun 1
Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)? arXiv cs.CV (Computer Vision) Jun 1
VLM3: Vision Language Models Are Native 3D Learners arXiv cs.CV (Computer Vision) Jun 1
Prior Availability in Industrial Visual Sim-to-Real: A Review of CAD-Guided and CAD-Unavailable Regimes arXiv cs.CV (Computer Vision) Jun 1
ReGuLaR: Relation-Grounded Latent Reasoning for Large Vision-Language Models arXiv cs.CV (Computer Vision) Jun 1
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs arXiv cs.CV (Computer Vision) Jun 1
Controllable Lung Nodule Synthesis via Histogram-Regularized Latent Diffusion Models arXiv cs.CV (Computer Vision) Jun 1
Luma Uni-1 is now available via Partner Nodes ComfyUI Blog May 5
Building long-term trust in a world where creation moves at the speed of thought fal.ai Blog May 15
April Wrapped ComfyUI Blog May 4
We are partnering with Henning Larsen Krea AI News May 29
v0.20.2 ComfyUI Releases (GitHub) May 3
LoRAs for Krea 2 Are Coming to Train Tool Krea AI News May 29
Microsoft Lens in ComfyUI - Small but Powerful! Nerdy Rodent (YouTube) May 28
Moodboard Gallery in Krea 2 + Preview of Random and Auto Krea AI News May 28
Krea 2 API Krea AI News May 27
Diffusers 0.38.0: New image and audio pipelines, Core library improvements, and more Hugging Face diffusers Releases May 1
HappyHorse 1.0 is Now Available in ComfyUI ComfyUI Blog Apr 27
v0.20.1 ComfyUI Releases (GitHub) Apr 27
v0.20.0 ComfyUI Releases (GitHub) Apr 27
Krea 2 deep dive: exploration, style references, and moodboards Krea AI News May 21
Make High Quality Music in ComfyUI - Low VRAM! Nerdy Rodent (YouTube) May 21
Krea 2 LoRA training is now available Krea AI News May 21
ComfyUI raises $30M to scale open-source AI for creative production ComfyUI Blog Apr 24
Unlock Virtual Portrait & Real Human Video Generation with Seedance 2.0 in ComfyUI ComfyUI Blog Apr 24
GPT Image 2 is now here via Partner Nodes ComfyUI Blog Apr 22
Quiver: Structured SVG generation in ComfyUI ComfyUI Blog Apr 20
Using HiDream-O1 Natively in ComfyUI Nerdy Rodent (YouTube) May 15
Mood boards in Krea 2 Krea AI News May 15
ACE-Step 1.5 XL: Commercial-Grade Music Generation in ComfyUI ComfyUI Blog Apr 17
Introducing Krea 2 Krea AI News May 12
Krea 2 Large shines in style fidelity Krea AI News May 12
ERNIE-Image Support in ComfyUI: Precise Text Rendering and Structured Image Generation ComfyUI Blog Apr 15
ComfyUI Now Supports Sonilo via Partner Nodes ComfyUI Blog Apr 14
Seedance 2.0 is Now Available in ComfyUI ComfyUI Blog Apr 13
Z-Anime: Finetuned for Anime Styled Images in ComfyUI Nerdy Rodent (YouTube) May 9
LTX 2.3 Workflow Powerup - AI Video In-Painting & LoRA fun! Nerdy Rodent (YouTube) May 2
How to make remarkable videos with Seedance 2.0 Replicate Blog Apr 15
Introducing PATINA fal.ai Blog Apr 10
LTX 2.3 - Improved AI Videos & Extensions in ComfyUI! Nerdy Rodent (YouTube) Apr 24
ERNIE-Image Turbo Beats Z-Image-Turbo in Benchmarks?! Nerdy Rodent (YouTube) Apr 17
ACE-Step 1.5 XL = Free Music Generation in ComfyUI! Nerdy Rodent (YouTube) Apr 11
