
ComfyUI Text to Image Workflow

This guide will help you understand the concept of text-to-image in AI art generation and
complete a text-to-image workflow in ComfyUI.

This guide aims to introduce you to ComfyUI's text-to-image workflow and help you understand
the functionality and usage of various ComfyUI nodes.

In this document, we will:

Complete a text-to-image workflow
Gain a basic understanding of diffusion model principles
Learn about the functions and roles of workflow nodes
Get an initial understanding of the SD1.5 model

We'll start by running a text-to-image workflow, followed by explanations of related concepts.


Please choose the relevant sections based on your needs.

About Text to Image


Text to Image is a fundamental process in AI art generation that creates images from text
descriptions, with diffusion models at its core.

The text-to-image process requires the following elements:

Artist: The image generation model
Canvas: The latent space
Image Requirements (Prompts): Including positive prompts (elements you want in the image) and negative prompts (elements you don't want)

This text-to-image generation process can be simply understood as telling your requirements
(positive and negative prompts) to an artist (the image model), who then creates what you
want based on these requirements.

ComfyUI Text to Image Workflow Example Guide


1. Preparation
Ensure you have at least one SD1.5 model file in your ComfyUI/models/checkpoints folder, such as v1-5-pruned-emaonly-fp16.safetensors.
If you haven't installed it yet, please refer to the model installation section in Getting Started
with ComfyUI AI Art Generation.
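If you want to double-check from the command line first, here is an optional sketch. The path below assumes a default folder layout; adjust it to wherever your ComfyUI installation actually lives:

import os

# Hypothetical path; point this at your own ComfyUI installation
checkpoints_dir = os.path.expanduser("~/ComfyUI/models/checkpoints")

# List any checkpoint files found in the folder
for name in os.listdir(checkpoints_dir):
    if name.endswith((".safetensors", ".ckpt")):
        print("found checkpoint:", name)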

2. Loading the Text to Image Workflow


Download the image below and drag it into ComfyUI to load the workflow:

3. Loading the Model and Generating Your First Image


After installing the image model, follow the steps in the image below to load the model and generate your first image.
Follow these steps according to the image numbers:

1. In the Load Checkpoint node, use the arrows or click the text area to make sure v1-5-pruned-emaonly-fp16.safetensors is selected and that switching with the left/right arrows does not display a null value
2. Click the Queue button or use the shortcut Ctrl + Enter to run image generation

After the process completes, you should see the resulting image in the Save Image node interface; right-click it to save it locally.
If you're not satisfied with the result, try running the generation multiple times. Each time you run it, the KSampler constructs noise from a new value of the seed parameter, so each run produces a different result.

4. Start Experimenting
Try modifying the text in the CLIP Text Encoder nodes.
The node connected to the KSampler's Positive input carries the positive prompts, while the one connected to the Negative input carries the negative prompts.

Here are some basic prompting principles for the SD1.5 model:

Use English whenever possible
Separate prompts with English commas (,)
Use phrases rather than long sentences
Use specific descriptions
Use expressions like (golden hour:1.2) to increase the weight of specific keywords, making them more likely to appear in the image; 1.2 is the weight and golden hour is the keyword (a conceptual sketch of how weighting works follows this list)
Use keywords like masterpiece, best quality, 4k to improve generation quality
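The weighting syntax can be understood with a toy example. The sketch below is only a conceptual illustration with made-up numbers: the real text encoder produces 768-dimensional vectors and ComfyUI's actual weighting logic is more involved, but the core idea is that the weight scales the emphasized tokens' embeddings before they reach the sampler.

import numpy as np

# Toy embeddings: one 4-dimensional vector per token (illustrative values only)
token_embeddings = {
    "golden":   np.array([0.2, -0.1, 0.4, 0.3]),
    "hour":     np.array([0.1,  0.5, 0.0, 0.2]),
    "portrait": np.array([0.3,  0.1, 0.2, -0.2]),
}

def apply_weight(embeddings, emphasized_tokens, weight):
    # Scale the embeddings of the emphasized tokens, e.g. (golden hour:1.2)
    return {t: (v * weight if t in emphasized_tokens else v) for t, v in embeddings.items()}

weighted = apply_weight(token_embeddings, {"golden", "hour"}, 1.2)
print(weighted["golden"])  # the "golden" vector is now 1.2x stronger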

Here are several prompt examples you can try, or use your own prompts for generation:

1. Anime Style

Positive prompts:
anime style, 1girl with long pink hair, cherry blossom background, studio
ghibli aesthetic, soft lighting, intricate details

masterpiece, best quality, 4k

Negative prompts:

low quality, blurry, deformed hands, extra fingers

2. Realistic Style

Positive prompts:

(ultra realistic portrait:1.3), (elegant woman in crimson silk dress:1.2),
full body, soft cinematic lighting, (golden hour:1.2),
(fujifilm XT4:1.1), shallow depth of field,
(skin texture details:1.3), (film grain:1.1),
gentle wind flow, warm color grading, (perfect facial symmetry:1.3)

Negative prompts:

(deformed, cartoon, anime, doll, plastic skin, overexposed, blurry, extra fingers)

3. Specific Artist Style

Positive prompts:

fantasy elf, detailed character, glowing magic, vibrant colors, long flowing
hair, elegant armor, ethereal beauty, mystical forest, magical aura, high
detail, soft lighting, fantasy portrait, Artgerm style

Negative prompts:

blurry, low detail, cartoonish, unrealistic anatomy, out of focus, cluttered, flat lighting

Text to Image Working Principles


The entire text-to-image process can be understood as a reverse diffusion process. The v1-5-pruned-emaonly-fp16.safetensors file we downloaded is a pre-trained model that can generate target images from pure Gaussian noise. We only need to provide our prompts, and the model produces the target image by progressively denoising random noise.

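As a rough mental model of this reverse diffusion loop: start from pure noise, and at each step ask the model to predict the noise and remove a fraction of it, until an image emerges. The toy loop below illustrates only that iterative-refinement idea; it is not how Stable Diffusion actually computes its updates (a real model predicts the noise with a UNet conditioned on the prompts):

import numpy as np

rng = np.random.default_rng(seed=42)

target = np.array([1.0, -0.5, 0.25, 0.8])   # stands in for "the image the prompt describes"
latent = rng.standard_normal(4)              # start from pure Gaussian noise

steps = 20
for step in range(steps):
    predicted_noise = latent - target        # a real model *predicts* this from the latent + prompt
    latent = latent - predicted_noise / (steps - step)  # remove a fraction of the noise each step

print(np.round(latent, 3))  # after 20 steps the latent has converged toward the target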

We need to understand two concepts:

1. Latent Space: an abstract, compressed representation of images used by diffusion models. Converting images from pixel space to latent space reduces storage requirements, makes diffusion models easier to train, and lowers the complexity of denoising (see the size comparison after this list). It's like architects working on blueprints (latent space) instead of directly on the building (pixel space): the structural features are preserved while the cost of making changes drops dramatically.
2. Pixel Space: the space in which images are actually stored and displayed; it holds the pixel values of the final image we see.
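To make the compression concrete: SD1.5's VAE downsamples each spatial dimension by a factor of 8 and uses 4 latent channels, so a 512x512 RGB image becomes a 64x64x4 latent. A quick back-of-the-envelope comparison:

# Pixel space: a 512 x 512 RGB image
pixel_values = 512 * 512 * 3                   # 786,432 values

# Latent space: SD1.5's VAE downsamples by 8x and uses 4 channels
latent_values = (512 // 8) * (512 // 8) * 4    # 64 * 64 * 4 = 16,384 values

print(pixel_values / latent_values)            # the latent is 48x smaller than the pixel grid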

If you want to learn more about diffusion models, you can read these papers:

Denoising Diffusion Probabilistic Models (DDPM)
Denoising Diffusion Implicit Models (DDIM)
High-Resolution Image Synthesis with Latent Diffusion Models

ComfyUI Text to Image Workflow Node Explanation


A. Load Checkpoint Node

This node is typically used to load the image generation model. A checkpoint usually contains three components: MODEL (UNet), CLIP, and VAE (a sketch for inspecting these components follows the list below).

MODEL (UNet): The UNet model responsible for noise prediction and image generation during the diffusion process
CLIP: The text encoder that converts our text prompts into vectors that the model can understand, as the model cannot directly understand text prompts
VAE: The Variational AutoEncoder that converts images between pixel space and latent space, as diffusion models work in latent space while our images are in pixel space
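You can see these three components for yourself by inspecting the tensor names inside the .safetensors file. In the original SD1.x checkpoint layout, UNet weights are prefixed with model.diffusion_model., the CLIP text encoder with cond_stage_model., and the VAE with first_stage_model. The sketch below assumes the safetensors Python package is installed and that the file path matches your setup:

from collections import Counter
from safetensors import safe_open

# Adjust the path to your own checkpoints folder
path = "ComfyUI/models/checkpoints/v1-5-pruned-emaonly-fp16.safetensors"

with safe_open(path, framework="pt") as f:
    # Count tensors by their top-level prefix to reveal the UNet / CLIP / VAE groups
    prefixes = Counter(key.split(".")[0] for key in f.keys())

print(prefixes)
# Expect keys under model (UNet), cond_stage_model (CLIP) and first_stage_model (VAE)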

B. Empty Latent Image Node

Defines a latent space that is output to the KSampler node. The Empty Latent Image node creates a blank latent image, which the KSampler later fills with random noise based on the seed.

You can think of its function as defining the canvas size, which determines the dimensions of the final generated image.

C. CLIP Text Encoder Node


Used to encode prompts, which describe your requirements for the image.

The Positive condition input connected to the KSampler node represents positive
prompts (elements you want in the image)
The Negative condition input connected to the KSampler node represents negative
prompts (elements you don't want in the image)

The prompts are encoded into semantic vectors by the CLIP component from the Load Checkpoint node and output as conditions to the KSampler node.
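For reference, SD1.5 uses OpenAI's CLIP ViT-L/14 text encoder: the prompt is tokenized into (up to) 77 tokens and encoded into a 77x768 matrix of semantic vectors, and it is this matrix that flows into the KSampler as a condition. A sketch using the transformers library, assuming it and PyTorch are installed (ComfyUI itself loads the encoder from the checkpoint rather than from the Hugging Face hub):

import torch
from transformers import CLIPTokenizer, CLIPTextModel

# The same text encoder architecture that SD1.5 ships with
tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")

tokens = tokenizer(
    "anime style, 1girl with long pink hair, cherry blossom background",
    padding="max_length", max_length=77, truncation=True, return_tensors="pt",
)

with torch.no_grad():
    embeddings = text_encoder(**tokens).last_hidden_state

print(embeddings.shape)  # torch.Size([1, 77, 768]) -- 77 tokens, 768 dimensions each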

D. KSampler Node
The KSampler is the core of the entire workflow; this is where the noise denoising process takes place, ultimately outputting a latent space image.


Here's an explanation of the KSampler node parameters:


Parameter Name: Description / Function

model: The diffusion model used for denoising / Determines the style and quality of generated images
positive: Positive prompt condition encoding / Guides generation to include specified elements
negative: Negative prompt condition encoding / Suppresses unwanted content
latent_image: The latent space image to be denoised / Serves as the input carrier for noise initialization
seed: Random seed for noise generation / Controls generation randomness
control_after_generate: Seed control mode after generation / Determines how the seed changes between batch generations
steps: Number of denoising iterations / More steps mean finer details but longer processing time
cfg: Classifier-free guidance scale / Controls prompt constraint strength (too high leads to overfitting)
sampler_name: Sampling algorithm name / Determines the mathematical method for the denoising path
scheduler: Scheduler type / Controls the noise decay rate and step size allocation
denoise: Denoising strength coefficient / Controls the noise strength added to the latent space; 0.0 preserves the original input features, 1.0 is complete noise

In the KSampler node, the latent space is initialized with random noise constructed from the seed, and the semantic vectors for the Positive and Negative prompts are fed to the diffusion model as conditions.

Denoising is then performed for the number of iterations specified by the steps parameter. Each step removes part of the noise according to the denoising strength set by the denoise parameter, gradually producing a new latent space image.
E. VAE Decode Node


Converts the latent space image output from the KSampler into a pixel space image.

F. Save Image Node

Previews the image decoded from latent space and saves it to the local ComfyUI/output folder.
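Putting the nodes together: the same graph can be expressed in ComfyUI's API (JSON) workflow format and queued on a locally running ComfyUI instance over HTTP. The node IDs and parameter values below are a plausible sketch rather than the exact workflow you loaded in step 2; to get the precise JSON for your own graph, use ComfyUI's export-to-API option.

import json
import urllib.request

# A minimal text-to-image graph in ComfyUI's API format (node IDs are arbitrary labels)
workflow = {
    "4": {"class_type": "CheckpointLoaderSimple",
          "inputs": {"ckpt_name": "v1-5-pruned-emaonly-fp16.safetensors"}},
    "5": {"class_type": "EmptyLatentImage",
          "inputs": {"width": 512, "height": 512, "batch_size": 1}},
    "6": {"class_type": "CLIPTextEncode",
          "inputs": {"text": "anime style, 1girl with long pink hair, masterpiece, best quality",
                     "clip": ["4", 1]}},
    "7": {"class_type": "CLIPTextEncode",
          "inputs": {"text": "low quality, blurry, deformed hands, extra fingers",
                     "clip": ["4", 1]}},
    "3": {"class_type": "KSampler",
          "inputs": {"model": ["4", 0], "positive": ["6", 0], "negative": ["7", 0],
                     "latent_image": ["5", 0], "seed": 42, "steps": 20, "cfg": 7.0,
                     "sampler_name": "euler", "scheduler": "normal", "denoise": 1.0}},
    "8": {"class_type": "VAEDecode",
          "inputs": {"samples": ["3", 0], "vae": ["4", 2]}},
    "9": {"class_type": "SaveImage",
          "inputs": {"images": ["8", 0], "filename_prefix": "text2img"}},
}

# Queue the workflow on a locally running ComfyUI server (default port 8188)
request = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",
    data=json.dumps({"prompt": workflow}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
print(urllib.request.urlopen(request).read().decode("utf-8"))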

Introduction to SD1.5 Model


SD1.5 (Stable Diffusion 1.5) is an AI image generation model developed by Stability AI. It's the
foundational version of the Stable Diffusion series, trained on 512×512 resolution images,
making it particularly good at generating images at this resolution. With a size of about 4GB, it
runs smoothly on consumer-grade GPUs (e.g., 6GB VRAM). Currently, SD1.5 has a rich
ecosystem, supporting various plugins (like ControlNet, LoRA) and optimization tools.
As a milestone model in AI art generation, SD1.5 remains the best entry-level choice thanks to
its open-source nature, lightweight architecture, and rich ecosystem. Although newer versions
like SDXL/SD3 have been released, its value for consumer-grade hardware remains
unmatched.

Basic Information
Release Date: October 2022
Core Architecture: Based on Latent Diffusion Model (LDM)
Training Data: LAION-Aesthetics v2 5+ dataset (approximately 595k training steps at 512x512)
Open Source Features: Fully open-source model/code/training data

Advantages and Limitations


Model Advantages:

Lightweight: Small size, only about 4GB, runs smoothly on consumer GPUs
Low Entry Barrier: Supports a wide range of plugins and optimization tools
Mature Ecosystem: Extensive plugin and tool support
Fast Generation: Smooth operation on consumer GPUs

Model Limitations:

Detail Handling: Hands/complex lighting prone to distortion
Resolution Limits: Quality degrades for direct 1024x1024 generation
Prompt Dependency: Requires precise English descriptions for control
