clip-guided stable diffusion correctness #596

### Describe the bug

The CLIP-guided pipeline uses these text embeddings for guidance:

https://github.com/huggingface/diffusers/blob/2345481c0e21f1bd84c0d85b866b57d34506d836/examples/community/clip_guided_stable_diffusion.py#L285-L288

defined earlier as:

https://github.com/huggingface/diffusers/blob/2345481c0e21f1bd84c0d85b866b57d34506d836/examples/community/clip_guided_stable_diffusion.py#L234-L237

which I read as using the _unconditional_ (i.e. null-prompt) embeddings: `uncond_embeddings` comes first in the concatenation, and `chunk(2)[0]` takes the first half.
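A minimal sketch of that reading, using plain Python lists in place of tensors. The `chunk` helper here is a hypothetical stand-in I wrote for illustration; it is only meant to mirror how `torch.Tensor.chunk` splits along the first dimension when the batch divides evenly, and the string labels are placeholders for real embeddings.

```python
def chunk(batch, chunks):
    """Hypothetical stand-in for torch.Tensor.chunk along dim 0.

    Assumes len(batch) is evenly divisible by `chunks`.
    """
    size = len(batch) // chunks
    return [batch[i * size:(i + 1) * size] for i in range(chunks)]


# Placeholder "embeddings": labels instead of real tensors.
uncond_embeddings = ["uncond_0", "uncond_1"]
text_embeddings = ["text_0", "text_1"]

# Same ordering as the pipeline's torch.cat call: unconditional first, then text.
combined = uncond_embeddings + text_embeddings

first_half, second_half = chunk(combined, 2)
print(first_half)   # ['uncond_0', 'uncond_1']
print(second_half)  # ['text_0', 'text_1']
```

If the real tensors behave the same way, index `0` selects the null-prompt half and index `1` would select the text-prompt half.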

Is that intended? It doesn't look right to me: when classifier-free guidance is turned off, the same expression uses the embeddings from the text prompt, not the null ones, so the two branches seem inconsistent.

But this pipeline was added without tests, samples, or other reference material, so I can't tell which behavior is intended.

### Reproduction

_No response_

### Logs

_No response_

### System Info

👀

Excerpt at the first link (where the guidance embeddings are selected):

	# perform clip guidance
	if clip_guidance_scale > 0:
	    text_embeddings_for_guidance = (
	        text_embeddings.chunk(2)[0] if do_classifier_free_guidance else text_embeddings
	    )

Excerpt at the second link (where `text_embeddings` is defined):

	# For classifier free guidance, we need to do two forward passes.
	# Here we concatenate the unconditional and text embeddings into a single batch
	# to avoid doing two forward passes
	text_embeddings = torch.cat([uncond_embeddings, text_embeddings])
