
[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline #2779


Merged: 2 commits merged into huggingface:main on Mar 23, 2023

Conversation

@nipunjindal (Contributor) commented Mar 22, 2023

Enabled DPMSolverMultistepScheduler in CLIP-guided pipeline.
Issue: #2737

Here is code to test the changes:

import torch
from diffusers import (
    DiffusionPipeline,
    DPMSolverMultistepScheduler,
    LMSDiscreteScheduler,
)
from transformers import CLIPFeatureExtractor, CLIPModel

feature_extractor = CLIPFeatureExtractor.from_pretrained(
    "laion/CLIP-ViT-B-32-laion2B-s34B-b79K"
)
clip_model = CLIPModel.from_pretrained(
    "laion/CLIP-ViT-B-32-laion2B-s34B-b79K", torch_dtype=torch.float16
)


guided_pipeline = DiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    custom_pipeline="/home/njindal/diffusers/examples/community/clip_guided_stable_diffusion.py",
    clip_model=clip_model,
    feature_extractor=feature_extractor,
    torch_dtype=torch.float16,
)
guided_pipeline.scheduler = DPMSolverMultistepScheduler.from_config(
    guided_pipeline.scheduler.config
)

guided_pipeline.enable_attention_slicing()
guided_pipeline = guided_pipeline.to("cuda")

prompt = "fantasy book cover, full moon, fantasy forest landscape, golden vector elements, fantasy magic, dark light night, intricate, elegant, sharp focus, illustration, highly detailed, digital painting, concept art, matte, art by WLOP and Artgerm and Albert Bierstadt, masterpiece"

generator = torch.Generator(device="cuda").manual_seed(0)

image = guided_pipeline(
    prompt=prompt,
    num_inference_steps=10,
    guidance_scale=7.5,
    clip_guidance_scale=100,
    num_cutouts=4,
    use_cutouts=False,
    generator=generator,
).images[0]

display(image)  # display() assumes a notebook/IPython session; in a plain script, use image.save("output.png") instead

Within 10 steps, I was able to get a good result with the new scheduler:

(screenshot of the generated image)

@HuggingFaceDocBuilderDev commented Mar 22, 2023

The documentation is not available anymore as the PR was closed or merged.

@yuvalkirstain commented Mar 22, 2023

Thank you, @nipunjindal! So it seems like the process for DPMSolverMultistepScheduler is the same as for PNDMScheduler.

Do you happen to know what those lines are meant to do?

fac = torch.sqrt(beta_prod_t)
sample = pred_original_sample * (fac) + latents * (1 - fac)

And why does it use the cutouts + spherical loss rather than the loss from the original paper?

Also, it's probably worth adding a few side-by-side images of w/o classifier guidance vs. w/ classifier guidance (for different classifier guidance scales).
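
For context on the second question: the pipeline's cond_fn compares CLIP embeddings of augmented cutouts of the decoded image against the text embedding using a spherical distance loss. A minimal sketch of such a loss, for illustration only; the exact implementation in clip_guided_stable_diffusion.py may differ:

import torch
import torch.nn.functional as F


def spherical_dist_loss(x, y):
    # Project both embeddings onto the unit sphere, then return a quantity
    # proportional to the squared angular (geodesic) distance between them.
    x = F.normalize(x, dim=-1)
    y = F.normalize(y, dim=-1)
    return (x - y).norm(dim=-1).div(2).arcsin().pow(2).mul(2)


# Hypothetical shapes: one CLIP image embedding per cutout vs. one text embedding.
image_embeds = torch.randn(4, 512)
text_embeds = torch.randn(1, 512)
loss = spherical_dist_loss(image_embeds, text_embeds).mean()
print(loss.item())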

@pete-finesse commented

What would be the process for including the ancestral samplers as well? I find these work best when using CLIP guidance with the SDK from stability.ai.

Comment on removed lines 128 to 133:

if isinstance(self.scheduler, LMSDiscreteScheduler):
    sigma = self.scheduler.sigmas[index]
    # the model input needs to be scaled to match the continuous ODE formulation in K-LMS
    latent_model_input = latents / ((sigma**2 + 1) ** 0.5)
else:
    latent_model_input = latents
Member

Do you mind elaborating on this change?

Contributor

This looks correct! This should not be done in the pipeline :-)
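
A minimal sketch of why this removal is safe, assuming the pipeline scales inputs through the scheduler's scale_model_input, which applies the 1 / sqrt(sigma**2 + 1) factor for LMSDiscreteScheduler and is a no-op for DPMSolverMultistepScheduler:

import torch
from diffusers import DPMSolverMultistepScheduler, LMSDiscreteScheduler

latents = torch.randn(1, 4, 64, 64)

for scheduler_cls in (LMSDiscreteScheduler, DPMSolverMultistepScheduler):
    scheduler = scheduler_cls()
    scheduler.set_timesteps(10)
    t = scheduler.timesteps[0]
    # scale_model_input applies whatever per-scheduler input scaling is needed,
    # so the pipeline no longer has to special-case LMSDiscreteScheduler.
    scaled = scheduler.scale_model_input(latents, t)
    print(scheduler_cls.__name__, scaled.std().item())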

@sayakpaul (Member) left a comment

LGTM, well done 🔥

Could you maybe also include a few lines about this support in the README, since it improves the results (efficiency-wise, at least)? Do you think that would make sense?

@patrickvonplaten could you give it a quick review too?

@sayakpaul (Member) commented

> What would be the process for including the ancestral samplers as well? I find these work best when using CLIP guidance with the SDK from stability.ai.

The process would not differ much. You will need to consider adding an ancestral sampler in the condition (such as this). If it's not immediately compatible, then some changes might be necessary. An example:

# For karras style schedulers the model does classifer free guidance using the
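
For illustration only, a rough sketch of how the condition inside the pipeline's cond_fn could be extended with an ancestral sampler; the helper name is hypothetical and the EulerAncestralDiscreteScheduler branch is an untested assumption (ancestral samplers also inject fresh noise at each step, so extra care may be needed):

from diffusers import (
    DDIMScheduler,
    DPMSolverMultistepScheduler,
    EulerAncestralDiscreteScheduler,
    LMSDiscreteScheduler,
    PNDMScheduler,
)


def predict_original_sample(scheduler, latents, noise_pred, timestep, index):
    # Hypothetical helper paraphrasing the branch inside the pipeline's cond_fn.
    if isinstance(scheduler, (PNDMScheduler, DDIMScheduler, DPMSolverMultistepScheduler)):
        alpha_prod_t = scheduler.alphas_cumprod[timestep]
        beta_prod_t = 1 - alpha_prod_t
        # predicted x_0, formula (12) of https://arxiv.org/abs/2010.02502
        return (latents - beta_prod_t ** 0.5 * noise_pred) / alpha_prod_t ** 0.5
    elif isinstance(scheduler, (LMSDiscreteScheduler, EulerAncestralDiscreteScheduler)):
        # Sigma-based schedulers (ancestral ones included) expose `sigmas`,
        # so predicted x_0 follows from sample - sigma * eps.
        sigma = scheduler.sigmas[index]
        return latents - sigma * noise_pred
    else:
        raise ValueError(f"scheduler type {type(scheduler)} is not supported")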

@patrickvonplaten (Contributor) left a comment

Ok to merge for me

@patrickvonplaten merged commit 055c90f into huggingface:main on Mar 23, 2023
w4ffl35 pushed a commit to w4ffl35/diffusers that referenced this pull request on Apr 14, 2023

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (huggingface#2779)

Co-authored-by: njindal <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request on Apr 26, 2024

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (huggingface#2779)

Co-authored-by: njindal <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>