unCLIP variant #2297
Conversation
Review threads (since resolved) on:
- src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip.py
- src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip_img2img.py
…p_img2img.py Co-authored-by: Patrick von Platen <[email protected]>
Great work! Let's just add docs now and we're good to merge :-)
Happy to merge after the docs are in :-)
```diff
@@ -17,7 +17,7 @@ specific language governing permissions and limitations under the License.
 The Stable Diffusion model was created by the researchers and engineers from [CompVis](https://github.com/CompVis), [Stability AI](https://stability.ai/), [runway](https://github.com/runwayml), and [LAION](https://laion.ai/). The [`StableDiffusionPipeline`] is capable of generating photo-realistic images given any text input using Stable Diffusion.

 The original codebase can be found here:
-- *Stable Diffusion V1*: [CampVis/stable-diffusion](https://github.com/CompVis/stable-diffusion)
+- *Stable Diffusion V1*: [CompVis/stable-diffusion](https://github.com/CompVis/stable-diffusion)
```
Thanks!
Super! Thanks a lot for the addition
* pipeline_variant
* Add docs for when clip_stats_path is specified
* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip.py Co-authored-by: Patrick von Platen <[email protected]>
* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip.py Co-authored-by: Patrick von Platen <[email protected]>
* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip_img2img.py Co-authored-by: Patrick von Platen <[email protected]>
* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_unclip_img2img.py Co-authored-by: Patrick von Platen <[email protected]>
* prepare_latents # Copied from re: @patrickvonplaten
* NoiseAugmentor->ImageNormalizer
* stable_unclip_prior default to None re: @patrickvonplaten
* prepare_prior_extra_step_kwargs
* prior denoising scale model input
* {DDIM,DDPM}Scheduler -> KarrasDiffusionSchedulers re: @patrickvonplaten
* docs
* Update docs/source/en/api/pipelines/stable_unclip.mdx Co-authored-by: Patrick von Platen <[email protected]>

---------
Co-authored-by: Patrick von Platen <[email protected]>
It appears that the documentation currently points to an invalid checkpoint:
so currently, if you attempt to run the model according to the documentation, it will fail. The stable unCLIP model is also conceptually very similar to the Versatile Diffusion model (both have image variations and text + image conditioned image synthesis). Perhaps a note can be put into the current stable_unclip docs so people can use that model until the stable-unclip weights are up?
Yeah, this is a bit WIP still.
Porting weights

Changes to existing models

Added support for `class_embeddings`. The embedding type is very similar to the existing `timestep` embedding type, except that the class_embeddings should not first be converted to sinusoidal embeddings, and should be projected from an arbitrary input dimension. I added the new `class_embed_type` "projection" for this.
New models

Added the NoiseAugmentor class, which handles adding noise to the image embeddings. This had to be a separate class because it needs parameters to store the CLIP mean and std vectors. The class is configured with a noise schedule, just as our scheduler classes are. I considered making this class much lighter weight by removing the noise schedule and just having it hold the CLIP stats. If we did this, we could use the existing scheduler class on the pipeline to noise the vector. I opted against doing this because it would require both of the noise schedules (the one for the diffusion process and the one for augmenting the image embedding) to be configured the same. This would work for the current models we ported, as they do use the same `squaredcos_cap_v2` betas with 1000 timesteps. However, we can't guarantee that will always be the case, and the noise scheduling code to be duplicated is quite small.

Instead, added a DDPMScheduler to the pipeline to hold the noising schedule, and used `StableUnCLIPImageNormalizer` to hold the CLIP statistics.
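As a rough, self-contained sketch of what the noise-augmentation step does (a hypothetical numpy illustration under the assumptions above, not the actual diffusers code, which delegates to `DDPMScheduler.add_noise` and the image normalizer; the function names and the identity mean/std are made up for the example):

```python
import numpy as np

def squaredcos_cap_v2_betas(num_timesteps=1000, max_beta=0.999):
    # Cosine beta schedule (Nichol & Dhariwal), as selected by
    # DDPMScheduler(beta_schedule="squaredcos_cap_v2") in diffusers.
    def alpha_bar(t):
        return np.cos((t + 0.008) / 1.008 * np.pi / 2) ** 2
    betas = []
    for i in range(num_timesteps):
        t1, t2 = i / num_timesteps, (i + 1) / num_timesteps
        betas.append(min(1 - alpha_bar(t2) / alpha_bar(t1), max_beta))
    return np.array(betas)

def noise_image_embeddings(embeds, noise_level, rng, clip_mean, clip_std):
    # Hypothetical sketch of the augmentation: normalize the embedding with
    # the CLIP statistics (the image normalizer's job), add noise at
    # `noise_level` via the q(x_t | x_0) forward formula, then un-normalize.
    betas = squaredcos_cap_v2_betas()
    alphas_cumprod = np.cumprod(1.0 - betas)
    x0 = (embeds - clip_mean) / clip_std                # scale
    noise = rng.standard_normal(x0.shape)
    a = alphas_cumprod[noise_level]
    xt = np.sqrt(a) * x0 + np.sqrt(1.0 - a) * noise    # add noise
    return xt * clip_std + clip_mean                    # unscale
```

Keeping the schedule on a dedicated DDPMScheduler means the embedding-noising schedule can differ from the UNet's diffusion schedule, which is exactly the coupling concern described above.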