
Requesting a PR for version v0.15. #6


Merged
merged 149 commits into main from v0.15 on Apr 27, 2023
Conversation

howsmyanimeprofilepicture
Member

https://github.com/huggingface/diffusers/releases/tag/v0.15.0

I created a separate branch (v0.15) corresponding to this release,

and I think it should be merged into main.

Mishig and others added 30 commits March 23, 2023 13:42
…ace#2732)

The `CLIPFeatureExtractor` class has been renamed to `CLIPImageProcessor` ahead of its upcoming deprecation. This commit includes the necessary changes to the affected files.
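
For context, the rename is a drop-in replacement on the import side. A minimal sketch, assuming a `transformers` version in which `CLIPImageProcessor` is available:

```python
# CLIPFeatureExtractor is being deprecated in favor of CLIPImageProcessor;
# the new class is a drop-in replacement for image preprocessing.
from transformers import CLIPImageProcessor

feature_extractor = CLIPImageProcessor.from_pretrained("openai/clip-vit-large-patch14")
```
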
* initial TokenEncoder and ContinuousEncoder

* initial modules

* added ContinuousContextTransformer

* fix copy paste error

* use numpy for get_sequence_length

* initial terminal relative positional encodings

* fix weights keys

* fix assert

* cross attend style: concat encodings

* make style

* concat once

* fix formatting

* Initial SpectrogramPipeline

* fix input_tokens

* make style

* added mel output

* ignore weights for config

* move mel to numpy

* import pipeline

* fix class names and import

* moved models to models folder

* import ContinuousContextTransformer and SpectrogramDiffusionPipeline

* initial spec diffusion conversion script

* renamed config to t5config

* added weight loading

* use arguments instead of t5config

* broadcast noise time to batch dim

* fix call

* added scale_to_features

* fix weights

* transpose layernorm weight

* scale is a vector

* scale the query outputs

* added comment

* undo scaling

* undo depth_scaling

* initial get_extended_attention_mask

* attention_mask is none in self-attention

* cleanup

* manually invert attention

* nn.Linear needs bias=False

* added T5LayerFFCond

* remove to fix conflict

* make style and dummy

* remove unused variables

* remove predict_epsilon

* Move accelerate to a soft-dependency (huggingface#1134)

* finish

* finish

* Update src/diffusers/modeling_utils.py

* Update src/diffusers/pipeline_utils.py

Co-authored-by: Anton Lozhkov <[email protected]>

* more fixes

* fix

Co-authored-by: Anton Lozhkov <[email protected]>

* fix order

* added initial midi to note token data pipeline

* added int to int tokenizer

* remove duplicate

* added logic for segments

* add melgan to pipeline

* move autoregressive gen into pipeline

* added note_representation_processor_chain

* fix dtypes

* remove immutabledict req

* initial doc

* use np.where

* require note_seq

* fix typo

* update dependency

* added note-seq to test

* added is_note_seq_available

* fix import

* added toc

* added example usage

* undo for now

* moved docs

* fix merge

* fix imports

* predict first segment

* avoid un-needed copy to and from cpu

* make style

* Copyright

* fix style

* add test and fix inference steps

* remove bogus files

* reorder models

* up

* remove transformers dependency

* make work with diffusers cross attention

* clean more

* remove @

* improve further

* up

* up

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* loop over all tokens

* make style

* Added a section on the model

* fix formatting

* grammar

* formatting

* make fix-copies

* Update src/diffusers/pipelines/__init__.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py

Co-authored-by: Patrick von Platen <[email protected]>

* added callback and optional onnx

* do not squeeze batch dim

* clean up more

* upload

* convert jax to numpy

* make style

* fix warning

* make fix-copies

* fix warning

* add initial fast tests

* add initial pipeline_params

* eval mode due to dropout

* skip batch tests as pipeline runs on a single file

* make style

* fix relative path

* fix doc tests

* Update src/diffusers/models/t5_film_transformer.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update src/diffusers/models/t5_film_transformer.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx

Co-authored-by: Patrick von Platen <[email protected]>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

Co-authored-by: Patrick von Platen <[email protected]>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

Co-authored-by: Patrick von Platen <[email protected]>

* add MidiProcessor

* format

* fix org

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* make style

* pin protobuf to <4

* fix formatting

* white space

* tensorboard needs protobuf

---------

Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Anton Lozhkov <[email protected]>
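
Taken together, these commits add the `SpectrogramDiffusionPipeline` and its `MidiProcessor`. A rough usage sketch (the MIDI file path is a placeholder):

```python
from diffusers import MidiProcessor, SpectrogramDiffusionPipeline

pipe = SpectrogramDiffusionPipeline.from_pretrained("google/music-spectrogram-diffusion")
pipe = pipe.to("cuda")
processor = MidiProcessor()

# MidiProcessor tokenizes a MIDI file into note-token segments, which the
# pipeline then renders one segment at a time via the MelGAN vocoder.
output = pipe(processor("beethoven_hammerklavier_2.mid"))
audio = output.audios[0]
```
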
…line (huggingface#2779)

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines

Co-authored-by: njindal <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
* small fixes to the text to video doc.

* add: Spaces link.

* add: warning on research-only model.
* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* format
* Skip mps in text-to-video tests.

* style

* Skip UNet3D mps tests.
* add controlnet flax

---------

Co-authored-by: yiyixuxu <[email protected]>
* add colab notebook and spaces

* fix image link
* Add AudioLDM

* up

* add vocoder

* start unet

* unconditional unet

* clap, vocoder and vae

* clean-up: conversion scripts

* fix: conversion script token_type_ids

* clean-up: pipeline docstring

* tests: from SD

* clean-up: cpu offload vocoder instead of safety checker

* feat: adapt tests to audioldm

* feat: add docs

* clean-up: amend pipeline docstrings

* clean-up: make style

* clean-up: make fix-copies

* fix: add doc path to toctree

* clean-up: args for conversion script

* clean-up: paths to checkpoints

* fix: use conditional unet

* clean-up: make style

* fix: type hints for UNet

* clean-up: docstring for UNet

* clean-up: make style

* clean-up: remove duplicate in docstring

* clean-up: make style

* clean-up: make fix-copies

* clean-up: move imports to start in code snippet

* fix: pass cross_attention_dim as a list/tuple to unet

* clean-up: make fix-copies

* fix: update checkpoint path

* fix: unet cross_attention_dim in tests

* film embeddings -> class embeddings

* Apply suggestions from code review

Co-authored-by: Will Berman <[email protected]>

* fix: unet film embed to use existing args

* fix: unet tests to use existing args

* fix: make style

* fix: transformers import and version in init

* clean-up: make style

* Revert "clean-up: make style"

This reverts commit 5d6d1f8.

* clean-up: make style

* clean-up: use pipeline tester mixin tests where poss

* clean-up: skip attn slicing test

* fix: add torch dtype to docs

* fix: remove conversion script out of src

* fix: remove .detach from 1d waveform

* fix: reduce default num inf steps

* fix: swap height/width -> audio_length_in_s

* clean-up: make style

* fix: remove nightly tests

* fix: imports in conversion script

* clean-up: slim-down to two slow tests

* clean-up: slim-down fast tests

* fix: batch consistent tests

* clean-up: make style

* clean-up: remove vae slicing fast test

* clean-up: propagate changes to doc

* fix: increase test tol to 1e-2

* clean-up: finish docs

* clean-up: make style

* feat: vocoder / VAE compatibility check

* feat: possibly expand / cut audio waveform

* fix: pipeline call signature test

* fix: slow tests output len

* clean-up: make style

* make style

---------

Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: William Berman <[email protected]>
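
As the commits above note, the AudioLDM pipeline takes `audio_length_in_s` in place of the image-style `height`/`width` arguments. A minimal sketch (checkpoint and duration are illustrative):

```python
import torch
from diffusers import AudioLDMPipeline

pipe = AudioLDMPipeline.from_pretrained("cvssp/audioldm", torch_dtype=torch.float16)
pipe = pipe.to("cuda")

# The prompt is turned into a CLAP text embedding that conditions the UNet;
# the vocoder converts the generated mel spectrogram back to a 1-D waveform.
audio = pipe("a hammer hitting a wooden surface", audio_length_in_s=5.12).audios[0]
```
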
* TIME first commit

* styling.

* styling 2.

* fixes; tests

* apply styling and doc fix.

* remove sups.

* fixes

* remove temp file

* move augmentations to const

* added doc entry

* code quality

* customize augmentations

* quality

* quality

---------

Co-authored-by: Sayak Paul <[email protected]>
* Relax DiT test

* relax 2 more tests

* fix style

* skip test on mac due to older protobuf
* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training

* add ort_nightly_gpu
…gingface#2815)

* update docs to reflect the updated ckpts.

* update: point about prompt.

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <[email protected]>

* remove image resizing.

* Apply suggestions from code review

* Apply suggestions from code review

---------

Co-authored-by: Patrick von Platen <[email protected]>
* Apply same ruff settings as in transformers

See https://github.com/huggingface/transformers/blob/main/pyproject.toml
Co-authored-by: Aaron Gokaslan <[email protected]>

* Apply new style rules

* Style

Co-authored-by: Aaron Gokaslan <[email protected]>

* style

* remove list, ruff wouldn't auto fix.

---------

Co-authored-by: Aaron Gokaslan <[email protected]>

* Helper function to disable custom attention processors.

* Restore code deleted by mistake.

* Format

* Fix modeling_text_unet copy.
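
Assuming this is the `set_default_attn_processor` helper that landed in this release, a minimal usage sketch:

```python
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# Drop any custom attention processors (e.g. attached LoRA or xFormers
# processors) and restore the stock attention implementation.
pipe.unet.set_default_attn_processor()
```
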
…uggingface#2804)

* add: better warning messages when handling multiple conditioning.

* fix: handling of controlnet_conditioning_scale
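
A sketch of the multi-conditioning call these warning messages cover; the checkpoints and conditioning images are placeholders:

```python
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

canny_image = Image.open("canny.png")  # placeholder conditioning images
pose_image = Image.open("pose.png")

controlnets = [
    ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny"),
    ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-openpose"),
]
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnets
)

# With several ControlNets, controlnet_conditioning_scale can be a list
# with one weight per ControlNet instead of a single float.
image = pipe(
    "a futuristic city",
    image=[canny_image, pose_image],
    controlnet_conditioning_scale=[1.0, 0.8],
).images[0]
```
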
* add train_controlnet_flax

---------

Co-authored-by: Patrick von Platen <[email protected]>
* Workaround for saving dynamo-wrapped models.

* Accept suggestion from code review

Co-authored-by: Patrick von Platen <[email protected]>

* Apply workaround when overriding pipeline components.

* Ensure the correct config.json is saved to disk.

Instead of the dynamo class.

* Save correct module (not compiled one)

* Add test

* style

* fix docstrings

* Go back to using string comparisons.

PyTorch CPU does not have _dynamo.

* Simple test for save_pretrained of compiled models.

* Helper function to test whether module is compiled.

---------

Co-authored-by: Patrick von Platen <[email protected]>
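
A sketch of the detect-and-unwrap idea, using the string comparison mentioned above (CPU-only PyTorch builds lack `torch._dynamo`, so an `isinstance` check against the dynamo class is avoided):

```python
import torch

def is_compiled_module(module: torch.nn.Module) -> bool:
    # torch.compile wraps the original module in an OptimizedModule; a
    # string check avoids importing torch._dynamo on CPU-only builds.
    return "OptimizedModule" in str(type(module))

def unwrap_module(module: torch.nn.Module) -> torch.nn.Module:
    # Save the original module (kept in `_orig_mod`), not the wrapper,
    # so the correct class name and config.json end up on disk.
    return module._orig_mod if is_compiled_module(module) else module
```
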
…odel from checkpoint (huggingface#2768)

* Allow user to disable SafetyChecker and enable dtypes if loading models from .ckpt or .safetensors

* Fix Import sorting (Ruff error)

* Get rid of the dtype convert method as it was implemented all along

* Fix the docstring

* Fix ruff formatting

---------

Co-authored-by: Patrick von Platen <[email protected]>
pcuenca and others added 23 commits April 11, 2023 23:20
When doing generation manually and using guidance_scale as a static
argument.
* Fix invocation of some slow tests.

We use __call__ rather than pmapping the generation function ourselves
because the number of static arguments is different now.

* style
* add only cross attention to simple attention blocks

* add test for only_cross_attention re: @patrickvonplaten

* mid_block_only_cross_attention better default

allow mid_block_only_cross_attention to default to
`only_cross_attention` when `only_cross_attention` is given
as a single boolean
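
A sketch of that defaulting, with names taken from the commit message rather than the final implementation:

```python
def resolve_mid_block_only_cross_attention(only_cross_attention, mid_block_only_cross_attention=None):
    # When only_cross_attention is a single bool and the mid-block flag was
    # not set explicitly, the mid block inherits the same setting.
    if mid_block_only_cross_attention is None and isinstance(only_cross_attention, bool):
        return only_cross_attention
    return mid_block_only_cross_attention
```
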
* ⚙️chore(train_controlnet) fix typo in logger message

* ⚙️chore(models) refactor module order so it matches the calling order

When printing a BasicTransformerBlock to stdout, it's important that the attributes appear in the order they are called. Also, the "3. Feed Forward" comment previously made no sense where it was: it should sit next to self.ff, but it was next to self.norm3 instead.

* correct many tests

* remove bogus file

* make style

* correct more tests

* finish tests

* fix one more

* make style

* make unclip deterministic

* ⚙️chore(models/attention) reorganize comments in BasicTransformerBlock class

---------

Co-authored-by: Patrick von Platen <[email protected]>
* unet time embedding activation function

* typo act_fn -> time_embedding_act_fn

* flatten conditional
add group norm type to attention processor cross attention norm

This lets the cross attention norm use both a group norm block and a
layer norm block.

The group norm operates along the channels dimension and requires input shape (batch size, channels, *), whereas the layer norm with a single `normalized_shape` dimension only operates over the last dimension, i.e. (*, channels).

The channels we want to normalize are the hidden dimension of the encoder hidden states.

By convention, the encoder hidden states are always passed as (batch size, sequence
length, hidden states).

This means the layer norm can operate on the tensor without modification, but the group
norm requires flipping the last two dimensions to operate on (batch size, hidden states, sequence length).

All existing attention processors will have the same logic and we can
consolidate it in a helper function `prepare_encoder_hidden_states`

prepare_encoder_hidden_states -> norm_encoder_hidden_states re: @patrickvonplaten

move norm_cross defined check to outside norm_encoder_hidden_states

add missing attn.norm_cross check
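
A sketch of the resulting helper, following the shape reasoning above:

```python
import torch

def norm_encoder_hidden_states(attn, encoder_hidden_states: torch.Tensor) -> torch.Tensor:
    # Encoder hidden states arrive as (batch, sequence, hidden). LayerNorm
    # already normalizes the last dimension; GroupNorm normalizes dim 1, so
    # the last two dimensions are swapped around the call.
    if isinstance(attn.norm_cross, torch.nn.LayerNorm):
        return attn.norm_cross(encoder_hidden_states)
    if isinstance(attn.norm_cross, torch.nn.GroupNorm):
        hidden = encoder_hidden_states.transpose(1, 2)  # (batch, hidden, sequence)
        hidden = attn.norm_cross(hidden)
        return hidden.transpose(1, 2)                   # (batch, sequence, hidden)
    return encoder_hidden_states
```
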
…uggingface#2994)

* fix: norm group test for UNet3D.

* fix: type-casting issue in controlnet training.
* add: first draft for a better LoRA enabler.

* make fix-copies.

* feat: backward compatibility.

* add: entry to the docs.

* add: tests.

* fix: docs.

* fix: norm group test for UNet3D.

* feat: add support for flat dicts.

* add deprecation message instead of warning.
* fix slow tests

* make style
* fix: norm group test for UNet3D.

* fix: unet rejig.

* fix: unwrapping when running validation inputs.

* unwrapping the unet too.

* fix: device.

* better unwrapping.

* unwrapping before ema.

* unwrapping.
* Update index.mdx

* Edit docs & add HF space link

* Only change equation numbers in comments
* add use_memory_efficient params placeholder

* test

* add memory efficient attention jax

* add memory efficient attention jax

* newline

* forgot dot

* Rename use_memory_efficient

* Keep dtype last.

* Actually use key_chunk_size

* Rename symbol

* Apply style

* Rename use_memory_efficient

* Keep dtype last

* Pass `use_memory_efficient_attention` in `from_pretrained`

* Move JAX memory efficient attention to attention_flax.

* Simple test.

* style

---------

Co-authored-by: muhammad_hanif <[email protected]>
Co-authored-by: MuhHanif <[email protected]>
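
A minimal sketch of enabling the flag through `from_pretrained`; the checkpoint and revision are illustrative:

```python
import jax.numpy as jnp
from diffusers import FlaxStableDiffusionPipeline

# use_memory_efficient_attention switches attention_flax to the chunked,
# memory-efficient implementation added in this commit.
pipe, params = FlaxStableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    revision="bf16",
    dtype=jnp.bfloat16,
    use_memory_efficient_attention=True,
)
```
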
* initial commit for lora test cases

* help a bit with lora for 3d

* fixed lora tests

* replaced redundant code

---------

Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
* fix pipeline __setattr__

* add test

---------

Co-authored-by: Patrick von Platen <[email protected]>
… pipelines (huggingface#2597)

* add support for prompt embeds to SD ONNX pipeline

* fix up the pipeline copies

* add prompt embeds param to other ONNX pipelines

* fix up prompt embeds param for SD upscaling ONNX pipeline

* add missing type annotations to ONNX pipes
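
A sketch of the new parameter on the ONNX pipeline; the checkpoint revision is illustrative and the embeddings here are random stand-ins for real CLIP text-encoder output (batch 1, 77 tokens, 768 dims):

```python
import numpy as np
from diffusers import OnnxStableDiffusionPipeline

pipe = OnnxStableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", revision="onnx", provider="CPUExecutionProvider"
)

# ONNX pipelines take numpy arrays rather than torch tensors; in practice
# prompt_embeds would come from the CLIP text encoder, not np.random.
prompt_embeds = np.random.randn(1, 77, 768).astype(np.float32)
image = pipe(prompt_embeds=prompt_embeds).images[0]
```
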
* [2737]: Add Karras DPMSolverMultistepScheduler

* [2737]: Add Karras DPMSolverMultistepScheduler

* Add test

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <[email protected]>

* fix: repo consistency.

* remove Copied from statement from the set_timestep method.

* fix: test

* Empty commit.

Co-authored-by: njindal <[email protected]>

---------

Co-authored-by: njindal <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
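
A minimal sketch of switching an existing pipeline to the Karras noise schedule:

```python
from diffusers import DiffusionPipeline, DPMSolverMultistepScheduler

pipe = DiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# use_karras_sigmas enables the Karras et al. (2022) sigma schedule
# added in this commit.
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=True
)
image = pipe("an astronaut riding a horse", num_inference_steps=25).images[0]
```
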
* Finish docs textual inversion

* Apply suggestions from code review

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
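
The docs finished here cover loading learned embeddings; a minimal sketch, with an illustrative concept repository:

```python
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# load_textual_inversion adds the learned embedding to the text encoder and
# registers its placeholder token (here "<cat-toy>") with the tokenizer.
pipe.load_textual_inversion("sd-concepts-library/cat-toy")
image = pipe("a photo of a <cat-toy> on a beach").images[0]
```
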
* fix: norm group test for UNet3D.

* refactor text-to-video zero docs.
Update Flax TPU tests.

Co-authored-by: Patrick von Platen <[email protected]>
* Fix a bug in the panorama pipeline when classifier-free guidance (CFG) is not used

* enhance code quality

* apply formatting.

---------

Co-authored-by: Sayak Paul <[email protected]>
* fix progress bar issue in pipeline_text_to_video_zero.py. Copy scheduler after first backward

* fix tensor loading in test_text_to_video_zero.py

* make style && make quality
tjdtnsu merged commit 1f0530f into main on Apr 27, 2023
tjdtnsu deleted the v0.15 branch on May 23, 2023