EMA: fix `state_dict()` and `load_state_dict()` & add `cur_decay_value` #2146

chenguolin · 2023-01-28T04:49:48Z

Fix the value saved for min_decay in the EMA self.state_dict().
Add an attribute self.cur_decay_value to track the current decay value
(self.decay is a constant meaning the "max decay value").
In the "unconditional image generation" examples, track the current EMA decay value with self.cur_decay_value instead of self.decay.
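
To illustrate the distinction described above, here is a minimal pure-Python sketch. It is not the diffusers `EMAModel` implementation: the class name `TinyEMA`, the warm-up schedule `(1 + step) / (10 + step)`, and the state-dict keys are assumptions chosen for illustration. It only shows why a constant `decay` (the upper bound) differs from the per-step `cur_decay_value`, and that `min_decay` is saved under its own key.

```python
class TinyEMA:
    """Toy EMA over a list of floats (hypothetical, not diffusers' EMAModel)."""

    def __init__(self, params, decay=0.9999, min_decay=0.0):
        self.shadow_params = list(params)
        self.decay = decay            # constant: the maximum decay value
        self.min_decay = min_decay    # constant: the minimum decay value
        self.optimization_step = 0
        self.cur_decay_value = None   # decay actually applied in the last step

    def get_decay(self, step):
        # Assumed warm-up schedule, clamped to [min_decay, decay].
        value = (1 + step) / (10 + step)
        return max(self.min_decay, min(value, self.decay))

    def step(self, params):
        self.optimization_step += 1
        d = self.get_decay(self.optimization_step)
        self.cur_decay_value = d
        self.shadow_params = [
            d * s + (1 - d) * p for s, p in zip(self.shadow_params, params)
        ]

    def state_dict(self):
        # Each hyperparameter is stored under its own key.
        return {
            "decay": self.decay,
            "min_decay": self.min_decay,
            "optimization_step": self.optimization_step,
            "shadow_params": list(self.shadow_params),
        }


ema = TinyEMA([0.0], decay=0.5, min_decay=0.1)
logs = {}
for _ in range(3):
    ema.step([1.0])
    # Log the value that was really used, not the constant upper bound.
    logs["ema_decay"] = ema.cur_decay_value
```

During warm-up `logs["ema_decay"]` changes from step to step, while `ema.decay` stays at its configured value throughout.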

HuggingFaceDocBuilderDev · 2023-01-28T04:54:14Z

The documentation is not available anymore as the PR was closed or merged.

'float' object (`state_dict["power"]`) has no attribute 'get'.

chenguolin · 2023-01-29T05:09:48Z

Fix a bug in EMA load_state_dict(): "AttributeError: 'float' object has no attribute 'get'"
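
For readers unfamiliar with this error, the sketch below reproduces the same class of failure in plain Python. The functions `load_power_buggy` and `load_power_fixed` are hypothetical and are not copied from the diffusers source. The assumption is simply that `.get` ended up being called on the float stored under `"power"` rather than on the state dict itself.

```python
def load_power_buggy(state_dict, default=2 / 3):
    # Hypothetical reconstruction: `.get` is called on the float value,
    # which raises AttributeError because floats have no `get` method.
    return state_dict["power"].get("power", default)


def load_power_fixed(state_dict, default=2 / 3):
    # Call `.get` on the dict, then validate the type of the result.
    power = state_dict.get("power", default)
    if not isinstance(power, (float, int)):
        raise ValueError("Invalid power")
    return power


saved = {"power": 0.75}

try:
    load_power_buggy(saved)
except AttributeError as err:
    message = str(err)  # the message names the 'float' type and 'get'

restored = load_power_fixed(saved)
```

With the fixed version, a state dict that lacks the key falls back to the default instead of raising.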

patrickvonplaten · 2023-01-31T09:18:55Z

This looks good to me! @chenguolin could you maybe also check if the EMAModel class is correctly used in: https://github.com/huggingface/diffusers/blob/main/examples/unconditional_image_generation/train_unconditional.py and potentially fix it? :-)

chenguolin · 2023-01-31T09:43:58Z

It looks good. I have changed logs["ema_decay"] to the new ema_model.cur_decay_value to track the current decay value at:

diffusers/examples/unconditional_image_generation/train_unconditional.py

Line 522 in 7d96b38

logs["ema_decay"] = ema_model.decay

patrickvonplaten

This LGTM, @pcuenca @patil-suraj could you take a look?

patrickvonplaten · 2023-02-03T15:53:09Z

Actually we should probably also apply the same changes to: https://github.com/huggingface/diffusers/blob/main/examples/text_to_image/train_text_to_image.py

In case you have 5min it'd be great if you could take a look at applying the same changes to train_text_to_image as well @chenguolin :-)

cc @patil-suraj wdyt of this PR

chenguolin · 2023-02-04T07:21:35Z

Hi @patrickvonplaten, the only necessary change to the example "train_unconditional.py" is logs["ema_decay"] = ema_model.decay -> logs["ema_decay"] = ema_model.cur_decay_value.

Since the other examples don't use ema_model.decay, there is probably no need to change them.

patil-suraj

Thanks a lot for fixing this!

patil-suraj · 2023-02-07T12:50:37Z

@chenguolin there's a merge conflict in train_unconditional_ort.py, could you please fix it? It should be good to merge after that.

pcuenca

Thank you!

chenguolin · 2023-02-07T16:55:26Z

Hi @patil-suraj, I have just deleted train_unconditional_ort.py and merged.

williamberman

The failing test is unrelated :)

…e` (huggingface#2146) * EMA: fix `state_dict()` & add `cur_decay_value` * EMA: fix a bug in `load_state_dict()` 'float' object (`state_dict["power"]`) has no attribute 'get'. * del train_unconditional_ort.py

EMA: fix state_dict() & add cur_decay_value

c3e70c3

EMA: fix a bug in load_state_dict()

e60364c

'float' object (`state_dict["power"]`) has no attribute 'get'.

chenguolin changed the title ~~EMA: fix state_dict() & add cur_decay_value~~ EMA: fix state_dict() and load_state_dict() & add cur_decay_value Jan 29, 2023

patrickvonplaten requested review from pcuenca and patil-suraj February 3, 2023 15:50

patrickvonplaten approved these changes Feb 3, 2023


patrickvonplaten requested a review from williamberman February 7, 2023 07:54

patil-suraj approved these changes Feb 7, 2023


pcuenca approved these changes Feb 7, 2023


chenguolin added 3 commits February 7, 2023 16:48

del train_unconditional_ort.py

83c5093

Merge branch 'main' of https://github.com/chenguolin/diffusers into main

81bdae0

Merge branch 'huggingface:main' into main

14d6b79

williamberman approved these changes Feb 7, 2023


patil-suraj merged commit 9d0d070 into huggingface:main Feb 8, 2023
