Conversation

@jsmidt
Contributor

jsmidt commented Aug 9, 2024

What does this PR do?

dtype=torch.float64 is overkill, and float64 is not defined for certain devices such as Apple Silicon mps. This change enables the flux pipeline to be run on certain devices such as Apple Silicon mps without negative consequences.
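As a rough, torch-free sketch of the computation the changed dtype feeds into (the names and shapes here are illustrative, not the exact diffusers code): the rotary embedding builds per-position angular frequencies, and the precision of that intermediate is what this PR relaxes from float64 to float32.

```python
import math

def rope_angles(pos: float, dim: int, theta: float = 10000.0) -> list:
    """Angles for a rotary position embedding at one position.

    In transformer_flux.py the analogous tensor was created with
    dtype=torch.float64; this PR switches it to float32 so the op is
    defined on backends (such as MPS) that lack float64 kernels.
    """
    return [pos / (theta ** (2 * i / dim)) for i in range(dim // 2)]

# The embedding itself is built from cos/sin of these angles.
angles = rope_angles(pos=3.0, dim=8)
cos_sin = [(math.cos(a), math.sin(a)) for a in angles]
```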

jsmidt added 2 commits August 8, 2024 19:15:
- dtype=torch.float64 is overkill, and float64 is not defined for certain devices such as Apple Silicon mps.
- Update transformer_flux.py. Change float64 to float32.
@bghira
Contributor

bghira commented Aug 9, 2024

just a note that macos 14 and pytorch 2.4 or greater still can't do it. but i think macos 15 can, or, pytorch 2.3.1 with macos 14. but then training uses a lot more vram.

edit: no, macos 15 still broken - don't upgrade to fix it.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@hvaara
Contributor

hvaara commented Aug 13, 2024

This fix should make the model runnable on MPS. Tested with https://gist.github.com/hvaara/bc8754b2aab6ef07a95c82c5e436f6d3. Running macOS 14.6. transformers and diffusers from the main branch.

@bghira Does it work for you with the patch from this PR and the code from my gist? It needs ~45 GB of VRAM. If not, what error are you seeing?

@bghira
Contributor

bghira commented Aug 13, 2024

oh roy 🙉 we meet again. i see no error, it's just that the image is all noise

@hvaara
Contributor

hvaara commented Aug 14, 2024

Haha! Indeed we do 😂

Are you using latest diffusers and transformers? Are you sure your weights are not corrupted? With the code change by OP and the script in my gist I get great images using MPS as the accelerator.

I actually came here to contribute the exact same change as OP 😅

@bghira
Contributor

bghira commented Aug 14, 2024

all day every day on git branches.. latest and hot off the wire. pytorch nightly, latest diffusers/transformers, macos 15 beta 4.

@hvaara
Contributor

hvaara commented Aug 14, 2024

The issue @bghira experienced has been identified as a bug in PyTorch. I will open a bug report and propose a fix upstream.

@hvaara
Contributor

hvaara commented Aug 15, 2024

Follow pytorch/pytorch#133520 for updates on the noisy output image issue.

@sayakpaul
Member

Does this PR reliably solve the problem?

Cc: @DN6 @pcuenca

@hvaara
Contributor

hvaara commented Aug 16, 2024

Yes. The only thing I would consider is the precision reduction. This has been solved in the past by predicating on the device and only reducing precision for MPS.

Prior art: #1169 #6365 #942
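The device-predicated approach from those earlier fixes can be sketched as a small helper (the function name is hypothetical; the actual diffusers code differs):

```python
def rope_freqs_dtype(device_type: str) -> str:
    """Choose the precision for rotary-embedding frequencies.

    float64 kernels are not implemented for the MPS backend, so reduce
    precision only there and keep float64 everywhere else, mirroring
    the device-predicated pattern used in the prior-art PRs.
    """
    return "float32" if device_type == "mps" else "float64"
```

In real code the check would typically be against the position tensor's `device.type`.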

@sayakpaul
Member

Thanks!

This has been solved in the past by predicating on the device and only reducing precision for MPS.

I would advocate for this myself, and perhaps also logging a warning that we're reducing precision here and that results may be unexpected, with a reference to this PR. WDYT?
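That warning could look roughly like this (hypothetical function and message wording; a sketch, not the final implementation):

```python
import logging

logger = logging.getLogger(__name__)

def freqs_dtype_with_warning(device_type: str) -> str:
    # Downcast only on MPS, which lacks float64 kernels, and warn the
    # user that reduced precision may change the output slightly.
    if device_type == "mps":
        logger.warning(
            "float64 is not supported on the MPS backend; computing rotary "
            "embeddings in float32 instead. Results may differ slightly."
        )
        return "float32"
    return "float64"
```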

@hvaara
Contributor

hvaara commented Aug 16, 2024

Did some testing. I don't know how much of an impact it has, but overall I think the images generated with float64 look the best.

A cat

Prompt: A cat holding a sign that says hello world

float16: [image: output_image_mps_1_A cat holding a sign that says hello world_newtorch_16]
float32: [image: output_image_mps_1_A cat holding a sign that says hello world_newtorch_32]
float64: [image: output_image_mps_1_A cat holding a sign that says hello world_newtorch_64]

A landscape

Prompt: Highly detailed fantasy landscape at golden hour in an ancient forest. Towering trees with glowing runes are illuminated by warm light, casting shadows on a mossy floor dotted with glowing flowers.

A clear stream winds through the foreground, reflecting the sky's hues and surrounded by glowing mushrooms. Fairy-like creatures with translucent wings flutter above, leaving shimmering trails.

In the background (CLIP cut me off)

float16: [image: output_image_mps_1_longprompt_newtorch_16]
float32: [image: output_image_mps_1_longprompt_newtorch_32]
float64: [image: output_image_mps_1_longprompt_newtorch_64]

@DN6
Collaborator

DN6 commented Aug 16, 2024

Yeah agree with @hvaara. @jsmidt could we just add a check for MPS device and then downcast to FP32?

@yiyixuxu
Collaborator

yiyixuxu commented Aug 17, 2024

cc @asomoza here

Can you take a look (or possibly run more tests) to see if there is any difference? My eye couldn't spot any quality difference between the float64 and float32 outputs.

For context: I'm refactoring flux to use get_1d_rotary_pos_embed in #9074, which uses float32. If float64 indeed generates better results, we might want to apply it to all models that use rotary embeddings (and of course downcast for MPS).
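The "compute in float64, downcast for MPS" idea can be sketched without torch by using struct to emulate float32 rounding (illustrative only; the real code would call .to() on a tensor):

```python
import struct

def as_float32(x: float) -> float:
    """Round a Python float (IEEE float64) to float32 precision."""
    return struct.unpack("f", struct.pack("f", x))[0]

def rope_angles(pos: float, dim: int, theta: float = 10000.0,
                downcast: bool = False) -> list:
    # Compute the angles at full float64 precision first; optionally
    # downcast each one, standing in for a .to(torch.float32) on MPS.
    angles = [pos / (theta ** (2 * i / dim)) for i in range(dim // 2)]
    return [as_float32(a) for a in angles] if downcast else angles
```

The image comparison above suggests any visible difference between the two paths is subtle.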

@bghira
Contributor

bghira commented Aug 17, 2024

some models do better with fp16 rope embeds. a transformer config option maybe?

@asomoza
Member

asomoza commented Aug 19, 2024

can you take a look (or possibly run more test) to see if there is any difference

@yiyixuxu, I did some tests with flux dev and I also don't see a really big difference, but if I have to choose one, I think float64 is better when I look closely at some tiny details.

yiyixuxu pushed a commit that referenced this pull request Aug 19, 2024
@yiyixuxu yiyixuxu mentioned this pull request Aug 19, 2024
2 tasks
yiyixuxu added a commit that referenced this pull request Aug 21, 2024
* refactor rotary embeds

* adding jsmidt as co-author of this PR for #9133

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: Joseph Smidt <[email protected]>
@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Sep 14, 2024
@beaugunderson

did anything remove the need for this on mps?

@hvaara
Contributor

hvaara commented Sep 14, 2024

Yes, #9074 solved it. This PR can be closed.

@sayakpaul sayakpaul closed this Sep 14, 2024
sayakpaul added a commit that referenced this pull request Dec 23, 2024
* refactor rotary embeds

* adding jsmidt as co-author of this PR for #9133

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: Joseph Smidt <[email protected]>