mps cross-attention hack: don't crash on fp16 #2258
Conversation
The documentation is not available anymore as the PR was closed or merged.
```diff
@@ -234,7 +234,7 @@ def prepare_attention_mask(self, attention_mask, target_length):
     # HACK: MPS: Does not support padding by greater than dimension of input tensor.
     # Instead, we can manually construct the padding tensor.
     padding_shape = (attention_mask.shape[0], attention_mask.shape[1], target_length)
-    padding = torch.zeros(padding_shape, device=attention_mask.device)
+    padding = torch.zeros(padding_shape).to(attention_mask)
```
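For context, here is a minimal sketch of how this hunk fits into the surrounding function. Everything outside the diff lines is reconstructed for illustration (and written as a free function rather than a method), so it may not match the actual file:

```python
import torch
import torch.nn.functional as F

def prepare_attention_mask(attention_mask: torch.Tensor, target_length: int) -> torch.Tensor:
    if attention_mask.shape[-1] == target_length:
        return attention_mask
    if attention_mask.device.type == "mps":
        # HACK: MPS: Does not support padding by greater than dimension of input tensor.
        # Instead, we can manually construct the padding tensor.
        padding_shape = (attention_mask.shape[0], attention_mask.shape[1], target_length)
        # .to(attention_mask) matches both the device and the dtype of the mask,
        # so the concatenation below also works when the mask is fp16.
        padding = torch.zeros(padding_shape).to(attention_mask)
        return torch.cat([attention_mask, padding], dim=2)
    return F.pad(attention_mask, (0, target_length), value=0.0)
```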
Suggested change:
```diff
-padding = torch.zeros(padding_shape).to(attention_mask)
+padding = torch.zeros(padding_shape).to(attention_mask.device)
```
no?
I recently learned that doing it that way changes both the device and the dtype at once. But it may not be clear, so I'll change it.
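To make the semantics concrete, here is a small illustrative snippet (not part of the PR) showing the difference between passing a tensor and passing a device to `.to()`:

```python
import torch

mask = torch.ones(2, 4, 8, dtype=torch.float16)  # stand-in for an fp16 attention mask
padding_shape = (2, 4, 16)

# Passing a tensor matches BOTH its device and its dtype:
padding_both = torch.zeros(padding_shape).to(mask)
print(padding_both.dtype)  # torch.float16

# Passing only the device leaves the dtype at the default (float32):
padding_device_only = torch.zeros(padding_shape).to(mask.device)
print(padding_device_only.dtype)  # torch.float32
```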
Ah interesting! Good to keep in mind :-)
In general, IMO it's better to stick to our existing API usage (e.g. we always use f-strings), so even if it requires more code we should keep what we have currently and change it only if we change the whole codebase, so that our code design stays consistent.
Also, for such cases we should be loosely aware of whether something is a very recent PyTorch feature or has already been there for ~2 years. If it's been there for ~2 years, then happy to adopt it.
It's been there for several versions, I verified :) But I do agree that it's nowhere in our codebase, so it's better to err on the explicit side.
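For reference, the fully explicit spelling could look like the snippet below. This is a sketch of the "explicit" alternative being discussed; the exact form merged in the PR may differ:

```python
# Spell out both properties instead of relying on Tensor.to(tensor) semantics:
padding = torch.zeros(
    padding_shape,
    dtype=attention_mask.dtype,
    device=attention_mask.device,
)
```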
Thanks!
Ah, thanks @pcuenca!
* mps cross-attention hack: don't crash on fp16
* Make conversion explicit.
Found while testing #1791.
The pipeline doesn't crash, but it still doesn't work.