Support using different attention kwargs for different types of processors in one model. #4152

@eliphatfs

Description

Is your feature request related to a problem? Please describe.
Say you want to use Custom Diffusion attention processors for some layers and LoRA processors for others, and you pass {'scale': 0.5} as cross_attention_kwargs so the LoRA layers pick it up. The call then fails with:

TypeError: __call__() got an unexpected keyword argument 'scale'

This happens because the CustomDiffusionAttnProcessor's __call__ does not accept a scale argument.
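
A rough sketch of a setup that reproduces this; the checkpoint, the split of layers between the two processor types, and the LoRA rank are arbitrary choices just for illustration:

```python
import torch
from diffusers import StableDiffusionPipeline
from diffusers.models.attention_processor import (
    CustomDiffusionAttnProcessor,
    LoRAAttnProcessor,
)

# fp32 on CPU keeps the repro simple; the failure does not depend on device/dtype.
pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
unet = pipe.unet

procs = {}
for name in unet.attn_processors:
    # Derive per-layer sizes the same way the diffusers training examples do.
    is_self_attention = name.endswith("attn1.processor")
    cross_attention_dim = None if is_self_attention else unet.config.cross_attention_dim
    if name.startswith("mid_block"):
        hidden_size = unet.config.block_out_channels[-1]
    elif name.startswith("up_blocks"):
        block_id = int(name[len("up_blocks.")])
        hidden_size = list(reversed(unet.config.block_out_channels))[block_id]
    else:  # down_blocks
        block_id = int(name[len("down_blocks.")])
        hidden_size = unet.config.block_out_channels[block_id]

    if is_self_attention:
        # Self-attention layers get LoRA processors, whose __call__ understands `scale`.
        procs[name] = LoRAAttnProcessor(hidden_size=hidden_size, rank=4)
    else:
        # Cross-attention layers get Custom Diffusion processors, which do not.
        procs[name] = CustomDiffusionAttnProcessor(
            train_kv=True,
            train_q_out=False,
            hidden_size=hidden_size,
            cross_attention_dim=cross_attention_dim,
        )

unet.set_attn_processor(procs)

# `scale` is meant only for the LoRA processors, but cross_attention_kwargs is
# forwarded to every processor, so the first Custom Diffusion layer raises
# TypeError: __call__() got an unexpected keyword argument 'scale'
pipe("a photo of a cat", num_inference_steps=1, cross_attention_kwargs={"scale": 0.5})
```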

Describe the solution you'd like

  1. The easiest solution is to drop excess kwargs for attention processors that do not accept them (a sketch follows this list). The downside is that silently dropped kwargs can hide bugs, e.g. a misspelled kwarg would be ignored instead of raising.
  2. Add a flag to indicate whether excess kwargs are expected and may be dropped. The downside is that this looks a bit too ad hoc.
  3. Add support for attention kwargs that also specify the layers or processor types they apply to. The downside is a more complicated design and a lot of work, since every pipeline might need to be modified.
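
For illustration, option 1 could look roughly like the wrapper below. KwargFilteringProcessor is hypothetical, not an existing diffusers class; it inspects each processor's __call__ signature and drops anything the processor does not accept:

```python
import inspect

import torch


class KwargFilteringProcessor(torch.nn.Module):
    """Hypothetical wrapper: forwards only the kwargs a processor's __call__ accepts."""

    def __init__(self, processor):
        super().__init__()
        self.processor = processor
        params = inspect.signature(processor.__call__).parameters
        # If the wrapped processor already takes **kwargs, forward everything unchanged.
        self.accepts_any = any(
            p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()
        )
        self.accepted = set(params)

    def __call__(self, attn, hidden_states, **kwargs):
        if not self.accepts_any:
            kwargs = {k: v for k, v in kwargs.items() if k in self.accepted}
        return self.processor(attn, hidden_states, **kwargs)


# Reusing the `procs` dict from the sketch above:
# unet.set_attn_processor({n: KwargFilteringProcessor(p) for n, p in procs.items()})
```

Option 3 would instead make the target explicit in the kwargs themselves, e.g. something like cross_attention_kwargs={'lora': {'scale': 0.5}} keyed by processor type or layer name, which avoids silent drops but requires every pipeline (and the attention forward pass) to understand and route the nested structure.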

Additional context
A similar issue was raised in this comment: #1639 (comment), but it did not get enough attention.
