why we use torch.baddbmm to do query @ key? https://github.com/huggingface/diffusers/blob/main/src/diffusers/models/attention.py#L640