Skip to content

[Bug]: Automatic Prefix Caching doesn't support Nvidia Turing Arch. #3687

@esmeetu

Description

@esmeetu

🐛 Describe the bug

triton==2.1.0 doesn't support Turing arch, and has been fixed in triton-lang/triton#2364
Upgrade to triton==2.2.0 will resolve this issue.

Perhaps this could be planned after #3442.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions