Skip to content

Conversation

rasmith
Copy link

@rasmith rasmith commented Jan 21, 2025

Support for int8 models is broken due to this change: vllm-project#11785

I added TritonScaledMMLinearKernel and it seems to work. I have the change upstream, but adding it here, since trying to fix for us ASAP.

Copy link

This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!

@github-actions github-actions bot added the stale label Apr 22, 2025
Copy link

This pull request has been automatically closed due to inactivity. Please feel free to reopen if you intend to continue working on it. Thank you!

@github-actions github-actions bot closed this May 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant