-
Notifications
You must be signed in to change notification settings - Fork 306
Open
Labels
compressed-tensorsRelates to compressed-tensorsRelates to compressed-tensorsenhancementNew feature or requestNew feature or requestgood first issueA good first issue for users wanting to contributeA good first issue for users wanting to contributegood follow-up issueA good issue for users with some familiarity of the codebaseA good issue for users with some familiarity of the codebase
Description
Is your feature request related to a problem? Please describe.
- For the packed compressor, when applying asymmetric quantization, the zero-point is also packed. There is however, no support for decompressing / unpacking the zero-point. We need to update the compressor to support zero-point unpaacking / decompression
Describe the solution you'd like
- Support zero-point decompression if running asymmetric quantization using the packed-compressor
- Quantized compressors only support weight-decompression, as seen here: https://github.com/neuralmagic/compressed-tensors/blob/c69c8bbf6ba93ecb8d86b8837e86b7abda8568af/src/compressed_tensors/compressors/quantized_compressors/base.py#L211
- Add tests to validate this case - you should be able to load this model through AutoModelForCasualLM and run generations without issue
aladerran
Metadata
Metadata
Labels
compressed-tensorsRelates to compressed-tensorsRelates to compressed-tensorsenhancementNew feature or requestNew feature or requestgood first issueA good first issue for users wanting to contributeA good first issue for users wanting to contributegood follow-up issueA good issue for users with some familiarity of the codebaseA good issue for users with some familiarity of the codebase