Qwen2: assume tied weights if lm_head/output weights is missing #6738

jklj077 · 2024-04-18T09:55:54Z

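This PR adds proper support for Qwen2-0.5B, which uses tied word embeddings: when the lm_head/output weights are missing from the checkpoint, the token embedding weights are assumed to serve as the output weights. Example config: https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat/blob/main/config.json#L21.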
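
For readers unfamiliar with tied embeddings, the fallback described in the title can be sketched roughly as follows. This is a minimal illustration in plain Python, not the code from this PR: the helper names `resolve_output_weights` and `logits` are hypothetical, and the tensor keys `token_embd.weight` and `output.weight` are used here only as illustrative labels for the embedding and output matrices.

```python
from typing import Dict, List

Matrix = List[List[float]]  # one row per vocabulary entry


def resolve_output_weights(tensors: Dict[str, Matrix]) -> Matrix:
    """Pick the output projection, assuming tied weights if it is absent.

    Hypothetical helper: the key names are illustrative assumptions.
    """
    if "output.weight" in tensors:
        # The checkpoint ships a separate output matrix: use it.
        return tensors["output.weight"]
    # No output matrix: assume it is tied to the token embedding.
    return tensors["token_embd.weight"]


def logits(hidden: List[float], out_w: Matrix) -> List[float]:
    """Project a hidden state onto the vocabulary (one dot product per row)."""
    return [sum(h * w for h, w in zip(hidden, row)) for row in out_w]


# A toy "tied" checkpoint: only the embedding matrix is present.
tied = {"token_embd.weight": [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]}

# A toy "untied" checkpoint: a separate output matrix is present.
untied = {
    "token_embd.weight": tied["token_embd.weight"],
    "output.weight": [[2.0, 0.0], [0.0, 2.0], [2.0, 2.0]],
}

tied_logits = logits([0.5, 0.25], resolve_output_weights(tied))
untied_logits = logits([0.5, 0.25], resolve_output_weights(untied))
```

In the tied case the same matrix is used both to look up token embeddings and to produce logits, which is why a checkpoint for such a model can legitimately omit the output tensor.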

Previous attempt: #6578

Qwen2: assume tied weights if lm_head/output weights is missing

c7ab76e

slaren approved these changes Apr 18, 2024

ggerganov merged commit e11b2e6 into ggml-org:master Apr 18, 2024
49 of 58 checks passed