Fix YaRN ramp calculation and add --yarn-orig-ctx #2

jquesnelle · 2023-10-20T04:42:22Z

This fixes a subtle bug in the YaRN implementation. When calculating the linear ramp, we're attempting to replicate this code:

if min == max:
        max += 0.001  # Prevent singularity

    linear_func = (torch.arange(dim, dtype=torch.float32) - min) / (max - min)

So, when min == max, we want max - min = 0.001. The code currently calculates a particular entry of linear_func as

const float y = (i0 / 2 - low) / min(0.001f, high - low);

But, when high - low == 0, min(0.001, 0) = 0, not 0.001. The fix is to change the min to a max.

I've also added in the code to be able to set --yarn-orig-ctx from the command line, so that models such as TheBloke/Yarn-Llama-2-7B-64K-GGUF which were converted without the GGUF YaRN keys in them can still be used (if the correct values are passed on the command line).

…he command line

* vvhg-code-infill (#1) * infill in separate example (#2) * reverted changes to main and added infill example * cleanup * naming improvement * make : add missing blank line * fix missing semicolon * brought infill up to current main code * cleanup --------- Co-authored-by: Cebtenzzre <[email protected]>

jquesnelle added 5 commits October 19, 2023 13:13

fix yarn mscale calculation due to inverted freq_scale

b8e3759

put rope.scaling.type into gguf (and fix loading it)

27a81af

fix yarn ramp and only apply when yarn enabled

57f0291

make yarn original context size (--yarn-orig-ctx) configurable from t…

68bd40a

…he command line

Merge remote-tracking branch 'cebtenzzre/ntkv2' into cebtenzzre-ntkv2

f51eed1

jquesnelle mentioned this pull request Oct 20, 2023

llama: implement YaRN RoPE scaling ggml-org/llama.cpp#2268

Merged

5 tasks

cebtenzzre merged commit 14cf93b into cebtenzzre:ntkv2 Oct 20, 2023

cebtenzzre mentioned this pull request Nov 1, 2023

llama : fix llama_context_default_params after #2268 ggml-org/llama.cpp#3893

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix YaRN ramp calculation and add --yarn-orig-ctx #2

Fix YaRN ramp calculation and add --yarn-orig-ctx #2

Uh oh!

jquesnelle commented Oct 20, 2023

Uh oh!

Uh oh!

Fix YaRN ramp calculation and add --yarn-orig-ctx #2

Fix YaRN ramp calculation and add --yarn-orig-ctx #2

Uh oh!

Conversation

jquesnelle commented Oct 20, 2023

Uh oh!

Uh oh!