scripts: fix pattern and get n_tokens in one go #10221

Merged 1 commit into ggml-org:master on Nov 9, 2024

Conversation

@lhpqaq (Contributor) commented Nov 8, 2024

Implement a TODO and fix #10219.
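
The diff itself isn't quoted in this conversation; as a rough sketch only, the kind of change the title describes replaces two separate passes over the llama-cli log with a single extraction, so the token count is captured "in one go" (all variable and pattern names below are hypothetical, not the actual PR code):

    # Hypothetical sketch only; not the actual diff from this PR.
    # Before: two separate greps over the log tail; each can fail on its
    # own when its pattern no longer matches current llama-cli output.
    session_size_msg=$(tail -n30 "$LOG" | grep -oE "$SESSION_SIZE_PATTERN")
    sample_time_msg=$(tail -n30 "$LOG" | grep -oE "$SAMPLE_TIME_PATTERN")

    # After: a single grep with an updated pattern that captures the
    # token counts in one pass, with an explicit error if parsing fails.
    if ! n_tokens_msg=$(tail -n30 "$LOG" | grep -oE "$N_TOKENS_PATTERN"); then
        echo >&2 "Couldn't get number of tokens from ./llama-cli output!"
        exit 1
    fi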

@lhpqaq (Contributor, Author) commented Nov 8, 2024

@slaren @ggerganov PTAL~

@reinvantveer commented Nov 8, 2024

Can confirm that this resolves the error message and exit 1 status reported in #10219.

@reinvantveer commented

Thank you, @lhpqaq!

@lhpqaq (Contributor, Author) commented Nov 8, 2024

@reinvantveer Thanks for your review.

@reinvantveer commented

I'm not sure, but I suspect there's some kind of regression: the script appears to read input where I haven't given any.

@reinvantveer commented

A quote from the "interactive storytelling" I'm trying out with the model:

As you're planning, you hear a noise coming from outside. It sounds like someone is approaching.
Do you:
A) Investigate the noise
B) Ignore it and continue planning
C) Abort the plan
D) Prepare to defend yourself
User:




main: saving final output to session file './chat/rein/current-cache.bin'



User:




main: saving final
main: saving final output to session file './chat/rein/current-cache-bin'



User:




main: saving final output to session file './chat/rein/current-cache-bin'



User:




main: saving final output to session file './chat/rein/current-cache-bin'



User:

main: saving final output to session file './chat/rein/current-cache.bin'



User: A
ChatLLaMa:  You decide to investigate the noise. You and Samantha carefully make your way to the door and listen intently. The noise sounds like footsteps, but they're light and cautious. It's clear that whoever it is, they're trying not to be seen.

@reinvantveer commented

> A quote from the "interactive storytelling"

I'm unsure where these extra `main: saving final output to session file` messages originate, whether it's something introduced by the changes or a different bug.

@lhpqaq (Contributor, Author) commented Nov 8, 2024

> I'm unsure where these extra `main: saving final output to session file` messages originate, whether it's something introduced by the changes or a different bug.

This comes from llama-cli (main.cpp); this PR only modifies the regular-expression pattern matching in the script.

@reinvantveer commented

> this PR only modifies the regular-expression pattern matching in the script

I see. I switched back to the master branch, and my chat is now stuck in an infinite loop of:

User:


main: saving final output to session file './chat/rein2/current-cache.bin'



User:


main: saving final output to session file './chat/rein2/current-cache.bin'



User:


main: saving final output to session file './chat/rein2/current-cache.bin'



User:

main: saving final output to session file './chat/rein2/current-cache.bin'

... [etc]

@lhpqaq (Contributor, Author) commented Nov 8, 2024

After each conversation, the script runs inference once in the background, but it only redirects standard error to the log, while the `saving final output to session file` message is written to standard output:

    # Update cache for next prompt in background, ideally during user input
    ./llama-cli >>"$LOG_BG" 2>&1 "${OPTS[@]}" \
          --prompt-cache "$NEXT_PROMPT_CACHE" \
          --file "$NEXT_PROMPT_FILE" \
          --n_predict 1 &

@reinvantveer There are indeed bugs here.
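
For reference, the difference between the two redirection forms matters here (standard shell semantics, not code quoted from this PR; `...` stands in for the remaining options):

    # Only stderr is appended to the log; anything printed on stdout
    # still reaches the terminal and interleaves with the chat.
    ./llama-cli 2>>"$LOG_BG" ... &

    # Both streams go to the log: stdout is appended to $LOG_BG first,
    # then stderr is redirected to wherever stdout now points.
    ./llama-cli >>"$LOG_BG" 2>&1 ... &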

@ggerganov merged commit 8fc393f into ggml-org:master on Nov 9, 2024. 1 check passed.
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

Successfully merging this pull request may close these issues:

Bug: Couldn't get number of tokens from ./llama-cli output! (#10219)