
how to set this chat_template in server? #5974


Closed
wac81 opened this issue Mar 10, 2024 · 8 comments
Labels
enhancement New feature or request stale

Comments

@wac81

wac81 commented Mar 10, 2024

How do I set this chat_template for openchat?
I noticed the output differs between ./server and python -m llama_cpp.server, so I suspect a different chat_template may be the cause.

openchat chat_template:
Using gguf chat template: {{ bos_token }}{% for message in messages %}{{ 'GPT4 Correct ' + message['role'].title() + ': ' + message['content'] + '<|end_of_turn|>'}}{% endfor %}{% if add_generation_prompt %}{{ 'GPT4 Correct Assistant:' }}{% endif %}
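
For context, this is an ordinary Jinja template. Rendered outside the server (a quick sketch using the jinja2 package directly, with purely illustrative values), it produces the GPT4 Correct prompt format:

```python
from jinja2 import Template

template = Template(
    "{{ bos_token }}{% for message in messages %}"
    "{{ 'GPT4 Correct ' + message['role'].title() + ': ' + message['content'] + '<|end_of_turn|>' }}"
    "{% endfor %}"
    "{% if add_generation_prompt %}{{ 'GPT4 Correct Assistant:' }}{% endif %}"
)

print(template.render(
    bos_token="<s>",  # illustrative; the real value comes from the tokenizer
    messages=[{"role": "user", "content": "Hello"}],
    add_generation_prompt=True,
))
# -> <s>GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant:
```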

How do I set the chat_template in ./server with --chat-template?

@wac81 wac81 added the enhancement New feature or request label Mar 10, 2024
@phymbert phymbert assigned phymbert and ngxson and unassigned phymbert Mar 10, 2024
@phymbert
Collaborator

Have a look at https://github.com/ggerganov/llama.cpp/wiki/Templates-supported-by-llama_chat_apply_template

@ngxson
Collaborator

ngxson commented Mar 10, 2024

We don’t support custom chat templates for now, but you can use the /completions endpoint (not /chat/completions) and send the chat already formatted with your own template.
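
For illustration, a minimal sketch of that workaround using Python's requests, assuming the server is listening on the default port 8080 and that its plain completion endpoint accepts a JSON body with prompt / n_predict / stop fields (adjust the URL and fields to your build):

```python
import requests

# Apply the openchat "GPT4 Correct" template by hand, since the server
# will not render a custom chat template for us on this endpoint.
messages = [{"role": "user", "content": "Hello, who are you?"}]
prompt = "".join(
    "GPT4 Correct " + m["role"].title() + ": " + m["content"] + "<|end_of_turn|>"
    for m in messages
) + "GPT4 Correct Assistant:"

resp = requests.post(
    "http://localhost:8080/completion",  # assumed default host/port and endpoint
    json={"prompt": prompt, "n_predict": 256, "stop": ["<|end_of_turn|>"]},
)
print(resp.json()["content"])
```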

@wac81
Author

wac81 commented Mar 10, 2024

An extension of that problem: ./server and python -m llama_cpp.server give inconsistent results, and ./server is noticeably worse and less controllable. As far as I can tell, the python -m startup path uses the GGUF's chat_template; I don't know if I'm right, but the results are very different. I am using openchat-0106.

@wac81
Author

wac81 commented Mar 10, 2024

> Have a look at https://github.com/ggerganov/llama.cpp/wiki/Templates-supported-by-llama_chat_apply_template

I've definitely seen the link, but thanks anyway

@teleprint-me
Contributor

You can manually set the chat template using gguf-py. Look at set-metadata.py in examples. I would go into more detail, but I have to go to work. I can post later if you're still having trouble understanding how to do it.
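
To see what you'd be changing, here is a minimal read-only sketch with gguf-py's GGUFReader (assuming the field layout in current gguf-py versions; the model filename is a placeholder). The template is stored in the GGUF metadata under the tokenizer.chat_template key, which is what the metadata script would overwrite:

```python
from gguf import GGUFReader  # pip install gguf

KEY = "tokenizer.chat_template"

reader = GGUFReader("openchat-3.5-0106.Q4_K_M.gguf")  # placeholder path
field = reader.fields.get(KEY)
if field is None:
    print(f"{KEY} is not set in this file")
else:
    # String values are stored as raw bytes in one of the memory-mapped parts.
    raw = field.parts[field.data[0]]
    print(bytes(raw).decode("utf-8"))
```

Note that replacing the value generally means rewriting the file (the string length can change), so expect to produce a new GGUF rather than patching the original in place.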

@ngxson
Collaborator

ngxson commented Mar 10, 2024

@teleprint-me He's using a custom, self-made chat template: {{ 'GPT4 Correct ' + message['role'].title()

This is not one of the common chat templates supported by llama.cpp, so it won't work. Please see the discussion: #5922 (comment)

@teleprint-me
Contributor

Yes, I understand how the template works. It doesn't change anything. My point stands.

@github-actions github-actions bot added the stale label Apr 10, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.
