
Support specifying a template in the OpenAI server #1263


Closed · kai01ai wants to merge 2 commits

Conversation

kai01ai commented on Oct 5, 2023

Currently, the OpenAI API server uses `request.model` as the template name. This approach isn't convenient when deploying custom models (e.g., a custom fine-tuned chat model that uses the chatgpt/llama2 template).

This patch addresses the issue by introducing a `template` field in `CompletionRequest`/`ChatCompletionRequest`. If the `template` field isn't specified, it will default to the value of `model` to maintain backward compatibility.
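
For illustration, a rough sketch of the idea (not the exact diff; all fields besides `model` and the proposed `template` are omitted):

```python
# Rough sketch of the proposed change, assuming the Pydantic request
# models used by the OpenAI-compatible server; other fields omitted.
from typing import Optional

from pydantic import BaseModel


class ChatCompletionRequest(BaseModel):
    model: str
    template: Optional[str] = None  # proposed optional field
    # ... messages, temperature, etc. omitted for brevity


def get_template_name(request: ChatCompletionRequest) -> str:
    # Backward compatible: an unspecified `template` defaults to `model`.
    return request.template if request.template is not None else request.model
```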

Tostino (Contributor) commented on Oct 9, 2023

Well, it looks like we are working on somewhat related features. Not exactly the same thing, but these would be incompatible if both PRs were accepted right now.

#1294

@kai01ai
Copy link
Author

kai01ai commented Oct 9, 2023

@Tostino Your implementation does offer more flexibility. However, I personally prefer the prompt templating introduced in transformers 4.34.0. It would allow us to avoid depending on fastchat and to store custom templates in the tokenizer config files. Admittedly, that represents a significant change.
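
For reference, a minimal example of that transformers feature (the model name here is just illustrative; any tokenizer that ships a chat template works the same way):

```python
# Illustration of transformers >= 4.34 chat templating; the template is
# read from the model's tokenizer_config.json, so fastchat isn't needed.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")

messages = [{"role": "user", "content": "Hello, how are you?"}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```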

Tostino (Contributor) commented on Oct 9, 2023

Right. I agree something like that is a much better solution... but I am unable to use my model with vLLM right now without patching it, and that was the solution suggested to me on Discord, so I went with it and submitted a PR because it is useful "right now". I'd expect it to be replaced once an implementation of HF prompt templates is in place, and I very much look forward to this madness being solved.

Tostino (Contributor) commented on Oct 16, 2023

Just FYI, I am going to work on a PR to support HF tokenizer prompt templating. I closed my other one since it hadn't been accepted yet.

kai01ai (Author) commented on Oct 16, 2023

Very nice! I will close this PR since the HF tokenizer prompt templating work is underway.

kai01ai (Author) commented on Dec 4, 2023

Closed in favor of #1756. Many thanks to @Tostino for the contribution.

kai01ai closed this on Dec 4, 2023