server: enable token array inputs for OAI API #15001
Merged
According to the OpenAI documentation, formatting the prompt as an array of tokens is supported. However, the llama.cpp server raises an error if you provide such input. I assume the reason is that the interpretation of tokens depends on the model, so this would not be "OpenAI compatible" either way. Still, I have a use case where I need such inputs. This PR simply removes the error in the llama.cpp server. I don't think this will cause issues, but my understanding of the server code is also relatively limited.
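For context, a minimal sketch of what such a request could look like against the OAI-compatible completions endpoint, assuming a llama.cpp server listening locally on port 8080; the model name and token IDs are placeholders whose textual meaning depends entirely on the model's tokenizer:

```python
import requests

# Send the prompt as an array of token IDs instead of a string.
# The IDs below are arbitrary placeholders.
response = requests.post(
    "http://localhost:8080/v1/completions",
    json={
        "model": "placeholder-model",
        "prompt": [1, 3087, 4320, 278, 1234],
        "max_tokens": 16,
    },
)
print(response.json())
```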
I'm currently working on benchmarking llama.cpp vs. vllm. Both projects provide an OAI-compatible API, so I want to make `scripts/server-bench.py` use the OAI-compatible API instead of the llama.cpp-specific API in order to run the exact same code when benchmarking either project. Under these circumstances I want to be able to send prompts of an exact length (in tokens), while the interpretation of those prompts as text is irrelevant. A sketch of how such prompts could be generated is shown below.