server : fix crash when using verbose output with input tokens that are not in printable range (#12178) #12338

ishaangandhi · 2025-03-11T16:15:19Z

This PR fixes #12178.

Previously, when calling /completions with a token outside of the token vocabulary range, you would get this exception:

terminate called after throwing an instance of 'std::out_of_range'
  what():  vector::_M_range_check: __n (which is 18446744073709551615) >= this->size() (which is 151936)

Now, the server continues to run, and the user gets a useful error:

{"error":{"code":400,"message":"Prompt contains invalid tokens","type":"invalid_request_error"}}%

common/common.cpp

ngxson

Bonus: would be nice if you can add a test case for it, see server/tests/test_tokenize.py

examples/server/server.cpp

ishaangandhi · 2025-03-12T13:29:04Z

@ngxson are you OK to merge this? I rebased the changes.

re: adding a test - I think test_tokenize tests the /tokenize and /detokenize endpoints, which invoke a different path than what we fix in this PR.

How about we merge this as-is?

ngxson · 2025-03-13T10:08:38Z

I only refer to that file so you can see how the code looks like. Adding test should be easy and it is a good way to show that your code actually work.

…re not in printable range (ggml-org#12178) (ggml-org#12338) * Fix DOS index bug * Remove new APIs * remove extra line * Remove from API * Add extra newline * Update examples/server/server.cpp --------- Co-authored-by: Xuan-Son Nguyen <[email protected]>

ishaangandhi requested a review from ngxson as a code owner March 11, 2025 16:15

ngxson reviewed Mar 11, 2025

View reviewed changes

common/common.cpp Outdated Show resolved Hide resolved

github-actions bot added examples server labels Mar 11, 2025

ngxson approved these changes Mar 11, 2025

View reviewed changes

examples/server/server.cpp Outdated Show resolved Hide resolved

ishaangandhi requested a review from JohannesGaessler as a code owner March 12, 2025 13:15

ishaangandhi and others added 6 commits March 12, 2025 09:16

Fix DOS index bug

d8485ad

Remove new APIs

1320897

remove extra line

761f4d9

Remove from API

2e48a6d

Add extra newline

cc53039

Update examples/server/server.cpp

1e14b14

ishaangandhi force-pushed the fix-dos-index branch from f0d9d04 to 1e14b14 Compare March 12, 2025 13:16

ngxson changed the title ~~bugfix: Prevent DOS when using verbose output with input tokens that are not in printable range (#12178)~~ server : fix crash when using verbose output with input tokens that are not in printable range (#12178) Mar 13, 2025

ngxson merged commit 2048b59 into ggml-org:master Mar 13, 2025
47 checks passed

ngxson removed the request for review from JohannesGaessler March 13, 2025 10:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server : fix crash when using verbose output with input tokens that are not in printable range (#12178) #12338

server : fix crash when using verbose output with input tokens that are not in printable range (#12178) #12338

Uh oh!

ishaangandhi commented Mar 11, 2025

Uh oh!

Uh oh!

ngxson left a comment

Uh oh!

Uh oh!

ishaangandhi commented Mar 12, 2025

Uh oh!

ngxson commented Mar 13, 2025

Uh oh!

Uh oh!

Uh oh!

server : fix crash when using verbose output with input tokens that are not in printable range (#12178) #12338

server : fix crash when using verbose output with input tokens that are not in printable range (#12178) #12338

Uh oh!

Conversation

ishaangandhi commented Mar 11, 2025

Uh oh!

Uh oh!

ngxson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ishaangandhi commented Mar 12, 2025

Uh oh!

ngxson commented Mar 13, 2025

Uh oh!

Uh oh!

Uh oh!