-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Tokenizer SPM fixes for phi-3 and llama-spm (bugfix) #7425
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
This reverts commit 7fb66eb.
I reverted the changes that I made to the server tests - these should no longer be necessary |
* Update brute force test: add_special * Update brute force test: default values for add_bos_token and add_eos_token * Enable rtrim when pre-inserting BOS Co-authored-by: Georgi Gerganov <[email protected]> * Revert "server : fix test regexes"
On main branch this caused a failure under 'CI / macOS-latest-cmake-arm64 (push) Failing after 2m' for this test
|
It's not related - some of the CPY tests occasionally produce numerical differences that slightly exceed the epsilon threshold:
The difference is due to different rounding mode between the CPU / GPU: #4698 |
* Update brute force test: add_special * Update brute force test: default values for add_bos_token and add_eos_token * Enable rtrim when pre-inserting BOS Co-authored-by: Georgi Gerganov <[email protected]> * Revert "server : fix test regexes"
Bugfixing #7375 : Need rtrim after the pre-inserted BOS. More details here.