Commit 10acf90

ggerganov authored and teleprint-me committed
tests : add test-tokenizer-0.sh + fix some tokenizers (ggml-org#7036)
* tests : add test-tokenizer-0.sh
* unicode : add all unicode number ranges
* starcoder : fix pre-tokenizer
* tests : add test that fails with DeepSeek tokenizers
* falcon : fix regex
* unicode : regenerate unicode tables
* refact : add tokenizer model
* lint : fix
* tests : disable failing tests ggml-ci
* refact : add tests files ggml-ci
* convert : print -> logging ggml-ci
* lint : fix
* unicode : digit -> number
* phi-3 : update
1 parent 7925744 commit 10acf90

1 file changed: +1 -1 lines changed

tests/test-tokenizer-0.py

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
-import time
 import argparse
+import time
 
 from transformers import AutoTokenizer
