Avoid calls to tokenizer.added_tokens_decoder #12473
Merged
tokenizer.added_tokens_decoder rebuilds and returns a fresh dict on every access, and each call is relatively slow (~0.04s on average), which results in massive slowdowns when there is a huge number of added tokens:
https://github.com/huggingface/transformers/blob/9be4728af8bec48073ae841881d7f4e2ac3521d1/src/transformers/tokenization_utils_fast.py#L264
Typically this slowdown is imperceptible, but for a model like ByteCraft with 100,000 added tokens and two property accesses per token, it suddenly adds 0.04 * 2 * 100,000 = 8,000 seconds of extra time spent processing the tokens: https://huggingface.co/SamsungSAILMontreal/ByteCraft/blob/main/added_tokens.json
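For illustration, a minimal sketch of how the per-access cost can be observed (the model name comes from the link above; the timing loop and variable names here are mine, not part of this PR):

```python
import timeit

from transformers import AutoTokenizer

# Loading this tokenizer is itself slow (~2 minutes per the description below).
tokenizer = AutoTokenizer.from_pretrained("SamsungSAILMontreal/ByteCraft")

# Each access rebuilds the dict of added tokens from scratch, so the cost
# scales with the number of added tokens (~100,000 here).
per_access = timeit.timeit(lambda: tokenizer.added_tokens_decoder, number=10) / 10
print(f"~{per_access:.3f}s per access to added_tokens_decoder")
```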
This fix removes the slowdown entirely by calling the property only once at the start and reusing the result (the initial tokenizer load is still slow at ~2 minutes, but that is at least workable).
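The pattern boils down to hoisting the property access out of any per-token loop and doing lookups against a single snapshot; the sketch below is illustrative, not the actual diff in tokenization_utils_fast.py:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("SamsungSAILMontreal/ByteCraft")
token_ids = [0, 1, 2]  # hypothetical ids to look up

# Before: a fresh dict is rebuilt on every iteration -> len(token_ids) rebuilds.
# tokens = [tokenizer.added_tokens_decoder.get(i) for i in token_ids]

# After: snapshot the dict once, then do cheap lookups against it.
added_tokens_decoder = tokenizer.added_tokens_decoder
tokens = [added_tokens_decoder.get(i) for i in token_ids]
```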