Unable to convert bloom models #4768


Closed
JerryKwan opened this issue Jan 4, 2024 · 10 comments
Labels
invalid This doesn't seem right

Comments

@JerryKwan

When trying to convert a Bloom model downloaded from Hugging Face (https://huggingface.co/bigscience/bloomz-1b7) using the following command

python3.10 convert.py /root/bloomz-1b7/

it outputs the following messages

Loading model file /root/bloomz-1b7/model.safetensors
Traceback (most recent call last):
  File "/root/workspace/llama.cpp/convert.py", line 1295, in <module>
    main()
  File "/root/workspace/llama.cpp/convert.py", line 1234, in main
    params = Params.load(model_plus)
  File "/root/workspace/llama.cpp/convert.py", line 318, in load
    params = Params.loadHFTransformerJson(model_plus.model, hf_config_path)
  File "/root/workspace/llama.cpp/convert.py", line 237, in loadHFTransformerJson
    raise Exception("failed to guess 'n_ctx'. This model is unknown or unsupported.\n"
Exception: failed to guess 'n_ctx'. This model is unknown or unsupported.
Suggestion: provide 'config.json' of the model in the same directory containing model files.
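
For context on what fails here: at this point convert.py is trying to infer the context length from config.json. A minimal sketch of that kind of fallback logic, assuming a key list of common Hugging Face config names (the exact keys convert.py checks may differ):

```python
import json
from pathlib import Path

def guess_n_ctx(model_dir: str) -> int:
    """Try common Hugging Face config keys for the context length;
    raise if none are present. (The key list is illustrative -- an
    assumption, not necessarily convert.py's exact logic.)"""
    config = json.loads((Path(model_dir) / "config.json").read_text())
    for key in ("max_sequence_length", "max_position_embeddings", "n_ctx"):
        if key in config:
            return config[key]
    raise Exception("failed to guess 'n_ctx'. This model is unknown or unsupported.")
```

If a Bloom config.json does not contain any of the keys the script looks for, that could explain why the guess fails even though the file is present.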

And config.json is in the same directory as the model file.
Does anyone know what caused the problem and how to solve it?

@JerryKwan
Author

And converting the model with bloomz.cpp (https://github.com/NouamaneTazi/bloomz.cpp) using the following command succeeds

python3.10 convert-hf-to-ggml.py /root/bloomz-1b7/ ./models

@ggerganov
Member

Looks related to #4493

@JerryKwan
Author

@ggerganov Thanks for looking into this issue. Is there any ETA for when the problem will be solved?
A large number of users are using Bloom-like models.
Is there anything I can do to help solve the problem?

@ggerganov
Member

@teleprint-me mentioned that they will take a look, but not sure if they had the chance yet. I prefer to rely on the community's help for the Python issues as it is not my field of expertise, but I will take a look if it does not get resolved soon

@JerryKwan
Author

@ggerganov
It seems I can convert bloomz-1b7 successfully using the following command (commit f3f62f0):

python3.10 ./convert-hf-to-gguf.py /root/bloomz-1b7/ 

And the model loads successfully with the following command:

./main -m /root/bloomz-1b7/ggml-model-f16.gguf -n 128

So, there must be something wrong with convert.py, and it should not take too much time to solve. I will dig deeper later.

@player1537

I can confirm that, several weeks ago, I was able to use the convert-hf-to-gguf.py script to convert Bloom-560M to GGUF format. If it helps, I have the converted files for Bloom-560M available on the Hugging Face Hub.

@Galunid
Copy link
Collaborator

Galunid commented Jan 4, 2024

Was Bloom ever supported by convert.py in the first place? I believe that script was meant for llama models (and some derivatives). Bloom used to have a separate script, convert-bloom-hf-to-gguf.py (or something similar), that was then refactored into convert-hf-to-gguf.py.

Exception: failed to guess 'n_ctx'. This model is unknown or unsupported. suggests it wasn't.

@JerryKwan
Author

@player1537 Thanks for the help. I can now convert the model successfully, thank you.

@Galunid I am not sure whether Bloom was supported by convert.py in the first place, but I think it would be better to use convert.py as the main conversion tool, letting it use the functions defined in other modules.

@Galunid Galunid added invalid This doesn't seem right and removed bug-unconfirmed labels Jan 5, 2024
@Galunid
Collaborator

Galunid commented Jan 5, 2024

That's not possible for now, perhaps in the future the scripts will be unified.

@Galunid Galunid closed this as completed Jan 5, 2024
@teleprint-me
Contributor

teleprint-me commented Jan 6, 2024

Sorry I'm late to the party. I've been sick, and I'm just starting to feel functional again compared to the last few days.

@Galunid is right for the most part. My two cents, for clarity: convert.py only handles the llama and gpt architectures. convert-hf-to-gguf.py superseded the separate scripts that previously existed.

The main difference between them is that convert.py uses a custom shim to save memory, loading tensors only as they are needed; convert-hf-to-gguf.py consumes more memory as a result.
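
The lazy-loading idea described above can be sketched like this (a simplified illustration of the technique, not convert.py's actual shim; LazyTensor and eager are hypothetical names):

```python
import numpy as np

class LazyTensor:
    """Defers materializing a tensor until load() is called, so only
    the tensor currently being processed needs to sit in memory.
    (Simplified sketch of the idea, not convert.py's real shim.)"""
    def __init__(self, load_fn, shape, dtype):
        self._load_fn = load_fn  # zero-argument callable returning the array
        self.shape = shape       # metadata is available without loading
        self.dtype = dtype

    def load(self) -> np.ndarray:
        # Materialize the tensor; a real loader would read from disk here.
        return self._load_fn()

def eager(arr: np.ndarray) -> "LazyTensor":
    # Wrap an already-loaded array for testing; real code would wrap
    # a file offset or a memory-mapped safetensors entry instead.
    return LazyTensor(lambda: arr, arr.shape, arr.dtype)
```

A converter built this way can iterate over tensor metadata, write each tensor to the output file, and drop it before loading the next, keeping peak memory close to the size of a single tensor.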

There are other contributors who are more knowledgeable about its functionality than I am. I've been slowly picking it apart, though.

My advice would be to use the intended script, which is the now-unified convert-hf-to-gguf.py. Merging the scripts is not so simple, as I discovered when I began to dig deeper into the torch code base.
