Fix conversion of some BERT embedding models #6937

christianazinn · 2024-04-26T22:33:43Z

BERT-based embedding models must be converted from safetensors/PyTorch format to GGUF with convert-hf-to-gguf.py. The BertModel class was missing the logic that resolves unsupported datatypes, so some models, such as acge_text_embedding, failed with TypeError: Got unsupported ScalarType BFloat16 at the line data = data_torch.squeeze().numpy().

The fix converts unsupported datatypes to f32. Every other model class already does this, so it was probably just missed for BERT models. With this change, conversion and subsequent quantization work as expected, as seen in this GGUF quantization of acge_text_embedding created with this fix.
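For illustration, here is a minimal sketch of the kind of check described above. It is not the exact diff from this PR: the helper name tensor_to_numpy and the SUPPORTED_DTYPES tuple are hypothetical, and the sketch assumes that only f16 and f32 tensors are passed through unchanged.

```python
import numpy as np
import torch

# Assumption for this sketch: f16 and f32 are kept as-is, everything else is upcast.
SUPPORTED_DTYPES = (torch.float16, torch.float32)


def tensor_to_numpy(data_torch: torch.Tensor) -> np.ndarray:
    """Hypothetical helper: upcast unsupported dtypes to f32, then convert to NumPy."""
    if data_torch.dtype not in SUPPORTED_DTYPES:
        # NumPy has no bfloat16 dtype, so calling .numpy() on a bf16 tensor
        # raises the TypeError quoted above. Upcasting to f32 avoids it.
        data_torch = data_torch.to(torch.float32)
    return data_torch.squeeze().numpy()


# A bf16 tensor, standing in for a weight from a model such as acge_text_embedding.
bf16_weight = torch.ones(1, 4, dtype=torch.bfloat16)
converted = tensor_to_numpy(bf16_weight)
print(converted.dtype, converted.shape)
```

Tensors that are already f16 or f32 take the same path as before, which is consistent with models that do not use bf16 converting to the same output with or without the check.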

It's also literally just two lines of code, so unless converting tensors in BERT models specifically is undesirable, I hope this is an easy fix.

christianazinn · 2024-04-28T02:43:01Z

Fixed whitespace issues, would like a review.

iamlemec · 2024-04-28T03:42:36Z

Makes sense to me. Can confirm that conversion of acge_text_embedding works and the model outputs accurate results. Also checked that bge-base-en-v1.5 and nomic-embed-text-v1.5 give identical converted GGUFs between this PR and master.

Convert unsupported datatypes to f32 when converting BERT architectures to GGUF

5ae78a1

christianazinn force-pushed the fix-bfloat16 branch from ddd5289 to 5ae78a1 on April 27, 2024 18:29

slaren approved these changes Apr 28, 2024

ggerganov merged commit 3055a41 into ggml-org:master Apr 29, 2024
22 checks passed

nopperl pushed a commit to nopperl/llama.cpp that referenced this pull request May 5, 2024

convert : fix conversion of some BERT embedding models (ggml-org#6937)

675e3cb
