
Not unload a model from VRAM anymore. #434


Closed
berkut1 opened this issue Jun 29, 2023 · 2 comments

Comments


berkut1 commented Jun 29, 2023

The latest version, 0.1.66, no longer unloads the model from VRAM.
I thought the problem was with oobabooga/text-generation-webui (oobabooga/text-generation-webui#2920), but after digging into the code of both projects, I think the problem is in this library.
I think a call to the new llama_free_model function should be added here:

def __del__(self):

Sorry if I'm wrong; I don't work with Python.
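
For illustration, a minimal sketch of what that cleanup could look like, assuming the low-level bindings expose llama_free (for the context) and the new llama_free_model (for the model weights), and that the wrapper keeps its handles in self.ctx and self.model (illustrative attribute names, not necessarily the library's actual ones):

import llama_cpp

class Llama:
    # ... loading code elsewhere is assumed to set self.model and self.ctx ...

    def __del__(self):
        # Free the context first, then the model weights, mirroring the
        # creation order (model loaded first, context created from it).
        if getattr(self, "ctx", None) is not None:
            llama_cpp.llama_free(self.ctx)
            self.ctx = None
        if getattr(self, "model", None) is not None:
            llama_cpp.llama_free_model(self.model)
            self.model = None

Without the llama_free_model call, only the context is released, so the VRAM backing the weights never gets returned.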

abetlen (Owner) commented Jun 29, 2023

@berkut1 thank you for catching that, I will publish a new version shortly.


iactix commented Jun 30, 2023

It still only "deletes" the model, though; VRAM usage still ends up a lot higher than before the model was loaded.
#223
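
For anyone wanting to reproduce that measurement, here is a hedged sketch of one way to compare VRAM usage before loading and after deleting the model, using pynvml; the model path and constructor arguments are illustrative:

import gc
import pynvml
from llama_cpp import Llama

def vram_used_mib(device_index=0):
    # Query current VRAM usage on the given GPU via NVML.
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)
    used = pynvml.nvmlDeviceGetMemoryInfo(handle).used
    pynvml.nvmlShutdown()
    return used // (1024 * 1024)

before = vram_used_mib()
llm = Llama(model_path="./model.bin", n_gpu_layers=40)  # illustrative path/arguments
del llm
gc.collect()
after = vram_used_mib()
# If the leak described above is present, `after` stays well above `before`.
print(f"VRAM before: {before} MiB, after load + delete: {after} MiB")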

antoine-lizee pushed a commit to antoine-lizee/llama-cpp-python that referenced this issue Oct 30, 2023
* File load progress reporting

* Move llama_progress_handler into llama_context_params

* Renames

* Use seekg to find file size instead

* More correct load progress

* Call progress callback more frequently

* Fix typo
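
For context, a hedged sketch of how such a load-progress callback might be consumed from the Python side, assuming the bindings mirror the commit above and expose progress_callback / progress_callback_user_data fields on llama_context_params (in other revisions these fields may live elsewhere, e.g. on the model parameters):

import ctypes
import llama_cpp

# Callback signature assumed from the C API of that era:
# void (*llama_progress_callback)(float progress, void *user_data)
ProgressCallback = ctypes.CFUNCTYPE(None, ctypes.c_float, ctypes.c_void_p)

@ProgressCallback
def on_progress(progress, user_data):
    # progress runs from 0.0 to 1.0 while the model file is being read.
    print(f"model load progress: {progress * 100:.0f}%")

params = llama_cpp.llama_context_default_params()
params.progress_callback = on_progress          # field names assumed from the commit
params.progress_callback_user_data = None
# params would then be passed to the model/context loading call as usual.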