# dataesr/llm-finetuning

## Build and push the Docker images

```bash
make docker-build-finetuning   # build the fine-tuning image
make docker-build-inference    # build the inference image

make docker-push-finetuning    # push the fine-tuning image
make docker-push-inference     # push the inference image
```
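The `make` targets are thin wrappers around plain `docker build` / `docker push`. As a rough sketch of what they presumably run (the Dockerfile names and the `latest` tag are assumptions, not read from the Makefile):

```bash
# Hypothetical equivalents of the make targets; Dockerfile names are assumptions
docker build -t ghcr.io/dataesr/llm-finetuning:latest -f Dockerfile.finetuning .
docker build -t ghcr.io/dataesr/llm-inference:latest  -f Dockerfile.inference .

docker push ghcr.io/dataesr/llm-finetuning:latest
docker push ghcr.io/dataesr/llm-inference:latest
```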

## Launch a fine-tuning job on OVHcloud

```bash
ovhai job run --gpu 1 \
  --volume llm-datasets@1azgra:/workspace/datasets:ro \
  --volume llm-jobs@1azgra:/workspace/jobs:rwd \
  --env HF_TOKEN=<huggingface_token> \
  --env WANDB_KEY=<wandb_key> \
  --env WANDB_PROJECT=<wandb_project> \
  --env WANDB_NAME=<wandb_name> \
  ghcr.io/dataesr/llm-finetuning:latest \
  -- uv run main.py \
    --mode <train|push> \
    --model_name <huggingface_model_name> \
    --pipeline <causallm|custom> \
    --dataset_name <dataset_name_from_object_storage> \
    --hf_hub <huggingface_hub_id> \
    --hf_hub_private \
    --push_model_dir <dir_of_finetuned_model>
```

`--mode` is one of `train` or `push`, and `--pipeline` is one of `causallm` or `custom`; the remaining `<...>` values are placeholders for your own tokens, names and paths.

Examples:

- **Simple fine-tuning**

  ```bash
  ovhai job run --gpu 1 \
    --volume llm-datasets@1azgra:/workspace/datasets:ro \
    --volume llm-jobs@1azgra:/workspace/jobs:rwd \
    --env HF_TOKEN=hf_abcdefg \
    ghcr.io/dataesr/llm-finetuning:latest \
    -- uv run main.py --model_name meta-llama/Llama-3.2-1B --dataset_name test.json
  ```

- **Fine-tuning and push to the Hugging Face Hub**

  ```bash
  ovhai job run --gpu 1 \
    --volume llm-datasets@1azgra:/workspace/datasets:ro \
    --volume llm-jobs@1azgra:/workspace/jobs:rwd \
    --env HF_TOKEN=hf_abcdefg \
    ghcr.io/dataesr/llm-finetuning:latest \
    -- uv run main.py --model_name meta-llama/Llama-3.2-1B --dataset_name test.json --hf_hub dataesr/hub_name --hf_hub_private
  ```

- **Push a trained model to the Hub**

  ```bash
  ovhai job run --cpu 1 \
    --volume llm-jobs@1azgra:/workspace/jobs:ro \
    --env HF_TOKEN=hf_abcdefg \
    ghcr.io/dataesr/llm-finetuning:latest \
    -- uv run main.py --mode push --push_model_dir llama-3.2-1b/finetuned --hf_hub dataesr/hub_name --hf_hub_private
  ```

- **Delete all job files**

  ```bash
  ovhai bucket object delete llm-jobs@1azgra --all --yes
  ```
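Once a job is submitted, it can be followed with the standard `ovhai` CLI, for example:

```bash
# List jobs and inspect the logs of a given job (the ID is a placeholder)
ovhai job list
ovhai job logs <job_id>
```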

## Launch the inference app on OVHcloud

```bash
ovhai app run --gpu 1 \
  --env HF_TOKEN=<huggingface_token> \
  --env MODEL_NAME=<huggingface_model_name> \
  --default-http-port 8000 \
  --unsecure-http \
  ghcr.io/dataesr/llm-inference:latest
```

Example:

```bash
ovhai app run --gpu 1 \
  --env HF_TOKEN=hf_abcdef \
  --env MODEL_NAME=dataesr/openchat-3.6-8b-acknowledgments \
  --default-http-port 8000 \
  --unsecure-http \
  ghcr.io/dataesr/llm-inference:latest
```
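If the inference image exposes an OpenAI-compatible HTTP API on port 8000 (as vLLM-based servers do), the deployed app could be queried roughly as below; the URL placeholder, endpoint path and payload are assumptions, not something documented in this repository:

```bash
# Hypothetical request against an OpenAI-compatible /v1/chat/completions endpoint
curl http://<app_url>/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "dataesr/openchat-3.6-8b-acknowledgments",
        "messages": [{"role": "user", "content": "Hello"}]
      }'
```

Replace `<app_url>` with the URL returned when the app is created.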
