This repository was archived by the owner on Jul 4, 2025. It is now read-only.

epic: llama.cpp params are settable via API call or model.yaml #1151

@dan-menlo

Description

Goal

  • Cortex can handle all llama.cpp params correctly
  • Model running params (i.e. POST /v1/models/<model_id>/start)
  • Inference params (i.e. POST /chat/completions)
  • Function calling, e.g. for llama.cpp
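
To make the split between the two param surfaces concrete, here is a minimal sketch of the request payloads. The endpoint paths come from this issue; the parameter names (`ctx_len`, `ngl`, `temperature`, etc.) are illustrative llama.cpp-style params, not a confirmed Cortex schema:

```python
import json

# Hypothetical model running params: sent once when starting a model,
# i.e. POST /v1/models/<model_id>/start. Names are illustrative only.
start_payload = {
    "ctx_len": 4096,   # context window size
    "ngl": 33,         # number of layers to offload to GPU
    "n_parallel": 1,   # concurrent request slots
}

# Hypothetical inference params: sent per request,
# i.e. POST /chat/completions (OpenAI-compatible shape).
chat_payload = {
    "model": "llama3.1",
    "messages": [{"role": "user", "content": "Hello"}],
    "temperature": 0.7,
    "top_p": 0.9,
    "max_tokens": 256,
}

if __name__ == "__main__":
    print(json.dumps(start_payload))
    print(json.dumps(chat_payload))
```

The point of the split: running params shape how the engine loads the model and are fixed for the session, while inference params can vary per request.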

Tasklist

I am using this epic to aggregate all llama.cpp params issues, including Llama 3.1 function calling + tool use.

model.yaml
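
For reference, a model.yaml carrying llama.cpp params might look like the following. This is a sketch only; field names such as `ctx_len` and `ngl` are assumptions, not the confirmed Cortex schema:

```yaml
# Illustrative model.yaml sketch; field names are assumptions.
model: llama3.1
engine: llama.cpp

# Model running params (applied at POST /v1/models/<model_id>/start)
ctx_len: 4096
ngl: 33

# Default inference params (overridable per /chat/completions request)
temperature: 0.7
top_p: 0.9
max_tokens: 256
```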

Out-of-scope:

Related

Metadata

Labels

  • P0: critical (Mission critical)
  • category: model running (Inference UX, handling context/parameters, runtime)
  • type: epic (A major feature or initiative)

Projects

Status

Completed
