Skip to content

(Optional) Self hosted embedding model solution #822

@wsxiaoys

Description

@wsxiaoys
  1. Need to design and implement standard interface like TextGeneration.
  2. Consider add bert based embedding to upstream llama.cpp for integrate encoder-decoder model

Related:
https://github.com/ggerganov/llama.cpp/blob/master/examples/embedding/embedding.cpp
ggml-org/llama.cpp#2872

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    Status

    Done

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions