LLaVA C++ Server

Bart Trzynadlowski, 2023

A simple API server for the llama.cpp implementation of LLaVA.

Usage

Download one of the ggml-model-*.gguf files and mmproj-model-f16.gguf from here. Then invoke:

bin/llava-server -m ggml-model-q5_k.gguf --mmproj mmproj-model-f16.gguf

This starts a server on localhost:8080. You can change the host and port with --host and --port, respectively, and enable HTTP logging with --log-http. You can also interact with the server by opening localhost:8080 in a web browser.
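For example, to serve on a different address and port with HTTP logging enabled (the host and port values below are only illustrative):

bin/llava-server -m ggml-model-q5_k.gguf --mmproj mmproj-model-f16.gguf --host 0.0.0.0 --port 9000 --log-http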

API

The LLaVA endpoint is at /llava. The request body takes the following parameters:

Name            Type    Required  Description
user_prompt     string  yes       The prompt (e.g., "what is this?")
image_file      file    yes       Image data in binary form.
system_prompt   string  no        System prompt.
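
As a sketch, here is one way to call the endpoint with curl, assuming the parameters are sent as multipart/form-data (image.jpg and the system prompt text are placeholders):

curl http://localhost:8080/llava \
  -F "user_prompt=what is this?" \
  -F "system_prompt=You are a helpful assistant." \
  -F "image_file=@image.jpg"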

Build Instructions

The llama.cpp and cpp-httplib repositories are included as git submodules. After cloning, first run:

git submodule init
git submodule update
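
Alternatively, you can fetch the submodules at clone time (assuming the repository lives at the trzy/llava-cpp-server path on GitHub):

git clone --recurse-submodules https://github.com/trzy/llava-cpp-server.git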

Then to build, simply run:

make

So far, this has only been tested on macOS, but it should work anywhere else llama.cpp builds.
