-
Notifications
You must be signed in to change notification settings - Fork 176
vLLM custom connector setup guide #3858
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
|
||
| ## Requirements | ||
|
|
||
| * Docker or Podman. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There were a few places throughout the guide that referred to a Docker container. I changed those to just refer to a container since it seems Podman is an acceptable alternative too.
|
|
||
| 1. Configure your host server with the necessary GPU resources. | ||
| 2. Run the desired model in a vLLM container. | ||
| 3. Use a reverse proxy like Nginx to securely expose the endpoint to {{ecloud}}. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it just Elastic Cloud that this works with? Not other deployment types?
| 1. When you want to invoke a tool, never describe the call in text. | ||
| 2. Always return the invocation in the `tool_calls` field. | ||
| 3. The `content` field must remain empty for any assistant message that performs a tool call. | ||
| 4. Only use tool calls defined in the "tools" parameter. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note to self: Following https://github.com/elastic/sdh-security-team/issues/1417 to confirm if this system prompt fix works
can we make the existing page generic then link to two methods:
|
Resolves #3474 by creating a tutorial for how to connect a custom LLM running in vLLM to Elastic.
Technical reviewers, I left a few questions for you in comments. Also: