Skip to content

Conversation

@nirrozenbaum
Copy link
Contributor

the separation seems unnecessary.
there is no place where one call is done without the other + the code is very simple a short.

no logic change.

@netlify
Copy link

netlify bot commented May 11, 2025

Deploy Preview for gateway-api-inference-extension ready!

Name Link
🔨 Latest commit 86039c8
🔍 Latest deploy log https://app.netlify.com/sites/gateway-api-inference-extension/deploys/68208becd58655000834f352
😎 Deploy Preview https://deploy-preview-819--gateway-api-inference-extension.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label May 11, 2025
@k8s-ci-robot k8s-ci-robot requested review from Jeffwan and robscott May 11, 2025 11:32
@k8s-ci-robot k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label May 11, 2025
Signed-off-by: Nir Rozenbaum <[email protected]>
@kfswain
Copy link
Collaborator

kfswain commented May 11, 2025

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 11, 2025
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kfswain, nirrozenbaum

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 11, 2025
@k8s-ci-robot k8s-ci-robot merged commit 62f226c into kubernetes-sigs:main May 11, 2025
8 checks passed
@nirrozenbaum nirrozenbaum deleted the env branch May 11, 2025 16:26
k8s-ci-robot pushed a commit that referenced this pull request May 12, 2025
* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to NewSchedulerContext API change

Signed-off-by: Shmuel Kallner <[email protected]>

* Populate the RequestId field of LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates to tests

Signed-off-by: Shmuel Kallner <[email protected]>

* Added PostResponse plugins to scheduler config

Signed-off-by: Shmuel Kallner <[email protected]>

* Added scheduler.OnResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Added dispatcher.HandleResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Refactored server response header handling to invoke PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Added simple test for PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Setup the logger in the SchedulerContext appropriately for reponses

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to rebase issues

* merge functions in env utils (#819)

Signed-off-by: Nir Rozenbaum <[email protected]>

* generalize scheduling cycle state concept (#818)

* generalize scheduling cycle state concept

Signed-off-by: Nir Rozenbaum <[email protected]>

* typo

Signed-off-by: Nir Rozenbaum <[email protected]>

* make linter happy

Signed-off-by: Nir Rozenbaum <[email protected]>

* make prefix state struct internal to package instead of public

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* remove Model field from LLMRequest (#782)

* remove Model field from LLMRequest

Signed-off-by: Nir Rozenbaum <[email protected]>

* rebase handling

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Insure that wanted response header messages have all of the response headers in them

Signed-off-by: Shmuel Kallner <[email protected]>

---------

Signed-off-by: Shmuel Kallner <[email protected]>
Signed-off-by: Nir Rozenbaum <[email protected]>
Co-authored-by: Nir Rozenbaum <[email protected]>
nayihz pushed a commit to nayihz/gateway-api-inference-extension that referenced this pull request May 14, 2025
nayihz pushed a commit to nayihz/gateway-api-inference-extension that referenced this pull request May 14, 2025
* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to NewSchedulerContext API change

Signed-off-by: Shmuel Kallner <[email protected]>

* Populate the RequestId field of LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates to tests

Signed-off-by: Shmuel Kallner <[email protected]>

* Added PostResponse plugins to scheduler config

Signed-off-by: Shmuel Kallner <[email protected]>

* Added scheduler.OnResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Added dispatcher.HandleResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Refactored server response header handling to invoke PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Added simple test for PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Setup the logger in the SchedulerContext appropriately for reponses

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to rebase issues

* merge functions in env utils (kubernetes-sigs#819)

Signed-off-by: Nir Rozenbaum <[email protected]>

* generalize scheduling cycle state concept (kubernetes-sigs#818)

* generalize scheduling cycle state concept

Signed-off-by: Nir Rozenbaum <[email protected]>

* typo

Signed-off-by: Nir Rozenbaum <[email protected]>

* make linter happy

Signed-off-by: Nir Rozenbaum <[email protected]>

* make prefix state struct internal to package instead of public

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* remove Model field from LLMRequest (kubernetes-sigs#782)

* remove Model field from LLMRequest

Signed-off-by: Nir Rozenbaum <[email protected]>

* rebase handling

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Insure that wanted response header messages have all of the response headers in them

Signed-off-by: Shmuel Kallner <[email protected]>

---------

Signed-off-by: Shmuel Kallner <[email protected]>
Signed-off-by: Nir Rozenbaum <[email protected]>
Co-authored-by: Nir Rozenbaum <[email protected]>
kaushikmitr pushed a commit to kaushikmitr/llm-instance-gateway that referenced this pull request May 15, 2025
kaushikmitr pushed a commit to kaushikmitr/llm-instance-gateway that referenced this pull request May 15, 2025
* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to NewSchedulerContext API change

Signed-off-by: Shmuel Kallner <[email protected]>

* Populate the RequestId field of LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates to tests

Signed-off-by: Shmuel Kallner <[email protected]>

* Added PostResponse plugins to scheduler config

Signed-off-by: Shmuel Kallner <[email protected]>

* Added scheduler.OnResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Added dispatcher.HandleResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Refactored server response header handling to invoke PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Added simple test for PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Setup the logger in the SchedulerContext appropriately for reponses

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to rebase issues

* merge functions in env utils (kubernetes-sigs#819)

Signed-off-by: Nir Rozenbaum <[email protected]>

* generalize scheduling cycle state concept (kubernetes-sigs#818)

* generalize scheduling cycle state concept

Signed-off-by: Nir Rozenbaum <[email protected]>

* typo

Signed-off-by: Nir Rozenbaum <[email protected]>

* make linter happy

Signed-off-by: Nir Rozenbaum <[email protected]>

* make prefix state struct internal to package instead of public

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* remove Model field from LLMRequest (kubernetes-sigs#782)

* remove Model field from LLMRequest

Signed-off-by: Nir Rozenbaum <[email protected]>

* rebase handling

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Insure that wanted response header messages have all of the response headers in them

Signed-off-by: Shmuel Kallner <[email protected]>

---------

Signed-off-by: Shmuel Kallner <[email protected]>
Signed-off-by: Nir Rozenbaum <[email protected]>
Co-authored-by: Nir Rozenbaum <[email protected]>
irar2 pushed a commit to irar2/gateway-api-inference-extension that referenced this pull request Jun 3, 2025
* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to NewSchedulerContext API change

Signed-off-by: Shmuel Kallner <[email protected]>

* Populate the RequestId field of LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates to tests

Signed-off-by: Shmuel Kallner <[email protected]>

* Added PostResponse plugins to scheduler config

Signed-off-by: Shmuel Kallner <[email protected]>

* Added scheduler.OnResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Added dispatcher.HandleResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Refactored server response header handling to invoke PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Added simple test for PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Setup the logger in the SchedulerContext appropriately for reponses

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to rebase issues

* merge functions in env utils (kubernetes-sigs#819)

Signed-off-by: Nir Rozenbaum <[email protected]>

* generalize scheduling cycle state concept (kubernetes-sigs#818)

* generalize scheduling cycle state concept

Signed-off-by: Nir Rozenbaum <[email protected]>

* typo

Signed-off-by: Nir Rozenbaum <[email protected]>

* make linter happy

Signed-off-by: Nir Rozenbaum <[email protected]>

* make prefix state struct internal to package instead of public

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* remove Model field from LLMRequest (kubernetes-sigs#782)

* remove Model field from LLMRequest

Signed-off-by: Nir Rozenbaum <[email protected]>

* rebase handling

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Insure that wanted response header messages have all of the response headers in them

Signed-off-by: Shmuel Kallner <[email protected]>

---------

Signed-off-by: Shmuel Kallner <[email protected]>
Signed-off-by: Nir Rozenbaum <[email protected]>
Co-authored-by: Nir Rozenbaum <[email protected]>
rlakhtakia pushed a commit to rlakhtakia/gateway-api-inference-extension that referenced this pull request Jun 11, 2025
rlakhtakia pushed a commit to rlakhtakia/gateway-api-inference-extension that referenced this pull request Jun 11, 2025
* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to NewSchedulerContext API change

Signed-off-by: Shmuel Kallner <[email protected]>

* Populate the RequestId field of LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates to tests

Signed-off-by: Shmuel Kallner <[email protected]>

* Added PostResponse plugins to scheduler config

Signed-off-by: Shmuel Kallner <[email protected]>

* Added scheduler.OnResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Added dispatcher.HandleResponse to handle responses

Signed-off-by: Shmuel Kallner <[email protected]>

* Refactored server response header handling to invoke PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Added simple test for PostResponse plugins

Signed-off-by: Shmuel Kallner <[email protected]>

* Setup the logger in the SchedulerContext appropriately for reponses

Signed-off-by: Shmuel Kallner <[email protected]>

* Updates due to rebase issues

* merge functions in env utils (kubernetes-sigs#819)

Signed-off-by: Nir Rozenbaum <[email protected]>

* generalize scheduling cycle state concept (kubernetes-sigs#818)

* generalize scheduling cycle state concept

Signed-off-by: Nir Rozenbaum <[email protected]>

* typo

Signed-off-by: Nir Rozenbaum <[email protected]>

* make linter happy

Signed-off-by: Nir Rozenbaum <[email protected]>

* make prefix state struct internal to package instead of public

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* remove Model field from LLMRequest (kubernetes-sigs#782)

* remove Model field from LLMRequest

Signed-off-by: Nir Rozenbaum <[email protected]>

* rebase handling

Signed-off-by: Nir Rozenbaum <[email protected]>

---------

Signed-off-by: Nir Rozenbaum <[email protected]>

* Added the LLMResponse struct and RequestId to LLMRequest

Signed-off-by: Shmuel Kallner <[email protected]>

* Insure that wanted response header messages have all of the response headers in them

Signed-off-by: Shmuel Kallner <[email protected]>

---------

Signed-off-by: Shmuel Kallner <[email protected]>
Signed-off-by: Nir Rozenbaum <[email protected]>
Co-authored-by: Nir Rozenbaum <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants