Skip to content

[Feature]: Streaming multi-modal input/output #25066

@DarkLight1337

Description

@DarkLight1337

This is a tracking issue for enabling streaming MM I/O.

Outline

Streaming input:

  • Support streaming multi-modal inputs at API level.
  • Handle streaming inputs in the multi-modal processor.
  • Define an interface for models to indicate support for streaming inputs.
  • Update V1 model runner and scheduler to handle partial MM encoding requests (this is the hardest part IMO)

Streaming output:

  • Implement RequestOutputKind.DELTA for multi-modal outputs in V1 output processor.
  • Support streaming multi-modal outputs at API level.

Notes

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

Planning

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions