-
-
Notifications
You must be signed in to change notification settings - Fork 10.5k
Open
Labels
feature requestNew feature or requestNew feature or requestmulti-modalityRelated to multi-modality (#4194)Related to multi-modality (#4194)
Description
This is a tracking issue for enabling streaming MM I/O.
Outline
Streaming input:
- Support streaming multi-modal inputs at API level.
- Handle streaming inputs in the multi-modal processor.
- Define an interface for models to indicate support for streaming inputs.
- Update V1 model runner and scheduler to handle partial MM encoding requests (this is the hardest part IMO)
Streaming output:
- Implement
RequestOutputKind.DELTA
for multi-modal outputs in V1 output processor. - Support streaming multi-modal outputs at API level.
Notes
- Currently we are waiting for AWS's proposal
- Take inspiration from [Model][VLM] Add Qwen2.5-Omni model support (end-to-end full support) #16347?
- See also #22695
noooop, wulaoshi, ekagra-ranjan, oi02lyl and huangyc0618huangyc0618wwl2755, qandrew, huangyc0618 and ekagra-ranjan
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or requestmulti-modalityRelated to multi-modality (#4194)Related to multi-modality (#4194)
Type
Projects
Status
Planning