-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Closed
Labels
staleIssues that haven't received updatesIssues that haven't received updates
Description
Model/Pipeline/Scheduler description
Now that the first text2video pipeline is merged, the next frontier is to enable efficient fine-tuning of these models.
There's already Dreambooth method for the Diffusion-based text2img models, consisting of making a regularization dataset and then finetuning the model in a specific way on both the input and on the generated dataset.
Given that the nature of these models is essentially the same (as I know for maintaining the Auto1111 extension for this model), it can be enabled by shifting the Unet model from 2d to 3d and changing the class dataset pipeline from images to video. It would be really awesome to do 🙂
Open source status
- The model implementation is available
- The model weights are available (Only relevant if addition is not a scheduler).
Provide useful links for the implementation
No response
Metadata
Metadata
Assignees
Labels
staleIssues that haven't received updatesIssues that haven't received updates