Skip to content

Commit 0676ae1

Browse files
committed
Merge branch 'main' into lora/improve-docs
2 parents ad837b0 + 256e696 commit 0676ae1

File tree

78 files changed

+11678
-319
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

78 files changed

+11678
-319
lines changed

docs/source/en/_toctree.yml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -105,6 +105,8 @@
105105
title: MPS
106106
- local: optimization/habana
107107
title: Habana Gaudi
108+
- local: optimization/tome
109+
title: Token Merging
108110
title: Optimization/Special Hardware
109111
- sections:
110112
- local: conceptual/philosophy
@@ -152,6 +154,8 @@
152154
title: DDPM
153155
- local: api/pipelines/dit
154156
title: DiT
157+
- local: api/pipelines/if
158+
title: IF
155159
- local: api/pipelines/latent_diffusion
156160
title: Latent Diffusion
157161
- local: api/pipelines/paint_by_example
@@ -262,6 +266,8 @@
262266
title: VP-SDE
263267
- local: api/schedulers/vq_diffusion
264268
title: VQDiffusionScheduler
269+
- local: api/schedulers/dpm_sde
270+
title: DPMSolverSDEScheduler
265271
title: Schedulers
266272
- sections:
267273
- local: api/experimental/rl

docs/source/en/api/pipelines/audioldm.mdx

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -25,14 +25,14 @@ This pipeline was contributed by [sanchit-gandhi](https://huggingface.co/sanchit
2525

2626
## Text-to-Audio
2727

28-
The [`AudioLDMPipeline`] can be used to load pre-trained weights from [cvssp/audioldm](https://huggingface.co/cvssp/audioldm) and generate text-conditional audio outputs:
28+
The [`AudioLDMPipeline`] can be used to load pre-trained weights from [cvssp/audioldm-s-full-v2](https://huggingface.co/cvssp/audioldm-s-full-v2) and generate text-conditional audio outputs:
2929

3030
```python
3131
from diffusers import AudioLDMPipeline
3232
import torch
3333
import scipy
3434

35-
repo_id = "cvssp/audioldm"
35+
repo_id = "cvssp/audioldm-s-full-v2"
3636
pipe = AudioLDMPipeline.from_pretrained(repo_id, torch_dtype=torch.float16)
3737
pipe = pipe.to("cuda")
3838

@@ -56,7 +56,7 @@ Inference:
5656
### How to load and use different schedulers
5757

5858
The AudioLDM pipeline uses [`DDIMScheduler`] scheduler by default. But `diffusers` provides many other schedulers
59-
that can be used with the AudioLDM pipeline such as [`PNDMScheduler`], [`LMSDiscreteScheduler`], [`EulerDiscreteScheduler`],
59+
that can be used with the AudioLDM pipeline such as [`PNDMScheduler`], [`LMSDiscreteScheduler`], [`EulerDiscreteScheduler`],
6060
[`EulerAncestralDiscreteScheduler`] etc. We recommend using the [`DPMSolverMultistepScheduler`] as it's currently the fastest
6161
scheduler there is.
6262

@@ -68,12 +68,14 @@ method, or pass the `scheduler` argument to the `from_pretrained` method of the
6868
>>> from diffusers import AudioLDMPipeline, DPMSolverMultistepScheduler
6969
>>> import torch
7070
71-
>>> pipeline = AudioLDMPipeline.from_pretrained("cvssp/audioldm", torch_dtype=torch.float16)
71+
>>> pipeline = AudioLDMPipeline.from_pretrained("cvssp/audioldm-s-full-v2", torch_dtype=torch.float16)
7272
>>> pipeline.scheduler = DPMSolverMultistepScheduler.from_config(pipeline.scheduler.config)
7373
7474
>>> # or
75-
>>> dpm_scheduler = DPMSolverMultistepScheduler.from_pretrained("cvssp/audioldm", subfolder="scheduler")
76-
>>> pipeline = AudioLDMPipeline.from_pretrained("cvssp/audioldm", scheduler=dpm_scheduler, torch_dtype=torch.float16)
75+
>>> dpm_scheduler = DPMSolverMultistepScheduler.from_pretrained("cvssp/audioldm-s-full-v2", subfolder="scheduler")
76+
>>> pipeline = AudioLDMPipeline.from_pretrained(
77+
... "cvssp/audioldm-s-full-v2", scheduler=dpm_scheduler, torch_dtype=torch.float16
78+
... )
7779
```
7880
7981
## AudioLDMPipeline

0 commit comments

Comments
 (0)