[Open Source Recommendation] DiffRhythm: Extremely Fast and Very Simple End-to-End Full-Length Song Generation with Latent Diffusion #6218

Open

JoeDeanx opened this issue Mar 5, 2025 · 0 comments

Labels

weekly

JoeDeanx commented Mar 5, 2025

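DiffRhythm: A Breakthrough AI Song Generation Model

DiffRhythm is an innovative AI music generation model developed by the ASLP Lab at Northwestern Polytechnical University. Its main features are:

Full-song generation - Generates vocals and accompaniment together, producing complete songs up to 4 minutes 45 seconds long. This addresses a limitation of earlier AI music models, which could only generate a single track or short clips.
Very fast inference - Thanks to its non-autoregressive architecture, it needs only 10 seconds to generate a complete song, much faster than conventional language-model-based approaches.
Simple, elegant design - The model structure is simple and requires no complex data preparation; at inference time it needs only lyrics and a style prompt. This simplicity keeps the model scalable.
Multilingual support - Generates songs in both Chinese and English and adapts to different languages and musical styles.

The project is open source, and developers can get the code on GitHub. It is an important advance in music generation and opens up new possibilities for AI-assisted creation.

GitHub: DiffRhythm Github
Website: DiffRhythm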

ruanyf added the weekly label
