Skip to content

Conversation

zhangsmallshark
Copy link
Contributor

Hello team, Deepspeed-Domino contains all related files for Domino project.

Copy link
Contributor

@GuanhuaWang GuanhuaWang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thx @zhangsmallshark and @shenzheyu for great work.

added a few high level comments, we need to make loss and iter time both fixed! thx

@GuanhuaWang
Copy link
Contributor

@zhangsmallshark , regarding to fix loss commit da0c63b

Maybe I miss something, but I don't see any real code change regarding to fwd/bwd/step. The only changes in this commit just add timers, comment some printout vals. idk how loss is fixed in this commit

@shenzheyu
Copy link
Contributor

@tjruwase Hi Olatunji, we've resolved all the comments. Could you please help us by starting the final review?

@tjruwase
Copy link
Contributor

tjruwase commented Nov 7, 2024

@tjruwase Hi Olatunji, we've resolved all the comments. Could you please help us by starting the final review?

LGTM! Thanks!

@GuanhuaWang GuanhuaWang merged commit faa0420 into deepspeedai:master Nov 7, 2024
2 checks passed
zhangsmallshark added a commit to zhangsmallshark/DeepSpeedExamples that referenced this pull request Feb 12, 2025
* add domino

* use transformer from deepspeed

* clean args

* mega opt

* add opt & timer

* add opt

* fix loss

* folder name

* Change arguent in pretrain script

* Add readme for domino

* Update readme for domino

* Fixing usage issues

* update dataset

* megatron dependencies

* path

* Update README.md

* remove imports

* update import

* Update README.md

* Minor example script changes

* train bash

* require

* Update README.md

---------

Co-authored-by: chengming-zhang <[email protected]>
Co-authored-by: Zheyu SHEN <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Olatunji Ruwase <[email protected]>
Co-authored-by: Logan Adams <[email protected]>
Signed-off-by: zhangsmallshark <[email protected]>
shenzheyu added a commit to shenzheyu/DeepSpeedExamples that referenced this pull request Mar 5, 2025
* add domino

* use transformer from deepspeed

* clean args

* mega opt

* add opt & timer

* add opt

* fix loss

* folder name

* Change arguent in pretrain script

* Add readme for domino

* Update readme for domino

* Fixing usage issues

* update dataset

* megatron dependencies

* path

* Update README.md

* remove imports

* update import

* Update README.md

* Minor example script changes

* train bash

* require

* Update README.md

---------

Co-authored-by: chengming-zhang <[email protected]>
Co-authored-by: Zheyu SHEN <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Olatunji Ruwase <[email protected]>
Co-authored-by: Logan Adams <[email protected]>
Signed-off-by: Zheyu SHEN <[email protected]>
hwchen2017 pushed a commit that referenced this pull request Jun 8, 2025
* add domino

* use transformer from deepspeed

* clean args

* mega opt

* add opt & timer

* add opt

* fix loss

* folder name

* Change arguent in pretrain script

* Add readme for domino

* Update readme for domino

* Fixing usage issues

* update dataset

* megatron dependencies

* path

* Update README.md

* remove imports

* update import

* Update README.md

* Minor example script changes

* train bash

* require

* Update README.md

---------

Co-authored-by: chengming-zhang <[email protected]>
Co-authored-by: Zheyu SHEN <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Olatunji Ruwase <[email protected]>
Co-authored-by: Logan Adams <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants