DLRM training for MLPerf v1.0 submission. not merged. #163

liangan1 · 2021-05-21T01:49:07Z

1)Vertical split embedding to scale-out to much more ranks.
2)LAMB to enable large batch size.

2)LAMB to enable large batch size.

…ON (#163)

Peach-He · 2022-01-13T06:24:59Z

torch_patches/models/dlrm_mlperf_v1.0_training.diff

+                optimizer_dense = optimizers[0][0]([
+                    {"params": [p for emb in dlrm.emb_dense for p in emb.parameters()], "lr": args.learning_rate},
+                    {"params": dlrm.bot_l.parameters(), "lr": args.learning_rate},
+                    {"params": dlrm.top_l.parameters(), "lr": args.learning_rate}
+                ], lr=args.lamblr, bf16=args.bf16)
+                optimizer_sparse = optimizers[1]([
+                    {"params": [p for emb in dlrm.emb_sparse for p in emb.parameters()],
+                     "lr": args.learning_rate / ext_dist.my_size},
+                ], lr=args.learning_rate)
+                optimizer = (optimizer_dense, optimizer_sparse)


I want to upgrade this DLRM optimization to latest IPEX 1.10, any example on optimizing multi optimizers with ipex?

There is no such case now.

liangan1 added 2 commits May 21, 2021 09:44

1)Vertical split embedding to scale-out to much more ranks.

5845989

2)LAMB to enable large batch size.

Add get_hybridparallel_friendly_dataset.py

653f646

EikanWang pushed a commit that referenced this pull request Oct 4, 2021

set AliasAnalysisKind of embedding_bag and interaction to PURE_FUNCTI…

4201cef

…ON (#163)

Peach-He reviewed Jan 13, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DLRM training for MLPerf v1.0 submission. not merged. #163

DLRM training for MLPerf v1.0 submission. not merged. #163

Uh oh!

liangan1 commented May 21, 2021

Uh oh!

Peach-He Jan 13, 2022

Uh oh!

liangan1 Jan 13, 2022

Uh oh!

Uh oh!

DLRM training for MLPerf v1.0 submission. not merged. #163

Are you sure you want to change the base?

DLRM training for MLPerf v1.0 submission. not merged. #163

Uh oh!

Conversation

liangan1 commented May 21, 2021

Uh oh!

Peach-He Jan 13, 2022

Choose a reason for hiding this comment

Uh oh!

liangan1 Jan 13, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!