[compliance_checker] add weights_initialization rule #92

mwawrzos · 2021-03-23T17:40:29Z

This is an update to the compliance checker that introduces tests against the rules described in #80.

Updates for the following models are ready:

recommendation
object_detection
image_segmentation
rnn_speech_recognition (it was ready before)
language_model
single_stage_detector
image_classification
reinforcement

recommendation, object_detection and image_segmentation

github-actions · 2021-03-23T17:40:46Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

johntran-nv · 2021-04-21T18:19:59Z

@bitfort and @petermattson, what do you think is an appropriate approval list for this one? I believe most folks are already aware of this requirement, as we talked about it in the SWG and folks even brought it up in other topics, so I'm inclined to just accept, but I think it would be good to get some other eyes on it.

petermattson · 2021-04-21T19:53:30Z

We should probably send a "summary of logging changes, final warning for comment" to the Training WG mailing list, and summarize this, RCP checking, etc. The key thing here, IMO, is not to surprise people, more than that we expect folks to have major comments. After a decent period, we do a normal review for finer points. WDYT?


johntran-nv · 2021-04-22T03:01:46Z

Good idea, Peter. I sent out an email to the training alias, and said that I'd come back and merge on Friday, if no one objects.

emizan76 · 2021-04-22T04:58:58Z

A couple of issues / questions just came up:

1. Has weight initialization logging been added to the references? I see only RNNT and Unet3D supporting that. Or do the reference implementations not have to follow this rule? I see a log and compliance directory in the training repo, but I do not know how actively they are maintained or which submission rules the references follow.
2. Is there an assumption that tensor names should be the same as in the reference, or the same as the ones listed in this PR? If submissions use different tensor names, then adding this logging is tedious work.

mwawrzos · 2021-04-22T09:13:22Z

Answering the questions:

1. I tried to do it so that the references are updated first and the compliance checker is updated accordingly, but the submitters WG committee was busy with other topics, and there was no time to bring up the reference update topic. @TheKanter said at one of the benchmark infra WG meetings that contractors will be hired, and they can update the references. Apart from that, most of the references are outdated now; some of them don't even support MLPerf logging. That is why I decided to propose the changes this way.
2. Tensor names were supposed to be straightforward. I'm open to changing them. If some benchmark is described in too much or too little detail, that is also subject to change. The intention is to make submission review easier, so that finding the sources that initialize weights is simpler. Would changing the tensor names to be reference-framework-like make it more acceptable?
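
For illustration only (this is not code from the PR): a minimal sketch of the kind of check being discussed, namely that a log contains a `weights_initialization` event for every required tensor name. It assumes log lines use a `:::MLLOG` prefix followed by a JSON record, and that the tensor name is stored under `metadata["tensor"]`. The helper names, the required tensor names, and the sample log are hypothetical, and the actual checker may be structured quite differently.

```python
import json

MLLOG_PREFIX = ":::MLLOG"  # assumed log-line prefix


def logged_tensors(log_lines):
    """Collect tensor names reported by weights_initialization events."""
    names = set()
    for line in log_lines:
        line = line.strip()
        if not line.startswith(MLLOG_PREFIX):
            continue  # not a structured log line
        try:
            record = json.loads(line[len(MLLOG_PREFIX):])
        except json.JSONDecodeError:
            continue  # skip malformed records
        if record.get("key") != "weights_initialization":
            continue
        # Assumption: the tensor name lives in metadata["tensor"].
        tensor = (record.get("metadata") or {}).get("tensor")
        if tensor is not None:
            names.add(tensor)
    return names


def missing_tensors(log_lines, required):
    """Return required tensor names that were never logged, sorted."""
    return sorted(set(required) - logged_tensors(log_lines))


# Hypothetical tensor names and log content, for demonstration only.
REQUIRED = ["embedding_weight", "output_bias"]
sample_log = [
    ':::MLLOG {"key": "weights_initialization", "value": null, '
    '"metadata": {"tensor": "embedding_weight"}}',
    ':::MLLOG {"key": "seed", "value": 42, "metadata": {}}',
    "unrelated stdout line",
]

print(missing_tensors(sample_log, REQUIRED))
```

A check of this shape depends entirely on submissions and the rule list agreeing on tensor names, which is why the naming question above matters.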

petermattson · 2021-04-22T19:58:05Z

Folks, should we maybe consider pushing this out a version until we can at least implement and test in the references? Aligning tensor names is a *lot* of work, and tensors for some of the benchmarks may be *very* big. Not sure we want to bloat logging files that much.


johntran-nv · 2021-04-23T17:57:20Z

That's a good suggestion, Peter. Let's defer this to v1.1, then. I'll leave it open.

petermattson · 2021-04-23T23:52:02Z

Thanks Jon!


xyhuang · 2021-09-21T17:52:02Z

Fixed in #153

[compliance_checker][wegihts_initialiazation] update three models

b97e08c

recommendation, objeobject_detection and image_segmentation

mwawrzos added 8 commits March 24, 2021 18:02

[compliance_checker][DLRM] right weights_initialization tensor names

70a3f0f

add wegihts_initialiazation for BERT

32414b0

replace tabs with spaces

dfb9fbd

update bert tensor names to better match the reference

fdf60a7

syntax fix in wegihts_initialiazation tests

489f975

[compliance_checker_update] wegihts_initialiazation logging for resnet

3b9069c

[compliance_checker] wegihts_initialiazation for ssd

bd5e195

[compliance_checker] wegihts_initialiazation in miniGo

cebba9d

emizan76 approved these changes Apr 21, 2021


xyhuang closed this Sep 21, 2021

github-actions bot locked and limited conversation to collaborators Sep 21, 2021
