Conversation

@danieltudosiu
Contributor

Description

Added a basic gradient clipping draft for the SupervisedTrainer. The implementation choices still need to be discussed.
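
For reference, the core of the draft boils down to the standard PyTorch clipping call. A minimal, self-contained sketch of that pattern is below; the model, optimizer, and max_norm value are placeholders for illustration, not the draft's actual code:

```python
import torch
from torch import nn

# Placeholder model/optimizer purely to illustrate where clipping sits.
model = nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

inputs, targets = torch.randn(4, 10), torch.randn(4, 1)

optimizer.zero_grad()
loss = loss_fn(model(inputs), targets)
loss.backward()
# Clip the global gradient norm after backward() and before the optimizer step.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
optimizer.step()
```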

Status

Work in progress

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • New tests added to cover the changes.
  • Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
  • Quick tests passed locally by running ./runtests.sh --quick --unittests.
  • In-line docstrings updated.
  • Documentation updated, tested make html command in the docs/ folder.

Added a basic gradient clipping draft for the SupervisedTrainer.

Signed-off-by: Petru-Daniel Tudosiu <[email protected]>
@danieltudosiu changed the title from "[WIP] Gradient clipping draft" to "[WIP] Gradient clipping" on Apr 7, 2021
@danieltudosiu
Contributor Author

@wyli / @Nic-Ma I need your opinion about how to attack this one.

In my mind, each Trainer class must have its own gradient clipping function since the number of optimizers and scalers can vary.
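To show why the scaler matters: when AMP is enabled, the gradients have to be unscaled before the norm is measured, so the clipping code cannot be written independently of the trainer's scaler. A rough sketch of that pattern (assumes a CUDA device; the model, optimizer, and max_norm are illustrative placeholders, not the draft's code):

```python
import torch

model = torch.nn.Linear(10, 1).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scaler = torch.cuda.amp.GradScaler()
loss_fn = torch.nn.MSELoss()

inputs = torch.randn(4, 10, device="cuda")
targets = torch.randn(4, 1, device="cuda")

optimizer.zero_grad()
with torch.cuda.amp.autocast():
    loss = loss_fn(model(inputs), targets)
scaler.scale(loss).backward()
# Unscale before clipping, otherwise the norm is computed on the scaled
# gradients and the clipping threshold is meaningless.
scaler.unscale_(optimizer)
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
scaler.step(optimizer)
scaler.update()
```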

What do you think of the draft logic? Should I write it for the GAN trainer as well? (I noticed the GAN trainer is not on par with the SupervisedTrainer.)

I think the following solutions do not work:

  • backward_hook
    • The "inf" norm would not be usable, since computing it requires access to all of the model's parameters at once.
  • Ignite Handler
    • It would have to be created after the engine so that it can access the trainer's scaler and amp attributes (see the sketch after this list).
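
To illustrate the second point, a handler-based approach would look roughly like the sketch below, and it can only be attached once the trainer already exists. The import path, the BACKWARD_COMPLETED event, and the amp/scaler/optimizer/network attribute names are assumptions about MONAI's internals, not a confirmed API:

```python
import torch

from monai.engines import IterationEvents  # assumed import path for MONAI's custom events


def attach_grad_clipping(trainer, max_norm: float = 1.0):
    """Hypothetical helper that bolts gradient clipping onto an existing trainer.

    It can only be attached after the trainer is constructed, because it reads
    the trainer's ``amp``/``scaler``/``optimizer``/``network`` attributes.
    """

    def _clip(engine):
        scaler = getattr(engine, "scaler", None)
        if getattr(engine, "amp", False) and scaler is not None:
            # Unscale first so the norm is computed on the real gradients.
            scaler.unscale_(engine.optimizer)
        torch.nn.utils.clip_grad_norm_(engine.network.parameters(), max_norm)

    # Clip right after backward(), before the optimizer step runs.
    trainer.add_event_handler(IterationEvents.BACKWARD_COMPLETED, _clip)
```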

@wyli
Contributor

wyli commented Mar 15, 2022

closing this, as preferred solutions are described in #3892 (reply in thread)

@wyli closed this on Mar 15, 2022