
Conversation

@DhairyaLGandhi
Member

This is in response to #666, where we can view train! as a wrapper around the step! function while maintaining the same API.

To actually accumulate the loss as in #666 (comment), we could of course go with the loss function being a simple closure here, getting rid of the need to send around the minibatch.

With the changes to Zygote, it might be nice to actually have this in our interface.

cc @MikeInnes @oxinabox

throw(StopException())
end

function step!(loss, ps, minibatch, opt)
Member

Can this have a docstring, and be exported?

Member Author

Yeah, it would be good to converge on the implementation first; once we do, I'll add the docstrings and a basic test too.

gs = gradient(ps) do
loss(minibatch...)
end
update!(opt, ps, gs)
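Assembled from the fragments above, the proposed split might look like the sketch below. Zygote's gradient and Flux's update! are replaced by toy stand-ins (toy_gradient, toy_update!) so the example runs in base Julia; those names and the loop details are purely illustrative, not the PR's actual implementation.

```julia
# Toy sketch of the proposed step!/train! interface.
# `toy_gradient` and `toy_update!` are made-up stand-ins for Zygote's
# `gradient` and Flux's `update!`, used only to keep the sketch runnable.
struct StopException <: Exception end

# Stand-in for Zygote's gradient: central finite differences over a
# parameter vector.
function toy_gradient(f, ps; h = 1e-6)
    map(eachindex(ps)) do i
        up = copy(ps); up[i] += h
        dn = copy(ps); dn[i] -= h
        (f(up) - f(dn)) / 2h
    end
end

# Stand-in for `update!(opt, ps, gs)`: plain gradient descent,
# treating `opt` as a learning rate.
toy_update!(opt, ps, gs) = (ps .-= opt .* gs)

function step!(loss, ps, minibatch, opt)
    gs = toy_gradient(p -> loss(p, minibatch...), ps)
    toy_update!(opt, ps, gs)
    return nothing          # train for effect; no loss is returned
end

# train! is then just a loop over step!, with StopException as the
# escape hatch for callbacks.
function train!(loss, ps, data, opt)
    for minibatch in data
        try
            step!(loss, ps, minibatch, opt)
        catch e
            e isa StopException || rethrow(e)
            break
        end
    end
end

# Fit w ≈ 2 on y = 2x with squared error.
ps = [0.0]
sqloss(p, x, y) = (p[1] * x - y)^2
data = [(x, 2.0x) for x in 1.0:0.5:3.0]
for _ in 1:200
    train!(sqloss, ps, data, 0.01)
end
```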
Member

Isn't this return incompatible with the interface proposed in #666 (comment)?

Member Author

It returns nothing, which should be fine for most cases. For that comment specifically, yes, but it's really just a matter of agreeing on the desired semantics. Returning the loss here would be fine, but since it steps through a batch and performs a training step, I feel the correct return value is nothing, with a slightly more trained model as it were.

@DhairyaLGandhi
Member Author

Pinging @MikeInnes for his thoughts on the semantics that would make most sense here.

@MikeInnes
Member

I agree that closing over the batch would make the most sense. I also agree it's best to avoid returning things like losses, since as discussed earlier we can just close over those things, and we will eventually want an out-of-place version of this that returns a model.
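To make the closure idea concrete: the caller can capture both the minibatch and any loss accumulator inside the loss function itself, so the training step has nothing to return. A minimal pure-Julia illustration; all names here (losses, w, minibatch) are invented for the example:

```julia
losses = Float64[]
w = [1.0]                                       # toy "parameters"
minibatch = ([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])  # (xs, ys) with y = 2x

# The loss closes over the minibatch and the `losses` accumulator,
# so nothing needs to be threaded through or returned by the step.
loss = function ()
    xs, ys = minibatch
    l = sum(abs2, w[1] .* xs .- ys)
    push!(losses, l)
    return l
end

loss()   # with w = 1, residuals are (-1, -2, -3), so l = 1 + 4 + 9 = 14
```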

train! will likely need a fair bit of refactoring once we figure out new optimisers + accelerator support, but we can figure that out later.
