update the lr when switching to adam optimizer #2393
Merged
When switching to the Adam optimizer on Apple computers (SGD crashes on Apple Silicon), the learning rate was not updated. This almost always caused models to diverge.
This PR updates the learning rate to the same values (and schedule) used by default for multi-animal projects, which already train with Adam:
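The sketch below illustrates the kind of change involved. It is not a copy of the diff: the config keys and the numeric schedules are assumptions modelled on typical `pose_cfg` conventions, so treat both the SGD and Adam values as placeholders.

```python
# Illustrative sketch only: key names and numeric values are assumptions,
# not taken from this PR's diff.
sgd_pose_cfg = {
    "optimizer": "sgd",
    # SGD-scale learning-rate schedule (illustrative values).
    "multi_step": [[0.005, 10_000], [0.02, 430_000], [0.002, 730_000], [0.001, 1_030_000]],
}

def switch_to_adam(cfg: dict) -> dict:
    """Return a copy of the config that uses Adam with a much smaller,
    decaying learning-rate schedule (illustrative values, modelled on
    multi-animal defaults)."""
    cfg = dict(cfg)
    cfg["optimizer"] = "adam"
    # Keeping the SGD-scale learning rates (~5e-3 and above) with Adam
    # tends to make training diverge; use a smaller schedule instead.
    cfg["multi_step"] = [[1e-4, 7_500], [5e-5, 12_000], [1e-5, 200_000]]
    return cfg

adam_pose_cfg = switch_to_adam(sgd_pose_cfg)
print(adam_pose_cfg["optimizer"], adam_pose_cfg["multi_step"])
```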
This was tested on an M2 MacBook Air with a dataset that diverged in fewer than 5 iterations under the previous schedule and now converges nicely.