Skip to content

fix calculations when skipping NaN batches#20

Open
borawhocodess wants to merge 8 commits intoautoml:mainfrom
borawhocodess:nans
Open

fix calculations when skipping NaN batches#20
borawhocodess wants to merge 8 commits intoautoml:mainfrom
borawhocodess:nans

Conversation

@borawhocodess
Copy link
Copy Markdown
Contributor

Because of the continue at line 65, batches containing NaNs are skipped. This led to loss averaging and gradient accumulation not being fully correct, since skipped batches were still implicitly counted.

This was noticed while timing my speedrunning script and inspecting skipped batches.

Integration into the TFM playground hasn’t been thoroughly tested, but the logic should be correct.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant