In the `forward` method of class `WideResNet`, a BN layer normalizes the output of the last res-block. If I comment out this line, the loss becomes NaN after a few steps (at a high learning rate). Does this happen only in MPL, or should I watch for it in other algorithms as well?
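For reference, here is a minimal sketch of the kind of setup described above, which may help in reproducing the behaviour outside MPL. It is not the repository's actual code: `TinyWideResNetHead`, the `use_final_bn` flag, and `first_non_finite_step` are hypothetical names, a single convolution stands in for the res-blocks, and the training data is random.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyWideResNetHead(nn.Module):
    """Hypothetical stand-in for the tail of a WideResNet forward pass."""

    def __init__(self, channels=16, num_classes=10, use_final_bn=True):
        super().__init__()
        # A single conv stands in for the stack of res-blocks (assumption).
        self.block = nn.Conv2d(3, channels, kernel_size=3, padding=1)
        # The layer in question: BN applied to the output of the last block.
        # With use_final_bn=False it is replaced by a no-op, which mimics
        # commenting the line out.
        self.final_bn = nn.BatchNorm2d(channels) if use_final_bn else nn.Identity()
        self.fc = nn.Linear(channels, num_classes)

    def forward(self, x):
        out = self.block(x)
        out = F.relu(self.final_bn(out))
        out = F.adaptive_avg_pool2d(out, 1).flatten(1)
        return self.fc(out)


def first_non_finite_step(model, lr, steps=20, seed=0):
    """Train with SGD on random data.

    Returns the index of the first step whose loss is NaN or inf,
    or None if every loss stayed finite.
    """
    gen = torch.Generator().manual_seed(seed)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for step in range(steps):
        x = torch.randn(8, 3, 8, 8, generator=gen)
        y = torch.randint(0, 10, (8,), generator=gen)
        loss = F.cross_entropy(model(x), y)
        if not torch.isfinite(loss):
            return step
        opt.zero_grad()
        loss.backward()
        opt.step()
    return None
```

Calling `first_non_finite_step` on a model built with `use_final_bn=True` and on one built with `use_final_bn=False`, at the same learning rate, gives a quick way to compare the two variants. Whether this toy model diverges at a given learning rate is not guaranteed to match the full WideResNet.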