Skip to content

gd-zhang/noisy-quadratic-model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

noisy-quadratic-model

The major contributors of this repository include Roger Grosse and Guodong Zhang.

Introduction

This repository contains the toy code to reproduce the NQM results from the paper Which Alorithmic Choices Matter at Which Batch Sizes?.

Particularly, you can reproduce our results on momentum (left figure), preconditioning (both figures), exponential moving average (right figures) and learning rate decay with this code. Here are a few figures from our paper.

Momentum Exponential Moving Average

Citation

To cite this work, please use

@inproceedings{zhang2019algorithmic,
  title={Which algorithmic choices matter at which batch sizes? insights from a noisy quadratic model},
  author={Zhang, Guodong and Li, Lala and Nado, Zachary and Martens, James and Sachdeva, Sushant and Dahl, George E and Shallue, Christopher J and Grosse, Roger},
  booktitle={Advances in Neural Information Processing Systems},
  year={2019}
}

About

Large-batch Training, Neural Network Optimization

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published