When Does Preconditioning Help or Hurt Generalization?

Publication
arXiv:2006.10732
Date