首页|A Distributed Data-Parallel PyTorch Implementation of the Distributed
Shampoo Optimizer for Training Neural Networks At-Scale
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
Jose Gallego-Posada Tsung-Hsien Lee Hao-Jun Michael Shi Michael Rabbat Zhijing Li Kaushik Rangadurai Shintaro Iwasaki Dheevatsa Mudigere

评论