|国家预印本平台
| 注册
首页|A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

Jose Gallego-Posada Tsung-Hsien Lee Hao-Jun Michael Shi Michael Rabbat Zhijing Li Kaushik Rangadurai Shintaro Iwasaki Dheevatsa Mudigere

Arxiv_logoArxiv

A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

Jose Gallego-Posada Tsung-Hsien Lee Hao-Jun Michael Shi Michael Rabbat Zhijing Li Kaushik Rangadurai Shintaro Iwasaki Dheevatsa Mudigere

作者信息

引用本文复制引用

Jose Gallego-Posada,Tsung-Hsien Lee,Hao-Jun Michael Shi,Michael Rabbat,Zhijing Li,Kaushik Rangadurai,Shintaro Iwasaki,Dheevatsa Mudigere.A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale[EB/OL].(2023-09-12)[2025-12-13].https://arxiv.org/abs/2309.06497.

学科分类

计算技术、计算机技术

评论

首发时间 2023-09-12
下载量:0
|
点击量:25
段落导航相关论文