9069a8976ff06f6443e7f4172990a580@2024@MLSYS

Total: 1

#1 L-GreCo: Layerwise-adaptive Gradient Compression For Efficient Data-parallel Deep Learning [PDF2] [Copy] [Kimi] [REL]

Authors: Ilia Markov ; Kaveh Alim ; Elias Frantar ; Dan Alistarh

No summary was provided.