ren@usenixsecurity24@USENIX

Total: 1

#1 Accelerating Secure Collaborative Machine Learning with Protocol-Aware RDMA [PDF3] [Copy] [Kimi10] [REL]

Authors: Zhenghang Ren ; Mingxuan Fan ; Zilong Wang ; Junxue Zhang ; Chaoliang Zeng ; Zhicong Huang ; Cheng Hong ; Kai Chen

Secure Collaborative Machine Learning (SCML) suffers from high communication cost caused by secure computation protocols. While modern datacenters offer high-bandwidth and low-latency networks with Remote Direct Memory Access (RDMA) capability, existing SCML implementation remains to use TCP sockets, leading to inefficiency. We present CORA1 to implement SCML over RDMA. By using a protocol-aware design, CORA identifies the protocol used by the SCML program and sends messages directly to the remote party's protocol buffer, improving the efficiency of message exchange. CORA exploits the chance that the SCML task is determined before execution and the pattern is largely input-irrelevant, so that CORA can plan message destinations on remote hosts at compile time. CORA can be readily deployed with existing SCML frameworks such as Piranha with its socket-like interface. We evaluate CORA in SCML training tasks, and our results show that CORA can reduce communication cost by up to 11x and achieve 1.2x - 4.2x end-to-end speedup over TCP in SCML training.