[CS] REMINDER: Kuntai Du Candidacy Exam/Nov 11th

via cs cs at mailman.cs.uchicago.edu
Fri Nov 8 13:01:25 CST 2024


This is an announcement of Kuntai Du's Candidacy Exam.
===============================================
Candidate: Kuntai Du

Date: Monday, November 11

Time:   2:00PM -3:00PM CST

Remote Location:  https://uchicago.zoom.us/j/4015085138?pwd=eU5ZNzc3Mmg3bTFiQ2E3ejZqSXlWQT09

Title: Optimizing Tensor Communication for Distributed Inference

Abstract: Distributed DNN inference pipelines like video analytics and LLM inference often require transferring large tensor data, such as video pixels or KV caches. However, the network bandwidth is limited, making it infeasible to transmit all data in real-time, necessitating an efficient communication support for such distributed inference pipeline. Prior work uses strategies like sender-side heuristics (e.g., content-based video compression or selective KV cache eviction) and periodic profiling to optimize for tensor communication. However, we argue that the system needs to be built based on real-time feedback from the receiver so that the system prioritizes the transfer of those data that are needed by the receiver. This allows us to transmit large tensors in real-time under limited network bandwidth without significantly degrading the performance of distributed DNN inference pipelines.

Advisors: Junchen Jiang

Committee Members: Junchen Jiang, Ariel Holtzman, Shan Lu, Ganesh Ananthanarayanan, Ion Stoica.



More information about the cs mailing list