[CS] Kuntai Du Dissertation Defense/Jun 6, 2025

via cs cs at mailman.cs.uchicago.edu
Tue May 27 12:01:14 CDT 2025


This is an announcement of Kuntai Du's Dissertation Defense.
===============================================
Candidate: Kuntai Du

Date: Friday, June 06, 2025

Time: 12 PM CST

Remote Location: https://uchicago.zoom.us/j/4015085138?pwd=eU5ZNzc3Mmg3bTFiQ2E3ejZqSXlWQT09 


Title: Optimizing tensor transmission for distributed machine learning inference pipeline via real-time feedback

Abstract: In order to fully take advantage of hardware capability, machine learning pipelines, including video analytics pipeline and LLM inference pipeline, are becoming more and more distributed. We focus on optimizing the tensor transmission in these pipelines. Our key observation is that: the end-to-end system can achieve much higher performance if the tensor transmission is driven by the feedback from the tensor receiver side, but this feedback is delay-sensitive and must be obtained in real-time. This thesis identifies such feedback, namely region-of-interest in video analytics pipeline, and KV cache hit information in LLM inference pipeline, and proposes real-time feedback-driven tensor transmission that maximally realizes the potential of this feedback.  



Advisors: Junchen Jiang

Committee Members: Junchen Jiang, Ariel Holtzman, Shan Lu, Ganesh Ananthanarayanan, Ion Stoica.



More information about the cs mailing list