[CS] Yihua Cheng Candidacy Exam/Dec 2, 2024

via cs cs at mailman.cs.uchicago.edu
Mon Nov 25 10:38:57 CST 2024


This is an announcement of Yihua Cheng's Candidacy Exam.
===============================================
Candidate: Yihua Cheng

Date: Monday, December 02, 2024

Time: 11 am CST

Remote Location: https://uchicago.zoom.us/j/3150893650?pwd=RFgxNEE2MUQ0QzlsVXF3Ym94bDQ2Zz09

Location: JCL 298

Title: Revolutionizing Large Language Model Serving with Knowledge Delivery Network

Abstract: As the use of large language models (LLMs) expands rapidly, so does the range of knowledge needed to supplement various LLM queries. Thus, enabling modular and efficient injection of new knowledge in LLM inference is critical. We argue that compared to the more popular fine-tuning and in-context learning, using KV caches as the medium of knowledge could simultaneously improve the modularity of knowledge injection and the efficiency of LLM serving with low cost and fast response. To make it practical, we envision a Knowledge Delivery Network (KDN), a new component in LLM services that dynamically optimizes the storage, transfer, and composition of KV cache across LLM engines and other compute and storage resources. Just like content delivery networks (CDNs), such as Akamai, enabled the success of the Internet ecosystem through their efficient data delivery, KDNs will be critical to the success of LLM applications through their efficient knowledge delivery.

Advisors: Junchen Jiang

Committee Members: Junchen Jiang, Kexin Pei, Hui Zhang



More information about the cs mailing list