[CS] Yibo Jiang Dissertation Defense May 21, 2025

via cs cs at mailman.cs.uchicago.edu
Mon May 19 10:17:36 CDT 2025


This is an announcement of Yibo Jiang's Dissertation Defense.
===============================================
Candidate: Yibo Jiang

Date: Wednesday, May 21, 2025

Time:  1 pm CST

Remote Location: https://uchicago.zoom.us/j/98115267085?pwd=PspaiWHB3ZFIxTLvbjLnGkdZBO5mVH.1  Meeting ID: 981 1526 7085 Passcode: 107782

Location: JCL 298

Title: Geometric and Algebraic Structures in Foundation Model Representations

Abstract: Foundation models, such as large language models (LLMs), operate within vector spaces, whereas human perception of concepts does not naturally align with this framework. This raises a fundamental question: how do these models internalize the structure of concepts within a vector space and how do they use it? To address this, the thesis investigates two structural properties—partial orthogonality and linearity—and also studies how information in representations can be effectively leveraged by the self-attention architecture. The first part examines how models represent the intuitive notion of "semantic independence." Rather than formally defining semantic independence, the focus is on the algebraic axioms of independence and how they can be represented in the forms of partial orthogonality in the embedding space. The second part investigates linear representations. While the concept of linearity appears straightforward, its underlying basis—especially in LLMs trained solely on next-token prediction—remains a largely unresolved mystery. This thesis provides new insights into this phenomenon by showing the connection between linear representations and the implicit bias of gradient descent. Finally, the third part examines representations in a practical setting—fact retrieval—and explores how self-attention can effectively combine stored information in representations to retrieve the most relevant outputs, functioning like associative memory.

Advisors: Victor Veitch

Commitee: Victor Veitch, Bryon Aragam, Ari Holtzman, Yuxin Chen


More information about the cs mailing list