[CS] Yibo Jiang MS Presentation/Mar 25 2024

Megan Woodward meganwoodward at uchicago.edu
Tue Mar 12 10:49:45 CDT 2024

This is an announcement of Yibo Jiang's MS Presentation.
Candidate: Yibo Jiang

Date: Monday, March 25, 2024

Time: 11 am CDT

Location: JCL 298

Title: Uncovering Meanings of Embeddings via Partial Orthogonality

Abstract: Machine learning tools often rely on embedding text as vectors of real numbers. In this paper, we study how the semantic structure of language is encoded in the algebraic structure of such embeddings. Specifically, we look at a notion of "semantic independence" capturing the idea that, e.g., "eggplant" and "tomato" are independent given "vegetable". Although such examples are intuitive, it is difficult to formalize such a notion of semantic independence. The key observation here is that any sensible formalization should obey a set of so-called independence axioms, and thus any algebraic encoding of this structure should also obey these axioms. This leads us naturally to use partial orthogonality as the relevant algebraic structure. We develop theory and methods that allow us to demonstrate that partial orthogonality does indeed capture semantic independence. Complementary to this, we also introduce the concept of independence-preserving embeddings, where embeddings preserve the conditional independence structures of a distribution, and we prove the existence of such embeddings and approximations to them.
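
As a rough illustration of the idea in the abstract (not taken from the presentation itself), one common reading of partial orthogonality is that two vectors are partially orthogonal given a set of vectors when their residuals, after projecting out the span of that set, are orthogonal. The sketch below assumes that reading. The three-dimensional vectors and the helper names dot, residual, and partially_orthogonal are hypothetical, chosen only to make the example concrete; they are not real embeddings or the author's code.

```python
# Illustrative sketch only: toy 3-d vectors, not real embeddings.
# Assumed definition: a and b are partially orthogonal given a set C
# if their residuals after projecting out span(C) are orthogonal.

def dot(u, v):
    """Euclidean inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))

def residual(v, basis):
    """Return v minus its projection onto span(basis) (Gram-Schmidt)."""
    ortho = []
    for b in basis:
        r = list(b)
        for q in ortho:
            c = dot(r, q) / dot(q, q)
            r = [ri - c * qi for ri, qi in zip(r, q)]
        if dot(r, r) > 1e-12:  # skip vectors already in the span
            ortho.append(r)
    out = list(v)
    for q in ortho:
        c = dot(out, q) / dot(q, q)
        out = [oi - c * qi for oi, qi in zip(out, q)]
    return out

def partially_orthogonal(a, b, given, tol=1e-9):
    """Check whether the residuals of a and b given `given` are orthogonal."""
    return abs(dot(residual(a, given), residual(b, given))) <= tol

# Hypothetical toy vectors: both words share a "vegetable" component.
vegetable = [1.0, 0.0, 0.0]
eggplant = [1.0, 1.0, 0.0]
tomato = [1.0, 0.0, 1.0]

print(dot(eggplant, tomato))                                # 1.0
print(partially_orthogonal(eggplant, tomato, [vegetable]))  # True
print(partially_orthogonal(eggplant, tomato, []))           # False
```

In this toy setup the two word vectors are not orthogonal on their own, but they become orthogonal once the shared "vegetable" direction is projected out, which mirrors the "independent given" phrasing of the abstract.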

Advisor: Victor Veitch

Committee Members: Victor Veitch, Haifeng Xu, Bryon Aragam, and Yuxin Chen

-------------- next part --------------
A non-text attachment was scrubbed...
Name: yibo-master.pdf
Type: application/pdf
Size: 641960 bytes
Desc: yibo-master.pdf
URL: <http://mailman.cs.uchicago.edu/pipermail/cs/attachments/20240312/b5b5878b/attachment-0001.pdf>