[Theory] TODAY: [Talks at TTIC] 3/24 TTIC Colloquium: Phillip Isola, MIT
Brandie Jones via Theory
theory at mailman.cs.uchicago.edu
Mon Mar 24 12:00:00 CDT 2025
*When:* Monday, March 24th at *2PM** CT*
*Where: *Talk will be given *live, in-person* at
TTIC, 6045 S. Kenwood Avenue
5th Floor, Room 530
*Virtually:* via Panopto (livestream
<https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=50949600-30e2-4fd2-9ac3-b248011e9af3>
)
*Who: * Phillip Isola, MIT
*Title:* Language as a Camera
*Abstract: * Visual content can be conveyed in many ways. It can be
photographed and captured in an array of pixels, or instead it can be
described through text rich in imagery. Computer vision has traditionally
only dealt with the former format, leaving language processing as the
domain of other fields. In this talk I will reconsider this choice: should
computer vision also deal with language as a fundamental visual format? I
will share our recent work asking: what do language models know about the
visual world? Are they good models of visual data? What kinds of visual
structures do they represent? And how can they be leveraged to improve
vision systems.
*Host: <greg at ttic.edu>Shiry Ginosar <shiry at ttic.edu>*
--
*Brandie Jones *
*Executive **Administrative Assistant*
Toyota Technological Institute
6045 S. Kenwood Avenue
Chicago, IL 60637
www.ttic.edu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/theory/attachments/20250324/26cae5d4/attachment.html>
More information about the Theory
mailing list