[Theory] NOW: [TTIC Talks] 3/4 Talks at TTIC: Tamar Shaham, MIT

Brandie Jones via Theory theory at mailman.cs.uchicago.edu
Tue Mar 4 09:55:00 CST 2025


*When:*        Tuesday, March 4th at *10AM CT*


*Where:       *Talk will be given *live, in-person* at

                       TTIC, 6045 S. Kenwood Avenue

                       5th Floor, Room 530


*Virtually:*  via Panopto (livestream
<https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=c90cb587-4d89-4143-99b6-b26e0114525b>
)


*Who: *         Tamar Shaham, MIT

*Title:*        Understanding and Enhancing Deep Neural Networks with
Automated Interpretability
*Abstract: *Deep neural networks are becoming incredibly sophisticated;
they can generate realistic images, engage in complex dialogues, analyze
intricate data, and execute tasks that appear almost human-like. But how do
such models achieve these abilities?

In this talk, I will present a line of work that aims to explain behaviors
of deep neural networks. This includes a new approach for evaluating
cross-domain knowledge encoded in generative models, tools for uncovering
core mechanisms in large language models, and their behavior under
fine-tuning. I will show how to automate and scale the scientific process
of interpreting neural networks with the Automated Interpretability Agent,
a system that autonomously designs experiments on models’ internal
representations to explain their behaviors. I will demonstrate how such
understanding enables mitigating biases and enhancing models’ performance.
The talk will conclude with a discussion of future directions, including
developing universal interpretability tools and extending interpretability
methods to automate scientific discovery.

*Short Bio*: Tamar Rott Shaham is a postdoctoral researcher at MIT CSAIL in
Prof. Antonio Torralba’s lab. She earned her PhD from the ECE faculty at
the Technion, supervised by Prof. Tomer Michaeli. Tamar has received
several awards, including the ICCV 2019 Best Paper Award (Marr Prize), the
Google WTM Scholarship, the Adobe Research Fellowship, the Rothchild
Postdoctoral Fellowship, the Vatat-Zuckerman Postdoctoral Scholarship, and
the Schmidt Postdoctoral Award.

*Host:  <zhiyuanli at ttic.edu> <nati at ttic.edu>Greg Shaknarovich
<greg at ttic.edu>*



-- 
*Brandie Jones *
*Executive **Administrative Assistant*
Toyota Technological Institute
6045 S. Kenwood Avenue
Chicago, IL  60637
www.ttic.edu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/theory/attachments/20250304/32537d18/attachment.html>


More information about the Theory mailing list