[Colloquium] November 19th: CS Distinguished Lecture Series - Been Kim (Google DeepMind) Alignment and interpretability: how we might get it right

Holly Santos via Colloquium colloquium at mailman.cs.uchicago.edu
Wed Oct 23 08:14:38 CDT 2024


Department of Computer Science Distinguished Lecture Series Presents

Been Kim
Google DeepMind
Senior Staff Research Scientist

Tuesday, November 19th
2:00pm - 3:00pm
In Person: John Crerar Library Rm 390

Title: Alignment and interpretability: how we might get it right

Abstract: The main goal of interpretability is to enable communication between humans and machines, whether it's a value, knowledge, or an objective. In this talk, I argue that a better way to enable this communication is for humans to expand what they know and learn new things. Doing so enables us to also expand what machines know—by building better-aligned machines. I share why considering the representational gap is crucial in solving the alignment problem, and I provide an example of bridging the knowledge gap.

Bio: Been Kim is a senior staff research scientist at Google DeepMind.  Her research focuses on helping humans to communicate with complex machine learning models: 1) building tools to aid human's collaboration with machines (and detect when those tools fail) 2) study machines' general nature and 3) leveraging machines' knowledge to benefit humans. She gave a talk at  the G20 meeting in Argentina in 2019 and a keynote at ICLR 2022 and ECML 2020. Her work TCAV received UNESCO  Netexplo award, was featured at Google I/O 19'. Her work is in a chapter of Brian Christian's book on  "The Alignment Problem". She is the General chair at ICLR2024, was a Senior Program Chair at ICLR 2023 and advisory board at TRAILS. She has been a senior area chair at  NeurIPS, ICML, ICLR, AISTATS and others for the past few years. She is a steering committee  member of FAccT conference and SATML. She received her PhD. from MIT.

[Been_head.jpeg]

Host: Rebecca Willett

——

Holly Santos
Executive Assistant to Hank Hoffmann, Chairman
Department of Computer Science
The University of Chicago
5730 S Ellis Ave-217   Chicago, IL 60637
P: 773-834-8977
hsantos at uchicago.edu

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20241023/3f247758/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Been_head.jpeg
Type: image/jpeg
Size: 29699 bytes
Desc: Been_head.jpeg
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20241023/3f247758/attachment-0001.jpeg>


More information about the Colloquium mailing list