[Colloquium] TODAY 2/14 Ari Holtzman (Washington) Controlling Large Language Models: Generating (Useful) Text from Models We Don't Fully Understand

Holly Santos hsantos at uchicago.edu
Tue Feb 14 08:49:50 CST 2023


Department of Computer Science Seminar

Ari Holtzman
PhD Candidate, Computer Science & Engineering
University of Washington

Tuesday, February 14th
2:00pm - 3:00pm
In Person: John Crerar Library 390

Zoom:
https://uchicagogroup.zoom.us/j/91667606873?pwd=UmYvSWNGdm1ES3UwZExicG04Ri9mZz09

Meeting ID: 916 6760 6873
Passcode: 323372

Title: Controlling Large Language Models: Generating (Useful) Text from Models We Don’t Fully Understand
Abstract:
Generative language models have recently exploded in popularity, with services such as ChatGPT deployed to millions of users. These neural models are fascinating, useful, and incredibly mysterious: rather than designing what we want them to do, we nudge them in the right direction and must discover what they are capable of. But how can we rely on such inscrutable systems?

This talk will describe a number of key characteristics we want from generative models of text, such as coherence and correctness, and show how we can design algorithms to more reliably generate text with these properties. We will also highlight some of the challenges of using such models, including the need to discover and name new and often unexpected emergent behavior. Finally, we will discuss the implications this has for the grand challenge of understanding models at a level where we can safely control their behavior.

Bio:
Ari Holtzman is a PhD student at the University of Washington. His research has focused broadly on generative models of text: how we can use them and how we can understand them better. His research interests have spanned everything from dialogue, including winning the first Amazon Alexa Prize in 2017, to fundamental research on text generation, such as proposing Nucleus Sampling, a decoding algorithm used broadly both in deployed systems such as the GPT-3 API and in academic research. Ari completed an interdisciplinary degree at NYU combining Computer Science and the Philosophy of Language.

---
Holly Santos
Executive Assistant to Michael J. Franklin, Chairman
Department of Computer Science
The University of Chicago
5730 S Ellis Ave-217   Chicago, IL 60637
P: 773-834-8977
hsantos at uchicago.edu
