[Colloquium] TOMORROW 4/6 Stephen Tu (Google) The Foundations of Machine Learning for Feedback Control

Holly Santos hsantos at uchicago.edu
Wed Apr 5 08:58:15 CDT 2023


Department of Computer Science Seminar

Stephen Tu
Research Scientist
Robotics, Google

Thursday April 6th
2:00pm - 3:00pm
In Person: John Crerar Library 298

Zoom:
https://uchicagogroup.zoom.us/j/95802262150?pwd=L01ZSXU4ZnN3RmpaMUVYSVA1VnhmQT09

Meeting ID: 958 0226 2150
Passcode: 730093

Title: The foundations of machine learning for feedback control

Abstract:
Recent breakthroughs in machine learning have inspired unparalleled optimism about the future capabilities of artificial intelligence. However, despite impressive progress, modern machine learning methods still operate under the fundamental assumption that the data at test time is generated by the same distribution from which training examples are collected. In order to build robust intelligent systems (self-driving vehicles, robotic assistants, smart grids) that safely interact with and control the surrounding environment, one must reason about the feedback effects of models deployed in closed loop.

In this talk, I will discuss my work on developing a principled understanding of learning-based feedback systems, grounded within the context of robotics. First, motivated by the fact that many real-world systems naturally produce sequences of data with long-range dependencies, I will present recent progress on the fundamental problem of learning from temporally correlated data streams. I will show that in many situations, learning from correlated data can be as efficient as if the data were independent. I will then examine how incremental stability, a core idea in classical control theory, can be used to study feedback-induced distribution shift. In particular, I will characterize how an expert policy's stability properties affect the end-to-end sample complexity of imitation learning. I will conclude by showing how these insights lead to practical algorithms and data collection strategies for imitation learning.

Bio:
Stephen Tu is a research scientist at Robotics at Google in New York City. His research interests are focused on a principled understanding of the effects of using machine learning models for feedback control, with specific emphasis on robotics applications. He received his Ph.D. from the University of California, Berkeley in EECS under the supervision of Ben Recht.

----
Holly Santos
Executive Assistant to Michael J. Franklin, Chairman
Department of Computer Science
The University of Chicago
5730 S Ellis Ave-217   Chicago, IL 60637
P: 773-834-8977
hsantos at uchicago.edu
