<html><head><meta http-equiv="content-type" content="text/html; charset=us-ascii"></head><body style="overflow-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;"><i>Department of Computer Science and Data Science Institute Presents</i><br><br><b>Jiachen Wang<br>PhD Candidate<br>Princeton University<br><br>Tuesday, February 25th<br>2:00pm - 3:00pm <br>In-Person: John Crerar Library Rm 390</b><div><b><br></b></div><div><b style="white-space: pre-wrap; color: rgb(0, 0, 0);"><a href="https://uchicagogroup.zoom.us/j/93787118774?pwd=mibucNdLXQ0BnV6aKkhxXLMKIkTPRp.1">https://uchicagogroup.zoom.us/j/93787118774?pwd=mibucNdLXQ0BnV6aKkhxXLMKIkTPRp.1</a></b></div><div><br style="white-space: pre-wrap; caret-color: rgb(31, 31, 31); color: rgb(31, 31, 31);"><b style="white-space: pre-wrap; caret-color: rgb(31, 31, 31); color: rgb(31, 31, 31);">Meeting ID: 937 8711 8774</b><br style="white-space: pre-wrap; caret-color: rgb(31, 31, 31); color: rgb(31, 31, 31);"><b style="white-space: pre-wrap; caret-color: rgb(31, 31, 31); color: rgb(31, 31, 31);">Passcode: 009256</b><br style="white-space: pre-wrap; caret-color: rgb(31, 31, 31); color: rgb(31, 31, 31);"><br><b>Title: Fueling Responsible AI Through Data Attribution</b><br><br><b>Abstract:</b> Understanding how training data shapes model behavior is fundamental to building trustworthy AI systems. Data attribution techniques quantify the influence of individual training examples on machine learning models, providing key insights for developing data-centric algorithms (e.g., data curation) as well as addressing data-related challenges (e.g., privacy, safety, and copyright protection).<div><br></div><div>In this talk, I will present our recent advances in the foundations and practical frameworks of data attribution.  First, I will introduce a general, game-theoretic data attribution framework that optimizes for stochastic algorithms.  I will then discuss how we can efficiently conduct data attribution in the challenging setting of large-scale deep learning models (e.g., large language models).  These techniques guide data quality management, explain model predictions, and boost trustworthy AI development from a data-centric perspective. <br><br><b>Bio:</b> Jiachen ("Tianhao") is a Ph.D. student at Princeton University, advised by Prof. Prateek Mittal. His research focuses on developing theoretical foundations and practical tools for trustworthy machine learning from a data-centric perspective. Most recently, he has been developing scalable, theoretically grounded data attribution and curation techniques for foundation models. His contributions have been recognized through multiple fellowships and oral/spotlight presentations at top AI/ML venues. He was selected as a Rising Star in Data Science in 2024.<br><br><img id="<E65D0A0B-1350-477E-8828-D9979A0042FD>" src="cid:254361A9-B98B-41A6-85CE-5F8B012B677E" alt="headshot_Jiachen_Wang_1.jpeg" class="Apple-web-attachment"><br><br>Host: Raul Castro Fernandez<br><div><br></div><div>---</div><div>

Holly Santos<br>Executive Assistant to Hank Hoffmann, Liew Family Chair<br>Department of Computer Science<br>The University of Chicago<br>5730 S Ellis Ave-217   Chicago, IL 60637<br>P: 773-834-8977<br>hsantos@uchicago.edu<br><br><br class="Apple-interchange-newline"><br class="Apple-interchange-newline"><br class="Apple-interchange-newline">

</div>

<br></div></div></body></html>