<div dir="ltr"><div dir="ltr"><div class="gmail_default" style="font-size:small"><div dir="ltr" style="color:rgb(80,0,80)"><div class="gmail_default"><div class="gmail_default"><font style="font-family:arial,sans-serif;color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b> </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><font face="arial, sans-serif"> Friday</font><span class="gmail_default" style="font-family:arial,sans-serif">, August 12th</span><font face="arial, sans-serif"> at</font><b><font face="arial, sans-serif"> </font><span style="background-color:rgb(255,255,0)"><font face="verdana, sans-serif">1:30 pm CT</font></span></b></font></font></div><div class="gmail_default"><p style="font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div class="gmail_default"><font face="arial, sans-serif"><b><font color="#500050">Where: </font><font color="#000000"> </font></b><font color="#000000">Talk will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font style="font-weight:bold"> </font>at</font></div><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font color="#500050"> </font><font color="#000000"> TTIC, 6045 S. Kenwood Avenue</font></font></p><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif" color="#000000"> 5th Floor, Room 530<b> </b></font></p><br></div><div class="gmail_default"><b style="color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap">Virtually:</b><span style="font-size:14px;color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap"> via Panopto (</span><a href="https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=e1bef456-5061-49f6-bf5e-aee9010924ab" target="_blank"><b><font color="#0000ff">livestream</font></b></a>)<br clear="all"></div><div class="gmail_default"><br></div><div class="gmail_default"><div dir="ltr"><div class="gmail_default"><div class="gmail_default"><div class="gmail_default"><div class="gmail_default"><p class="MsoNormal" style="margin:0in 0in 0.0001pt;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b>Who: </b> <font color="#500050"> </font><font color="#000000"><font color="#500050"> </font> </font></font></font></font>Hung-yi Lee, National Taiwan University</p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><br></p><div class="MsoNormal" align="center" style="margin:0in 0in 8pt;font-size:11pt;text-align:center;line-height:15.6933px;font-family:Calibri,sans-serif"><hr size="2" width="100%" align="center"></div><div><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Title:</b><span style="letter-spacing:normal;color:rgb(34,34,34)"> </span><span style="letter-spacing:normal;color:rgb(34,34,34)">Recent Progress of Self-supervised Learning for Speech Processing</span></p><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Abstract: </b><span style="letter-spacing:normal;color:rgb(34,34,34)">Self-supervised learning (SSL) has shown to be vital for advancing research in natural language processing (NLP), computer vision (CV), and speech processing. The paradigm pre-trains a shared model on large volumes of unlabeled data and achieves state-of-the-art for various tasks with minimal adaptation. Then this talk will share the recent advances and findings on SSL models for speech processing done at 2022 Eighth Frederick Jelinek Memorial Summer Workshop (JSALT). I'll start by discussing how to train a better SSL model, including compressing it, making it more robust, and enhancing pre-training with visual information. We then discuss efficient ways to leverage SSL models in downstream tasks, including adapters and hints. We then talk about applying SSL models to prosody-related tasks and unsupervised ASR, and share some possible extended uses of unsupervised ASR. Finally, we'll share a speech SSL toolkit.</span></p></div></div></div></div></div></div></div></div></div><div class="gmail_quote" style="color:rgb(80,0,80)"><div dir="ltr" class="gmail_attr"><b>Bio:</b> Hung-yi Lee (李宏毅) is an associate professor of the Department of Electrical Engineering of National Taiwan University (NTU), with a joint appointment at the Department of Computer Science & Information Engineering of the university. His recent research focuses on developing technology that can reduce the requirement of annotated data for speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He won Salesforce Research Deep Learning Grant in 2019, AWS ML Research Award in 2020, Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, Young Scholar Innovation Award from Foundation for the Advancement of Outstanding Scholarship in 2019, Ta-You Wu Memorial Award from Ministry of Science and Technology of Taiwan in 2019, and The 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan. He owns a YouTube channel teaching deep learning in Mandarin with about 100k Subscribers. <br></div><div dir="ltr" class="gmail_attr"><br></div><div dir="ltr" class="gmail_attr"><b style="white-space:pre-wrap;font-family:arial,sans-serif"><font color="#000000">Host:</font></b><b style="white-space:pre-wrap;font-family:arial,sans-serif"> </b><a href="mailto:klivescu@ttic.edu" target="_blank" style="white-space:pre-wrap;font-family:arial,sans-serif"><b><font color="#0000ff">Karen Livescu</font></b></a></div></div></div><div class="gmail_default" style="font-size:small"><br></div><div class="gmail_default" style="font-size:small"><br></div><div class="gmail_default" style="font-size:small"><br></div><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL 60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Aug 12, 2022 at 12:15 PM Mary Marre <<a href="mailto:mmarre@ttic.edu">mmarre@ttic.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div style="font-size:small"><div dir="ltr"><div><div><font style="font-family:arial,sans-serif;color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b> </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><font face="arial, sans-serif"> Friday</font><span class="gmail_default" style="font-family:arial,sans-serif">, August 12th</span><font face="arial, sans-serif"> at</font><b><font face="arial, sans-serif"> </font><span style="background-color:rgb(255,255,0)"><font face="verdana, sans-serif">1:30 pm CT</font></span></b></font></font></div><div><p style="font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;color:rgb(80,0,80);margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div><font face="arial, sans-serif"><b><font color="#500050">Where: </font><font color="#000000"> </font></b><font color="#000000">Talk will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font style="color:rgb(80,0,80);font-weight:bold"> </font><font style="color:rgb(80,0,80)">at</font></font></div><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font color="#500050"> </font><font color="#000000"> TTIC, 6045 S. Kenwood Avenue</font></font></p><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif" color="#000000"> 5th Floor, Room 530<b> </b></font></p><br></div><div><b style="color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap">Virtually:</b><span style="font-size:14px;color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap"> via Panopto (</span><a href="https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=e1bef456-5061-49f6-bf5e-aee9010924ab" target="_blank"><b><font color="#0000ff">livestream</font></b></a>)<br clear="all"></div><div><br></div><div><div dir="ltr"><div><div><div><div><p class="MsoNormal" style="margin:0in 0in 0.0001pt;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><font style="color:rgb(80,0,80)"><b>Who: </b> </font><font color="#500050" style="color:rgb(80,0,80)"> </font><font color="#000000"><font color="#500050"> </font> </font></font></font></font>Hung-yi Lee, National Taiwan University</p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><br></p><div class="MsoNormal" align="center" style="margin:0in 0in 8pt;font-size:11pt;text-align:center;line-height:15.6933px;font-family:Calibri,sans-serif"><hr size="2" width="100%" align="center"></div><div><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Title:</b><span style="letter-spacing:normal;color:rgb(34,34,34)"> </span><span style="letter-spacing:normal;color:rgb(34,34,34)">Recent Progress of Self-supervised Learning for Speech Processing</span></p><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Abstract: </b><span style="letter-spacing:normal;color:rgb(34,34,34)">Self-supervised learning (SSL) has shown to be vital for advancing research in natural language processing (NLP), computer vision (CV), and speech processing. The paradigm pre-trains a shared model on large volumes of unlabeled data and achieves state-of-the-art for various tasks with minimal adaptation. Then this talk will share the recent advances and findings on SSL models for speech processing done at 2022 Eighth Frederick Jelinek Memorial Summer Workshop (JSALT). I'll start by discussing how to train a better SSL model, including compressing it, making it more robust, and enhancing pre-training with visual information. We then discuss efficient ways to leverage SSL models in downstream tasks, including adapters and hints. We then talk about applying SSL models to prosody-related tasks and unsupervised ASR, and share some possible extended uses of unsupervised ASR. Finally, we'll share a speech SSL toolkit.</span></p></div></div></div></div></div></div></div></div></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><b>Bio:</b> Hung-yi Lee (李宏毅) is an associate professor of the Department of Electrical Engineering of National Taiwan University (NTU), with a joint appointment at the Department of Computer Science & Information Engineering of the university. His recent research focuses on developing technology that can reduce the requirement of annotated data for speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He won Salesforce Research Deep Learning Grant in 2019, AWS ML Research Award in 2020, Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, Young Scholar Innovation Award from Foundation for the Advancement of Outstanding Scholarship in 2019, Ta-You Wu Memorial Award from Ministry of Science and Technology of Taiwan in 2019, and The 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan. He owns a YouTube channel teaching deep learning in Mandarin with about 100k Subscribers. <br></div><div dir="ltr" class="gmail_attr"><br></div><div dir="ltr" class="gmail_attr"><b style="white-space:pre-wrap;font-family:arial,sans-serif"><font color="#000000">Host:</font></b><b style="white-space:pre-wrap;color:rgb(80,0,80);font-family:arial,sans-serif"> </b><a href="mailto:klivescu@ttic.edu" style="white-space:pre-wrap;font-family:arial,sans-serif" target="_blank"><b><font color="#0000ff">Karen Livescu</font></b></a></div></div></div><div><div dir="ltr"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL 60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Aug 11, 2022 at 1:24 PM Mary Marre <<a href="mailto:mmarre@ttic.edu" target="_blank">mmarre@ttic.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div style="font-size:small"><div><div dir="ltr"><div><div><font style="font-family:arial,sans-serif;color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b> </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><font face="arial, sans-serif"> Friday</font><span class="gmail_default" style="font-family:arial,sans-serif">, August 12th</span><font face="arial, sans-serif"> at</font><b><font face="arial, sans-serif"> </font><span style="background-color:rgb(255,255,0)"><font face="verdana, sans-serif">1:30 pm CT</font></span></b></font></font></div><div><p style="font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;color:rgb(80,0,80);margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div><font face="arial, sans-serif"><b><font color="#500050">Where: </font><font color="#000000"> </font></b><font color="#000000">Talk will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font style="color:rgb(80,0,80);font-weight:bold"> </font><font style="color:rgb(80,0,80)">at</font></font></div><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font color="#500050"> </font><font color="#000000"> TTIC, 6045 S. Kenwood Avenue</font></font></p><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif" color="#000000"> 5th Floor, Room 530<b> </b></font></p><br></div><div><b style="color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap">Virtually:</b><span style="font-size:14px;color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap"> via Panopto (</span><a href="https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=e1bef456-5061-49f6-bf5e-aee9010924ab" target="_blank"><b><font color="#0000ff">livestream</font></b></a>)<br clear="all"></div><div><br></div><div><div dir="ltr"><div><div><div><div><p class="MsoNormal" style="margin:0in 0in 0.0001pt;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><font style="color:rgb(80,0,80)"><b>Who: </b> </font><font color="#500050" style="color:rgb(80,0,80)"> </font><font color="#000000"><font color="#500050"> </font> </font></font></font></font>Hung-yi Lee, National Taiwan University</p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><br></p><div class="MsoNormal" align="center" style="margin:0in 0in 8pt;font-size:11pt;text-align:center;line-height:15.6933px;font-family:Calibri,sans-serif"><hr size="2" width="100%" align="center"></div><div><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Title:</b><span style="letter-spacing:normal;color:rgb(34,34,34)"> </span><span style="letter-spacing:normal;color:rgb(34,34,34)">Recent Progress of Self-supervised Learning for Speech Processing</span></p><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Abstract: </b><span style="letter-spacing:normal;color:rgb(34,34,34)">Self-supervised learning (SSL) has shown to be vital for advancing research in natural language processing (NLP), computer vision (CV), and speech processing. The paradigm pre-trains a shared model on large volumes of unlabeled data and achieves state-of-the-art for various tasks with minimal adaptation. Then this talk will share the recent advances and findings on SSL models for speech processing done at 2022 Eighth Frederick Jelinek Memorial Summer Workshop (JSALT). I'll start by discussing how to train a better SSL model, including compressing it, making it more robust, and enhancing pre-training with visual information. We then discuss efficient ways to leverage SSL models in downstream tasks, including adapters and hints. We then talk about applying SSL models to prosody-related tasks and unsupervised ASR, and share some possible extended uses of unsupervised ASR. Finally, we'll share a speech SSL toolkit.</span></p></div></div></div></div></div></div></div></div></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><b>Bio:</b> Hung-yi Lee (李宏毅) is an associate professor of the Department of Electrical Engineering of National Taiwan University (NTU), with a joint appointment at the Department of Computer Science & Information Engineering of the university. His recent research focuses on developing technology that can reduce the requirement of annotated data for speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He won Salesforce Research Deep Learning Grant in 2019, AWS ML Research Award in 2020, Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, Young Scholar Innovation Award from Foundation for the Advancement of Outstanding Scholarship in 2019, Ta-You Wu Memorial Award from Ministry of Science and Technology of Taiwan in 2019, and The 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan. He owns a YouTube channel teaching deep learning in Mandarin with about 100k Subscribers. <br></div><div dir="ltr" class="gmail_attr"><br></div><div dir="ltr" class="gmail_attr"><b style="white-space:pre-wrap;font-family:arial,sans-serif"><font color="#000000">Host:</font></b><b style="white-space:pre-wrap;color:rgb(80,0,80);font-family:arial,sans-serif"> </b><a href="mailto:klivescu@ttic.edu" style="white-space:pre-wrap;font-family:arial,sans-serif" target="_blank"><b><font color="#0000ff">Karen Livescu</font></b></a></div></div></div><div><div dir="ltr"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small"><br></span></div><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small"><br></span></div><br></div></div></div></div><div><div dir="ltr"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL 60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, Aug 6, 2022 at 12:02 PM Mary Marre <<a href="mailto:mmarre@ttic.edu" target="_blank">mmarre@ttic.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div style="font-size:small"><div dir="ltr"><div><div><font style="font-family:arial,sans-serif;color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b> </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><font face="arial, sans-serif"> Friday</font><span class="gmail_default" style="font-family:arial,sans-serif">, August 12th</span><font face="arial, sans-serif"> at</font><b><font face="arial, sans-serif"> </font><span style="background-color:rgb(255,255,0)"><font face="verdana, sans-serif">1:30 pm CT</font></span></b></font></font></div><div><p style="font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;color:rgb(80,0,80);margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div><font face="arial, sans-serif"><b><font color="#500050">Where: </font><font color="#000000"> </font></b><font color="#000000"><span>Talk</span> will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font style="color:rgb(80,0,80);font-weight:bold"> </font><font style="color:rgb(80,0,80)">at</font></font></div><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font color="#500050"> </font><font color="#000000"> <span>TTIC</span>, 6045 S. Kenwood Avenue</font></font></p><p class="MsoNormal" style="margin:0in;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif" color="#000000"> 5th Floor, Room 530<b> </b></font></p><br></div><div><b style="color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap">Virtually:</b><span style="font-size:14px;color:rgb(60,64,67);font-family:Roboto,Arial,sans-serif;letter-spacing:0.2px;white-space:pre-wrap"> via Panopto (</span><a href="https://uchicago.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=e1bef456-5061-49f6-bf5e-aee9010924ab" target="_blank"><b><font color="#0000ff">livestream</font></b></a>)<br clear="all"></div><div><br></div><div><div dir="ltr"><div><div><div><div><p class="MsoNormal" style="margin:0in 0in 0.0001pt;line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><font style="color:rgb(80,0,80)"><b>Who: </b> </font><font color="#500050" style="color:rgb(80,0,80)"> </font><font color="#000000"><font color="#500050"> </font> </font></font></font></font>Hung-yi Lee, National Taiwan University</p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><br></p><div class="MsoNormal" align="center" style="margin:0in 0in 8pt;font-size:11pt;text-align:center;line-height:15.6933px;font-family:Calibri,sans-serif"><hr size="2" width="100%" align="center"></div><div><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Title:</b><span style="letter-spacing:normal;color:rgb(34,34,34)"> </span><span style="letter-spacing:normal;color:rgb(34,34,34)">Recent Progress of Self-supervised Learning for Speech Processing</span></p><p style="color:rgb(60,64,67);letter-spacing:0.2px;white-space:pre-wrap"><b style="letter-spacing:normal;color:rgb(34,34,34)">Abstract: </b><span style="letter-spacing:normal;color:rgb(34,34,34)">Self-supervised learning (SSL) has shown to be vital for advancing research in natural language processing (NLP), computer vision (CV), and speech processing. The paradigm pre-trains a shared model on large volumes of unlabeled data and achieves state-of-the-art for various tasks with minimal adaptation. Then this talk will share the recent advances and findings on SSL models for speech processing done at 2022 Eighth Frederick Jelinek Memorial Summer Workshop (JSALT). I'll start by discussing how to train a better SSL model, including compressing it, making it more robust, and enhancing pre-training with visual information. We then discuss efficient ways to leverage SSL models in downstream tasks, including adapters and hints. We then talk about applying SSL models to prosody-related tasks and unsupervised ASR, and share some possible extended uses of unsupervised ASR. Finally, we'll share a speech SSL toolkit.</span></p></div></div></div></div></div></div></div></div></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><b>Bio:</b> Hung-yi Lee (李宏毅) is an associate professor of the Department of Electrical Engineering of National Taiwan University (NTU), with a joint appointment at the Department of Computer Science & Information Engineering of the university. His recent research focuses on developing technology that can reduce the requirement of annotated data for speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He won Salesforce Research Deep Learning Grant in 2019, AWS ML Research Award in 2020, Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, Young Scholar Innovation Award from Foundation for the Advancement of Outstanding Scholarship in 2019, Ta-You Wu Memorial Award from Ministry of Science and Technology of Taiwan in 2019, and The 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan. He owns a YouTube channel teaching deep learning in Mandarin with about 100k Subscribers. <br></div><div dir="ltr" class="gmail_attr"><br></div><div dir="ltr" class="gmail_attr"><b style="white-space:pre-wrap;font-family:arial,sans-serif"><font color="#000000">Host:</font></b><b style="white-space:pre-wrap;color:rgb(80,0,80);font-family:arial,sans-serif"> </b><a href="mailto:klivescu@ttic.edu" style="white-space:pre-wrap;font-family:arial,sans-serif" target="_blank"><b><font color="#0000ff">Karen Livescu</font></b></a></div></div></div><div><div dir="ltr"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small"><br></span></div><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small"><br></span></div><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small"><br></span></div><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL 60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div></div>
</blockquote></div></div>
</blockquote></div></div>
</blockquote></div></div>