<div dir="ltr"><div dir="ltr"><div class="gmail_default" style="font-size:small"><div style="color:rgb(80,0,80)"><font face="arial, sans-serif"><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b>    </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit">    Thur<span class="gmail_default">sday, March 24th</span> at<b> <span style="background-color:rgb(255,255,0)">11:0<span class="gmail_default">0</span> am CT</span></b></font></font></font></div><p style="font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;color:rgb(80,0,80);margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div class="gmail_default" style="color:rgb(80,0,80)"><font face="arial, sans-serif"><b>Where:       </b><font color="#500050">Talk will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font color="#0000ff" style="font-weight:bold"> </font><font color="#000000">at</font></font></div><p class="MsoNormal" style="margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif">                   TTIC, 6045 S. Kenwood Avenue</font></p><p class="MsoNormal" style="margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif">                   5th Floor, Room 530<b><span style="color:black"> </span></b></font></p><p class="MsoNormal" style="margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><b><span style="color:black"><br></span></b></font></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>Where:</b>       </font></font>Zoom Virtual Talk (<b><a href="https://uchicagogroup.zoom.us/webinar/register/WN_nXAsFqgjSRCfQU0EYNiFGQ" target="_blank"><font color="#0000ff">register in advance here</font></a></b>)</font></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><font color="#000000"><b>Who: </b> </font><font color="#500050">    </font><font color="#000000">   <span class="gmail_default"> </span><span class="gmail_default"></span></font></font></font>Arman Cohan, </font><span style="color:rgb(34,34,34)">Allen Institute for AI (AI2) and University of Washington</span></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><b style="font-family:arial,sans-serif;color:rgb(34,34,34)"><br></b></p><p class="MsoNormal" style="margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><b style="font-family:arial,sans-serif;color:rgb(34,34,34)">Title:          </b><span style="font-family:arial,sans-serif;color:rgb(34,34,34)">Beyond Sentences and Paragraphs: Towards Document and Multi-document Understanding</span></p><div style="color:rgb(80,0,80)"><div style="color:rgb(34,34,34)"><font face="arial, sans-serif"><br><b>Abstract: </b>During the past few years, there has been significant progress in natural language understanding, primarily due to the advancements in transfer learning methods and the increasing scale of pre-trained language models. However, the majority of progress has been made on tasks concerning short texts with sentences or paragraphs as the basic unit of analysis. Yet, many real-world natural language tasks require understanding full documents which includes learning effective representation of documents, resolving longer range dependencies, structure, and argumentation. Further, certain tasks require incorporating additional context from multiple related documents (e.g., understanding a scientific paper) and aggregating information across multiple documents. In this talk, I will discuss some of our recent works on addressing these challenges. I will first discuss general methods for document representation learning that help to achieve strong downstream performance on a variety of document-level tasks. Then I will focus on how we can have a general pre-trained language model that can process long documents. Using this framework, I will discuss extensions to multi-document natural language understanding for a variety of classification, extraction, and summarization tasks. I will also briefly discuss a few of our newly developed benchmarks from challenging domains that enable us to better measure progress on document natural language understanding.<br><br><b>Bio: </b></font><span style="font-family:arial,sans-serif">Arman Cohan is a Research Scientist at the Allen Institute for AI (AI2) and an Affiliate Assistant Professor at the University of Washington. His broad research interest is developing natural language processing (NLP) methods for addressing information overload. This includes models and benchmarks for document and multi-document understanding, natural language generation and summarization, as well as information discovery and filtering. He is additionally interested in real-world interdisciplinary applications of NLP in the science and health domains. His research has been recognized with multiple awards, including a best paper award at EMNLP 2017, an honorable mention at COLING 2018, and the 2019 Harold N. Glassman Distinguished Doctoral Dissertation award.</span></div></div><div style="color:rgb(80,0,80)"><font face="arial, sans-serif"><b><span class="gmail_default"><br></span></b></font></div><div class="gmail_default" style="color:rgb(80,0,80)"><font face="arial, sans-serif"><b>Host: <a href="mailto:klivescu@ttic.edu" target="_blank">Karen Livescu</a></b></font></div><br class="gmail-Apple-interchange-newline"></div><div class="gmail_default" style="font-size:small"><br></div><div class="gmail_default" style="font-size:small"><br></div><div class="gmail_default" style="font-size:small"><br></div><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL  60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Mar 18, 2022 at 9:59 AM Mary Marre <<a href="mailto:mmarre@ttic.edu">mmarre@ttic.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div style="font-size:small;color:rgb(80,0,80)"><font face="arial, sans-serif"><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>When:</b>    </font></font><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit">    Thur<span class="gmail_default">sday, March 24th</span> at<b> <span style="background-color:rgb(255,255,0)">11:0<span class="gmail_default">0</span> am CT</span></b></font></font></font></div><p style="font-size:small;font-variant-numeric:normal;font-variant-east-asian:normal;font-stretch:normal;line-height:normal;color:rgb(80,0,80);margin:0px"><font face="arial, sans-serif" color="#000000"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><b><span style="background-color:rgb(255,255,0)"><br></span></b></font></font></font></p><div style="font-size:small;color:rgb(80,0,80)"><font face="arial, sans-serif"><b>Where:       </b><font color="#500050">Talk will be given </font><font color="#0000ff" style="font-weight:bold"><u>live, in-person</u></font><font color="#0000ff" style="font-weight:bold"> </font><font color="#000000">at</font></font></div><p class="MsoNormal" style="font-size:small;margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif">                   TTIC, 6045 S. Kenwood Avenue</font></p><p class="MsoNormal" style="font-size:small;margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif">                   5th Floor, Room 530<b><span style="color:black"> </span></b></font></p><p class="MsoNormal" style="font-size:small;margin:0in;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><b><span style="color:black"><br></span></b></font></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="color:rgb(0,0,0);vertical-align:inherit"><font style="vertical-align:inherit"><b>Where:</b>       </font></font>Zoom Virtual Talk (<b><a href="https://uchicagogroup.zoom.us/webinar/register/WN_nXAsFqgjSRCfQU0EYNiFGQ" target="_blank"><font color="#0000ff">register in advance here</font></a></b>)</font></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><font style="vertical-align:inherit"><font style="vertical-align:inherit"><font color="#000000"><b>Who: </b> </font><font color="#500050">    </font><font color="#000000">   <span class="gmail_default"> </span><span class="gmail_default"></span></font></font></font>Arman Cohan, </font><span style="color:rgb(34,34,34)">Allen Institute for AI (AI2) and University of Washington</span></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><font face="arial, sans-serif"><br></font></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><b style="font-family:arial,sans-serif;color:rgb(34,34,34)"><br></b></p><p class="MsoNormal" style="font-size:small;margin:0in 0in 0.0001pt;color:rgb(80,0,80);line-height:normal;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"><b style="font-family:arial,sans-serif;color:rgb(34,34,34)">Title:          </b><span style="font-family:arial,sans-serif;color:rgb(34,34,34)">Beyond Sentences and Paragraphs: Towards Document and Multi-document Understanding</span></p><div style="color:rgb(80,0,80)"><div style="color:rgb(34,34,34)"><font face="arial, sans-serif"><br><b>Abstract: </b>During the past few years, there has been significant progress in natural language understanding, primarily due to the advancements in transfer learning methods and the increasing scale of pre-trained language models. However, the majority of progress has been made on tasks concerning short texts with sentences or paragraphs as the basic unit of analysis. Yet, many real-world natural language tasks require understanding full documents which includes learning effective representation of documents, resolving longer range dependencies, structure, and argumentation. Further, certain tasks require incorporating additional context from multiple related documents (e.g., understanding a scientific paper) and aggregating information across multiple documents. In this talk, I will discuss some of our recent works on addressing these challenges. I will first discuss general methods for document representation learning that help to achieve strong downstream performance on a variety of document-level tasks. Then I will focus on how we can have a general pre-trained language model that can process long documents. Using this framework, I will discuss extensions to multi-document natural language understanding for a variety of classification, extraction, and summarization tasks. I will also briefly discuss a few of our newly developed benchmarks from challenging domains that enable us to better measure progress on document natural language understanding.<br><br><b>Bio: </b></font><span style="font-family:arial,sans-serif">Arman Cohan is a Research Scientist at the Allen Institute for AI (AI2) and an Affiliate Assistant Professor at the University of Washington. His broad research interest is developing natural language processing (NLP) methods for addressing information overload. This includes models and benchmarks for document and multi-document understanding, natural language generation and summarization, as well as information discovery and filtering. He is additionally interested in real-world interdisciplinary applications of NLP in the science and health domains. His research has been recognized with multiple awards, including a best paper award at EMNLP 2017, an honorable mention at COLING 2018, and the 2019 Harold N. Glassman Distinguished Doctoral Dissertation award.</span></div></div><div style="color:rgb(80,0,80)"><font face="arial, sans-serif"><b><span class="gmail_default"><br></span></b></font></div><div style="color:rgb(80,0,80)"><font face="arial, sans-serif"><b>Host: <a href="mailto:klivescu@ttic.edu" target="_blank">Karen Livescu</a></b></font></div><br></div><div style="font-size:small"><br></div><div style="font-size:small"><br></div><div><div dir="ltr"><div dir="ltr"><div><span style="font-family:arial,helvetica,sans-serif;font-size:x-small">Mary C. Marre</span><br></div><div><div><font face="arial, helvetica, sans-serif" size="1">Faculty Administrative Support</font></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1"><b>Toyota Technological Institute</b></font></i></div><div><i><font face="arial, helvetica, sans-serif" color="#3d85c6" size="1">6045 S. Kenwood Avenue</font></i></div><div><font size="1"><i><font face="arial, helvetica, sans-serif" color="#3d85c6">Chicago, IL  60637</font></i><br></font></div><div><b><i><a href="mailto:mmarre@ttic.edu" target="_blank"><font face="arial, helvetica, sans-serif" size="1">mmarre@ttic.edu</font></a></i></b></div></div></div></div></div></div>
</blockquote></div></div>