[Theory] 8/17 Thesis Defense: Ruotian Luo, TTIC

Mary Marre mmarre at ttic.edu
Fri Aug 13 08:25:47 CDT 2021


*Thesis Defense: Ruotian Luo, TTIC*

*When:  *     Tuesday*,* August 17th at *8:00 - 10:00 am CT*



*Virtually: *  *Register in advance here
<https://uchicago.zoom.us/meeting/register/tJUrfuuuqD4pGtFW2b2FRbZI7dz7Y2ePyH6B>*



*Who: *        Ruotian Luo, TTIC


*Thesis title: *Goal-Driven Text Descriptions for Images

*Abstract: *While visual understanding has achieved significant progress in
recent years, only perception is not enough for building AI; an AI agent
also needs to know how to talk, especially how to communicate with a human.
In the thesis, I study how we design goals for generating more meaningful
texts given visual inputs. First, I will introduce a method to generate
informative image tags, based on the idea of information utility: how much
information does a tag convey about the image. Then, I will discuss how we
incorporate “discriminability” in generating referring expressions and
image captions. Specifically, I will introduce a speaker-listener framework
where the speaker generates the expressions/captions, and the listener
tells if they are discriminative or not.

*Thesis Advisor:* *Greg Shakhnarovich* <greg at ttic.edu>




Mary C. Marre
Faculty Administrative Support
*Toyota Technological Institute*
*6045 S. Kenwood Avenue*
*Chicago, IL  60637*
*mmarre at ttic.edu <mmarre at ttic.edu>*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/theory/attachments/20210813/7961b97e/attachment.html>


More information about the Theory mailing list