[Colloquium] REMINDER: 3/29 Talks at TTIC: Ramakrishna Vedantam, Georgia Tech

Wed Mar 28 14:18:03 CDT 2018

 When:     Thursday, March 29th at *11:00 am*

Where:    TTIC, 6045 S Kenwood Avenue, 5th Floor, Room 526

*Who:       *Ramakrishna Vedantam, Georgia Tech

Title: Connecting Vision and Language for Interpretation, Grounding and
Imagination

Abstract:
Understanding how to model vision and language jointly is a long-standing
challenge in artificial intelligence. Vision is one of the primary sensors
we use to perceive the world, while language is our data structure to
represent and communicate knowledge. In this talk, we will take up three
lines of attack to this problem: interpretation, grounding, and imagination.

In interpretation, the goal will be to get machine learning models to
understand an image and describe its contents using natural language in a
contextually relevant manner. In grounding, we will connect natural
language to referents in the physical world, and show how this can help
learn common sense. Finally, we will study how to ‘imagine’ visual concepts
completely and accurately across the full range and (potentially unseen)
compositions of their visual attributes. We will study these problems from
computational as well as algorithmic perspectives and suggest exciting
directions for future work.

Host: Greg Shakhnarovich <greg at ttic.edu>

Mary C. Marre
Administrative Assistant
*Toyota Technological Institute*
*6045 S. Kenwood Avenue*
*Room 504*
*Chicago, IL  60637*
*p:(773) 834-1757*
*f: (773) 357-6970*
*mmarre at ttic.edu <mmarre at ttic.edu>*

On Thu, Mar 22, 2018 at 4:05 PM, Mary Marre <mmarre at ttic.edu> wrote:

> When:     Thursday, March 29th at *11:00 am*
>
> Where:    TTIC, 6045 S Kenwood Avenue, 5th Floor, Room 526
>
> *Who:       *Ramakrishna Vedantam, Georgia Tech
>
> Title: Connecting Vision and Language for Interpretation, Grounding and
> Imagination
>
> Abstract:
> Understanding how to model vision and language jointly is a long-standing
> challenge in artificial intelligence. Vision is one of the primary sensors
> we use to perceive the world, while language is our data structure to
> represent and communicate knowledge. In this talk, we will take up three
> lines of attack to this problem: interpretation, grounding, and imagination.
>
> In interpretation, the goal will be to get machine learning models to
> understand an image and describe its contents using natural language in a
> contextually relevant manner. In grounding, we will connect natural
> language to referents in the physical world, and show how this can help
> learn common sense. Finally, we will study how to ‘imagine’ visual concepts
> completely and accurately across the full range and (potentially unseen)
> compositions of their visual attributes. We will study these problems from
> computational as well as algorithmic perspectives and suggest exciting
> directions for future work.
>
>
> Host: Greg Shakhnarovich <greg at ttic.edu>
>
>
>
>
>
> Mary C. Marre
> Administrative Assistant
> *Toyota Technological Institute*
> *6045 S. Kenwood Avenue*
> *Room 504*
> *Chicago, IL  60637*
> *p:(773) 834-1757 <(773)%20834-1757>*
> *f: (773) 357-6970 <(773)%20357-6970>*
> *mmarre at ttic.edu <mmarre at ttic.edu>*
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20180328/9e8fa755/attachment-0001.html>