[Colloquium] Talks at TTIC: Vicente Ordonez, University of North Carolina at Chapel Hill

Wed Mar 25 08:04:47 CDT 2015

When:     Wednesday, April 1st at 11am

Where:    TTIC, 6045 S Kenwood Avenue, 5th Floor, Room 526

Who:       Vicente Ordonez, University of North Carolina at Chapel Hill

Title:        Language and Perceptual Categorization in Computer Vision

Abstract:
Recently, there has been great progress in both computer vision and natural
language processing in representing and recognizing semantic units like
objects, attributes, named entities, or constituents. These advances
provide opportunities to create systems able to interpret and describe the
visual world using natural language. This is in contrast to traditional
computer vision systems, which typically output a set of disconnected
labels, object locations, or annotations for every pixel in an image. The
rich visually descriptive language produced by people incorporates world
knowledge and human intuition that often can not be captured by other types
of annotations. In this talk, I will present several approaches that
explore the connections between language, perception, and vision at three
levels: learning how to name objects, generating referring expressions for
objects in natural scenes, and producing general image descriptions. These
methods provide a framework to augment computer vision systems with
linguistic information and to take advantage of the vast amount of text
associated with images on the web. I will also discuss some of the
intuitions from linguistics and perception behind these efforts and how
they potentially connect to the larger goal of creating visual systems that
can better learn from and communicate with people.

Bio:
Vicente Ordonez is a PhD student in the Department of Computer Science at
the University of North Carolina at Chapel Hill under the guidance of Prof.
Tamara Berg. His research interests are at the at the intersection of
Computer Vision and Natural Language Understanding. He also works on big
scale visual analytics by learning models that can perform high-level
perceptual tasks. He is a recipient of the 2013 IEEE David Marr Prize in
Computer Vision, a 2012 Yahoo! Key Scientific Challenges Award and a
Renaissance Technologies Fellowship. His work has been published in both
vision and language conferences and journals (ICCV, ECCV, CVPR, NIPS, ACL,
EMNLP, TPAMI, IJCV, TACL).

Host:  Greg Shakhnarovich, greg at ttic.edu

-- 
*Dawn Ellis*
Administrative Coordinator,
Bookkeeper
773-834-1757
dellis at ttic.edu

TTIC
6045 S. Kenwood Ave.
Chicago, IL. 60637
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20150325/5c817cbc/attachment.htm