[Colloquium] REMINDER!!! Colloquium at TTIC

Dawn Ellis dellis at ttic.edu
Fri Oct 4 08:27:47 CDT 2013


Monday, October 7th
11:00 am
TTIC 6045 S. Kenwood Ave. Room #526

*Speaker:*
Juri Ganitkevitch

*Title:*
Large-Scale Paraphrasing – Extraction and Application

*Abstract:*
We present the methods and infrastructure behind the paraphrase database
PPDB, a collection of over 220 million syntactically labeled English
paraphrase pairs. PPDB is extracted from over 100 million sentences of
bilingual parallel data and further scored using distributional similarity
information drawn from vast amounts of monolingual text. We discuss the
pivoting approach used to extract syntactic paraphrases from bilingual
corpora, present challenges and solutions in scaling extraction and
application methods to data of this size, and give an overview of our
current and forthcoming efforts to expand PPDB's breadth of coverage and
depth of annotation.

*Bio:*
Juri is a terminal-stage Ph.D. student at the Center for Language and
Speech Processing at Johns Hopkins University, advised by Chris
Callison-Burch. His main research interest is in scaling paraphrase
extraction techniques to large amounts of data, as well as pushing
paraphrase applications towards natural language understanding.

(Please contact Kevin Gimpel at kgimpel at ttic.edu if you would like to meet
with Juri.)

-- 
*Dawn Ellis*
Administrative Coordinator,
Bookkeeper
773-834-1757
dellis at ttic.edu

TTIC
6045 S. Kenwood Ave.
Chicago, IL. 60637



-- 
*Dawn Ellis*
Administrative Coordinator,
Bookkeeper
773-834-1757
dellis at ttic.edu

TTIC
6045 S. Kenwood Ave.
Chicago, IL. 60637
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20131004/14365ded/attachment.htm 


More information about the Colloquium mailing list