[Colloquium] CS Seminar Apr. 4: Raul Castro Fernandez, MIT

Sandra Wallace swallace at cs.uchicago.edu
Mon Mar 25 06:15:44 CDT 2019


UNIVERSITY OF CHICAGO
DEPARTMENT OF COMPUTER SCIENCE
PRESENTS


Raul Castro Fernandez
MIT
	

Thursday, April 4, 2019 at 3:30 pm
Crerar 390


Title:  Data Discovery: Unleashing the Value of Data

Abstract:
Organizations use only a small portion of all data they own. Consequently, most of the potential value is untapped. This happens because their analysts suffer a data discovery problem: when solving a task that requires data, analysts spend more time finding the relevant data than solving the task at hand. The core problem is that there is not adequate infrastructure to support the many different discovery problems organizations face. Hence, finding data remains largely a manual and time-consuming process.

In this talk I'll present Aurum, a system that radically changes how users interact with their organizations' data. With Aurum users can solve discovery problems in minutes instead of weeks. To achieve this, Aurum has three novel features: 1) it makes data discovery programmable so users can solve many different discovery problems by writing different programs; 2) it solves data discovery queries fast, so users can solve their problems in minutes instead of weeks; 3) it scales to large amounts of data, so no relevant data is left behind. In addition, I'll explain how Aurum handles not only structured data such as tables in databases, data lakes, and spreadsheets, but also unstructured data such as PDF files, word documents, and even conversations from Slack channels.

I'll conclude with a vision for how to make data easier to work with and to program, a key ingredient needed to exploit all data available in organizations and enable new applications.

Bio:
In my research I build high-performance systems for discovering, preparing, and processing data. I often use techniques from data management, statistics, and machine learning. At MIT I work with professors Sam Madden and Mike Stonebraker. Before MIT, I completed my PhD at Imperial College London with Peter Pietzuch.

Host:  Aaron Elmore

PDF:

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20190325/be6fb9a1/attachment-0003.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PastedGraphic-2.tiff
Type: image/tiff
Size: 63442 bytes
Desc: not available
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20190325/be6fb9a1/attachment-0001.tiff>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20190325/be6fb9a1/attachment-0004.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: RCF talk.pdf
Type: application/pdf
Size: 520594 bytes
Desc: not available
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20190325/be6fb9a1/attachment-0001.pdf>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20190325/be6fb9a1/attachment-0005.html>


More information about the Colloquium mailing list