[Colloquium] Seminar Announcement: Making a Case for Distributed File Systems at Exascales - TODAY!

Ninfa Mayorga ninfa at ci.uchicago.edu
Fri May 13 06:55:02 CDT 2011


Computation Institute- Data Lunch Seminar (DLS)

Speaker: Ioan Raicu, Assistant Professor, Department of Computer  
Science, Illinois Institute of Technology
Date: May 13, 2011
Time: 12:00 PM - 1:00 PM
Location: The University of Chicago, Searle 240A, 5735 S. Ellis Avenue

Making a Case for Distributed File Systems at Exascales

Abstract: Exascale computers will enable the unraveling of significant  
scientific mysteries. Predictions are that 2019 will be the year of  
exascales, with millions of compute nodes and billions of threads of  
execution. The current state-of-the-art storage (but decades-old  
approach) in High-End Computing (HEC), in which storage is segregated  
from compute nodes and connected by a network, will not scale with the  
expected exponential growth in concurrency. At exascales, basic  
functionality at high concurrency levels will suffer poor performance,  
and combined with system mean-time-to-failure in hours, will lead to a  
performance collapse for heroic applications. Storage has the  
potential to be the Achilles heel of exascale systems. We propose that  
future HEC systems be designed with non-volatile memory on every  
compute node. Every compute node would actively participate in the  
metadata and data management, leveraging many-core processors high  
bisection bandwidth in torus networks. Distributed metadata management  
would be used, implemented in a distributed data-structure, tailored  
for HEC, supporting constant time operations by emphasizing  
trustworthy/reliable hardware, fast network interconnects, non- 
existent node "churn", low latencies, and scientific computing data- 
access patterns. The data would be partitioned and spread out over  
many nodes based on the data access patterns. Replication would be  
used to ensure data availability, and cooperative caching would  
deliver high aggregate throughput. Data would be indexed, by including  
descriptive, provenance, and system metadata on each file. There would  
be a variety of data-access semantics, from POSIX-like interfaces for  
generality, to relaxed semantics for increased scalability. This talk  
discusses this revolutionary new storage architecture that will make  
exascale computing more tractable, touching all disciplines in HEC,  
fueling scientific discovery and global economic development. This new  
architec ture will extend the knowledgebase beyond HEC into commodity  
systems as the fastest machines generally become mainstream systems in  
a matter of years.

Bio: Dr. Ioan Raicu is an assistant professor in the Department of  
Computer Science (CS) at Illinois Institute of Technology (IIT), as  
well as a guest research faculty in the Math and Computer Science  
Division (MCS) at Argonne National Laboratory (ANL). He is also the  
founder and director of the Data-Intensive Distributed Systems  
Laboratory (DataSys) at IIT. He has received the prestigious NSF  
CAREER award (2011 - 2015) for his innovative work on distributed file  
systems for exascale computing. He was a NSF/CRA Computation  
Innovation Fellow at Northwestern University in 2009 - 2010, and  
obtained his Ph.D. in Computer Science from University of Chicago  
under the guidance of Dr. Ian Foster in March 2009. He is a 3-year  
award winner of the GSRP Fellowship from NASA Ames Research Center.  
His research work and interests are in the general area of distributed  
systems. His work focuses on a relatively new paradigm of Many-Task  
Computing (MTC), which aims to bridge the gap between tw o predominant  
paradigms from distributed systems, High-Throughput Computing (HTC)  
and High-Performance Computing (HPC). His work has focused on defining  
and exploring both the theory and practical aspects of realizing MTC  
across a wide range of large-scale distributed systems. He is  
particularly interested in resource management in large scale  
distributed systems with a focus on many-task computing, data  
intensive computing, cloud computing, grid computing, and many-core  
computing. His work has been funded by the NASA Ames Research Center,  
DOE Office of Advanced Scientific Computing Research, the NSF/CRA  
CIFellows program, and the NSF CAREER program. More information can be  
found at http://www.cs.iit.edu/~iraicu/ and http://datasys.cs.iit.edu/.


Information: Lunch will be provided



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20110513/a0f90911/attachment.htm 


More information about the Colloquium mailing list