[Colloquium] Seminar Announcement: Scalability in the Presence of Variability

Ninfa Mayorga via Colloquium colloquium at mailman.cs.uchicago.edu
Tue Sep 26 16:11:15 CDT 2017


Computation Institute Presentation -  Data Lunch Seminar (DLS)

Speaker:   Brian Kocoloski, Department of Computer Science and Engineering, Washington University, St. Louis
Host:  Kyle Chard
Date:  September 29, 2017
Time: 12:00 PM - 1:00 PM
Location:  Computation Institute, 5735 S. Ellis Ave., Searle 240B

Title:  Scalability in the Presence of Variability

Abstract: 
Supercomputers are used to solve some of the world's most computationally demanding problems. Exascale systems, to be comprised of over one million cores and capable of 1018 floating point operations per second, will probably exist in the next five years, and will provide unprecedented computational power for parallel computing workloads. Unfortunately, while these machines hold tremendous promise and opportunity for applications in High Performance Computing (HPC), graph processing, and machine learning, it will be a major challenge to fully realize their potential, because to do so requires balanced execution across the entire system and its millions of processing elements. When different processors take different amounts of time to perform the same amount of work, performance imbalance arises, large portions of the system sit idle, and time and energy are wasted. Larger systems incorporate more processors and thus greater opportunity for imbalance to arise, as well as larger performance/energy penalties when it does.

In this talk, we explain how to design system software to mitigate variability on large scale parallel machines. Our approaches span (1) the design, implementation, and evaluation of a new high performance operating system to reduce some classes of performance variability, (2) a new performance evaluation framework to holistically characterize key features of variability on new and emerging architectures, and (3) a distributed modeling framework that derives predictions of how and where imbalance is manifesting in order to drive reactive operations such as load balancing and speed scaling. Collectively, these efforts provide a holistic set of tools to promote scalability through the mitigation of variability.
 
Bio: 
Brian Kocoloski is an assistant professor in the Department of Computer Science and Engineering at Washington University in St Louis. Previously, he was a co-op engineer for Advanced Micro Devices (AMD) and an intern at Sandia National Laboratories. He is currently collaborating at the Computational Institute as part of the Chameleon Cloud team. 
 
Kocoloski’s research seeks to make it easier to efficiently utilize large parallel computers. His work has primarily been focused in system software, where he has designed operating systems and virtualization mechanisms to improve the scalability of tightly synchronized parallel workloads. Generally, his research interests lie at the intersection of operating systems, parallel runtime systems, and resource management policies for large scale distributed systems.

Information:  Lunch will be provided 


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cs.uchicago.edu/pipermail/colloquium/attachments/20170926/c66e1b5d/attachment.html>


More information about the Colloquium mailing list