[Colloquium] Ray Sinurat MS Presentation/Feb 6, 2024

meganwoodward at uchicago.edu meganwoodward at uchicago.edu
Tue Jan 23 13:07:54 CST 2024


This is an announcement of Ray Sinurat's MS Presentation
===============================================
Candidate: Ray Sinurat

Date: Tuesday, February 06, 2024

Time: 10 am CST

Location: JCL 390

Title: Towards Continually Learning Application Performance Models

Abstract: Machine learning-based performance models are increasingly being
used to build critical job scheduling and application optimization decisions.
Traditionally, these models assume that data distribution does not change as
more samples are collected over time. However, owing to the complexity and
heterogeneity of production HPC systems, they are susceptible to hardware
degradation, replacement, and/or software patches, which can lead to drift in the
data distribution that can adversely affect the performance models. To this end,
we develop continually learning performance models that account for the distribution
drift, alleviate catastrophic forgetting, and improve generalizability. Our best model
was able to retain accuracy, regardless of having to learn the new distribution of data
inflicted by system changes, while demonstrating a 2x improvement in the prediction
accuracy of the whole data sequence in comparison to the naive approach.


Advisors: Haryadi Gunawi

Committee Members: Haryadi Gunawi, Sandeep Madireddy, Kexin Pei



More information about the Colloquium mailing list