[CS] Zixuan Zhao MS Presentation/Dec.2nd

via cs cs at mailman.cs.uchicago.edu
Mon Nov 25 10:38:27 CST 2024


This is an announcement of Zixuan Zhao's MS Presentation
===============================================
Candidate: Zixuan Zhao

Date: Monday, December 2nd

Time: 9AM -10AM CST

Location: JCL 298 

Remote Location:  https://uchicago.zoom.us/j/94031362675?pwd=byDRZuWhe1ifTIltIuLjlJBSPvQTJ2.1

Meeting ID: 940 3136 2675
Passcode: 071540

Title: Exploring Non-Contrastive Self-Supervised Learning

Abstract: Self-supervised learning (SSL) is a machine learning paradigm where models learn meaningful representations from unlabeled data by solving algorithmically-generated prediction tasks using inherent structure within the data itself. This method has been crucial in training large language models (LLMs) through next-token prediction and powering image models like DINO. Most vision SSL algorithms generate two augmented views of an image, learning representations by pulling these views closer while pushing apart those from different images. Non-contrastive SSL (nc-SSL) algorithms avoid comparing different image pairs, which risks a collapse where all images degenerates to the same representation. Many nc-SSL methods address this using a predictor and stop-gradient operation, though the reasons for their effectiveness are not fully understood. We developed a codebase to replicate various SSL algorithms and studied their properties. Focusing on DirectCopy, which simplifies the predictor with a non-trainable, closed-form solution, we found a counterintuitive result: an increasing training loss often correlated with better representation learning in 92% of high-performing cases. We also explored biologically plausible adaptations of DirectCopy, showing their performance is comparable to contrastive methods. Finally, we propose FreeCopy, a DirectCopy variation that uses cross-correlation to make the two branches free of data-vectors, bridging nc-SSL methods like BYOL and regularization-based approaches such as Barlow Twins. Our results offer new insights into nc-SSL mechanisms and suggest paths for further study.


Advisors: Rick Stevens

Committee Members: Rick Stevens, Yali Amit, Fangfang Xia



More information about the cs mailing list