[CS] REMINDER: Ziyu Ye MS Presentation/Dec.6th

Thu Dec 5 09:11:58 CST 2024

This is an announcement of Ziyu Ye's MS Presentation
===============================================
Candidate: Ziyu Ye

Date: Friday December 6 2024

Time: 12:00 AM – 1:00 PM CST

Location: JCL 346

Title: Towards Scalable and Self-Improving Artificial Intelligence: On Alignment and Mathematical Reasoning

Abstract: This thesis explores advancements in developing scalable and self-improving artificial intelligence through two primary areas: alignment and mathematical reasoning. First, we introduce Evolving Alignment via Asymmetric Self-Play (eva), a new framework that casts reinforcement learning from human feedback (RLHF) as an asymmetric game between a creator and a solver. Unlike conventional RLHF methods that rely on static prompt distributions, eva enables the generation of progressively informative prompts and solver improvements, resulting in scalable alignment and state-of-the-art performance across benchmarks, without any additional human-crafted prompts.

Next, we propose Reasoning in Reasoning (RiR), a hierarchical framework for neural theorem proving. RiR integrates decomposition and search-based reasoning through a planner-actor game, breaking down complex theorems into sub-goals to improve generalizability and search space efficiency. Empirical results on theorem proving datasets, such as LeanDojo and miniF2F, show that RiR achieves significant performance gains and operates nearly three times faster than existing baselines. We also provide information-theoretic insights into the principles behind RiR's effectiveness.

Together, these contributions push the boundaries of scalable, self-improving AI systems capable of robust alignment and sophisticated reasoning, bridging theoretical insights and practical advancements in training large language models.

Advisors: Yuxin Chen

Committee Members: Yuxin Chen and Haifeng Xu