<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div class=""><span class="" style="font-size: 14px; orphans: 2; widows: 2;">UNIVERSITY OF CHICAGO</span><br class="" style="font-size: 14px; orphans: 2; widows: 2;"><span class="" style="font-size: 14px; orphans: 2; widows: 2;">DEPARTMENT OF COMPUTER SCIENCE</span></div><div class="" style="orphans: 2; widows: 2;"><span class="" style="font-size: 14px;">PRESENTS</span></div><div class="" style="orphans: 2; widows: 2;"><span class="" style="font-size: 14px;"><br class=""></span></div><div class="" style="orphans: 2; widows: 2;"><img apple-inline="yes" id="8C0DCD76-A3D1-43E5-8E0F-D61E2EBC0DD3" height="139" width="123" apple-width="yes" apple-height="yes" src="cid:F1C97548-F137-46E4-BEF5-4F4F8B71786A@cs.uchicago.edu" class=""></div><div class="" style="orphans: 2; widows: 2;"><br class=""></div><div class="" style="orphans: 2; widows: 2;"><div class="" style="margin: 0in 0in 0.0001pt;"><b style="font-size: 14px;" class="">Yang Yuan</b></div><div class="" style="margin: 0in 0in 0.0001pt;"><span class=""><span class="" style="font-size: 14px;"><b class="">Cornell University</b></span></span></div><div class="" style="margin: 0in 0in 0.0001pt;"><br class=""></div><div class="" style="margin: 0in 0in 0.0001pt;"><b class=""><font class="" style="font-size: 14px;"><br class=""></font></b></div><div class="" style="margin: 0in 0in 0.0001pt;"><span class="" style="font-size: 14px;"><b class=""><font class="">Tuesday, February 6, 2018 at 3:30 pm <br class="">Ryerson 251</font></b><br class=""></span></div></div><div class="" style="orphans: 2; widows: 2;"><span class="" style="font-size: 14px;"><br class=""></span></div><div class=""><br class=""></div><div class=""><div class=""><b class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;">Title:  </b><font color="#212121" face="Roboto, Helvetica, Arial, sans-serif" class=""><span style="font-size: 14px;" class="">Provable and Practical Algorithms for Non-convex Problems in Machine Learning</span></font></div><div class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;"><b class=""><br class=""></b></div><div class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;"><b class="">Abstract:</b></div><div class=""><font color="#212121" face="Roboto, Helvetica, Arial, sans-serif" class=""><span class=""><span style="font-size: 14px;" class="">Machine learning has become one of the most exciting research areas in the world, with various applications. However, there exists a noticeable gap between theory and practice. On one hand, simple algorithm like stochastic gradient descent (SGD) works very well in practice, without satisfactory theoretical explanations. On the other hand, the algorithms from the theory community, although with solid guarantees, tend to be less efficient compared with the techniques widely used in practice, which are usually hand tuned or ad hoc based on intuition. <br class=""><br class="">In this talk, I would like to discuss my effort to bridge theory and practice from two directions. The first direction is “practice to theory”, i.e., to explain and analyze the existing algorithms and empirical observations in machine learning. I will first briefly talk about how SGD escapes saddle points, and then present a two-phase convergence analysis of SGD for the two-layer neural network with ReLU activation. <br class=""><br class="">The other direction is “theory to practice”, i.e., using deep theory tools to obtain new, better and practical algorithms. Along this direction, I will introduce our new algorithm Harmonica that uses Fourier analysis and compressed sensing for tuning hyperparameters. Harmonica supports parallel sampling and works well for tuning neural networks with 30+ hyperparameters.<br class=""><br class=""> </span><br class=""></span></font><b class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;">Bio:</b></div><div class=""><font color="#212121" face="Roboto, Helvetica, Arial, sans-serif" class=""><span style="font-size: 14px;" class=""><i class="">Yang Yuan is a sixth year CS PhD candidate at Cornell University, advised by Professor Robert Kleinberg. He did his undergraduate study at Peking University (2008-2012). He was a visiting student at MIT/Microsoft New England (2014-2015), and Princeton University (2016 Fall).  He works on the topics at the intersection of machine learning, theory and optimization. </i></span></font></div><div class=""><font color="#212121" face="Roboto, Helvetica, Arial, sans-serif" class=""><span style="font-size: 14px;" class=""><i class=""><br class=""></i></span></font></div><div class=""><div class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;"><b class="">Host:  Risi Kondor</b></div><div class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif; font-size: 14px;"><br class=""></div><div class="" style="color: rgb(33, 33, 33); font-family: Roboto, Helvetica, Arial, sans-serif;"><i class=""><font size="2" class="">Refreshments served after the talk in Ry. 255</font></i></div></div></div></body></html>