Tagged
Calculus of Variations
PDE and Machine Learning (3): Variational Principles and Optimization
What is the essence of neural-network training? When we run gradient descent in a high-dimensional parameter space, is there a deeper continuous-time dynamics at work? As the network width tends to …