Control for neural network analysis

Postdoc position, 24 months

Université Côte d’Azur, J. A. Dieudonné CNRS Lab. (Nice)

Control has recently proved very instrumental for the analysis of neural networks, and for their use in classification. When dealing with very deep residual networks (ResNets), for instance, a relevant approximation is to assume that there is a continuum of layers, indexed by time. In this framework, the composition of a finite number of cells can be interpreted as discretising a continuous ODE, called the neural ODE. In turn, for a large number of layers, the continuous model is meaningful to understand the network properties. The weights, labeled by time, are the controls of the system, and there are several issues that can be efficiently addressed through this point of view. First, in the case of supervised learning: learning a classifier is just learning some input-output map, a standard task in control theory. And meeting the requirements tied to the known data is associated with controllability issues, namely ensemble controllability, ultimately related to controllability on the group of diffeomorphisms on the ambient manifold (Agrachev and Sarychev, 2022; Scagliotti, 2023). Lie bracket techniques are in order for these questions, as well as other more constructive approaches that exploit the structure of the nonlinear activation functions of the original network (Li, Lin and Chen, 2023; Zuazua, 2022; Ruiz-Balet and Zuazua, 2022). Another line of active search to contribute to is the analysis of the convergence of training of neural networks, modelled by neural ODEs, and its relation to the well known turnpike phenomenon in optimal control (Geshkovski and Zuazua, 2022).

Contact

Jean-Baptiste Caillau and Ludovic Rifford

References

Agrachev, A. A.; Sarychev, A. V. Control in the Spaces of Ensembles of PointsSIAM J. Control Optim.58 (2020), no. 3, 1579–1596.

Agrachev, A. A.; Sarychev, A. V. Control on the Manifolds of Mappings with a View to the Deep LearningJ. Dyn. Control Syst. 28 (2021), 989–1008.

Li, Q.; Lin, T; Shen, Z. Deep learning via dynamical systems: An approximation perspectiveJ. Eur. Math. Soc. 25 (2023), 1671–1709.

Ruiz-Balet, D.; Zuazua, E. Neural ode control for classification, approximation and transportSIAM Review 65 (2022), no. 3, 735-773.

Scagliotti, A. Deep Learning approximation of diffeomorphisms via linear-control systemsMCRF 13(2023), no. 3, 1226-1257.