论文标题
功能学习过程的免费动力
Free Dynamics of Feature Learning Processes
论文作者
论文摘要
回归模型通常倾向于以回归器组合的形式恢复一个嘈杂的信号,这也称为机器学习中的特征,这本身就是学习过程的结果。已知先前的协方差矩阵与信号的对齐方式在模型的概括属性中起关键作用,即在训练过程中对不可分割的数据进行预测。我们介绍了学习过程的统计物理图。首先,我们重新审视脊回归以获得火车和测试误差的紧凑渐近表达式,从而表明发生了有效概括的条件。由于确切的测试训练样本误差比与随机矩阵属性相结合,因此建立了它。一路以自我能量的形式出现了有效的山脊惩罚\ textemdash \恰好是测试误差比\ textemdash \的火车,该火车提供了对问题的非常简单的参数化。该公式似乎方便解决特征矩阵本身的学习过程。我们根据问题的基本自由度来得出一个自主动力学系统,以决定种群矩阵和信号之间相对对齐的演变。还获得了这些方程的宏观对应物,并揭开了各种动力学机制,从而可以解释模拟学习过程的动力学,并以高精度的单个实验运行的轨迹重现轨迹。
Regression models usually tend to recover a noisy signal in the form of a combination of regressors, also called features in machine learning, themselves being the result of a learning process.The alignment of the prior covariance feature matrix with the signal is known to play a key role in the generalization properties of the model, i.e. its ability to make predictions on unseen data during training. We present a statistical physics picture of the learning process. First we revisit the ridge regression to obtain compact asymptotic expressions for train and test errors, rendering manifest the conditions under which efficient generalization occurs. It is established thanks to an exact test-train sample error ratio combined with random matrix properties. Along the way in the form of a self-energy emerges an effective ridge penalty \textemdash\ precisely the train to test error ratio \textemdash\ which offer a very simple parameterization of the problem. This formulation appears convenient to tackle the learning process of the feature matrix itself. We derive an autonomous dynamical system in terms of elementary degrees of freedom of the problem determining the evolution of the relative alignment between the population matrix and the signal. A macroscopic counterpart of these equations is also obtained and various dynamical mechanisms are unveiled, allowing one to interpret the dynamics of simulated learning processes and reproduce trajectories of single experimental run with high precision.