Effectiveness of data-driven neural learning in terms of both local mimima trapping and convergence rate is addressed. Such issues are investigated in a case study involving the training of one-hidden-layer feedforward neural networks with the extended Kalman filter, which reduces the search for the optimal network parameters to a state estimation problem, as compared to descent-based methods. In this respect, the performances of the training are assessed by using the Cramer-Rao bound, along with a novel metric based on an empirical criterion to evaluate robustness with respect to local minima trapping. Numerical results are provided to illustrate the performances of the training based on the extended Kalman filter in comparison with gradient-based learning.
Data‐driven performance metrics for neural network learning
Angelo Alessandri;Marcello Sanguineti
2023-01-01
Abstract
Effectiveness of data-driven neural learning in terms of both local mimima trapping and convergence rate is addressed. Such issues are investigated in a case study involving the training of one-hidden-layer feedforward neural networks with the extended Kalman filter, which reduces the search for the optimal network parameters to a state estimation problem, as compared to descent-based methods. In this respect, the performances of the training are assessed by using the Cramer-Rao bound, along with a novel metric based on an empirical criterion to evaluate robustness with respect to local minima trapping. Numerical results are provided to illustrate the performances of the training based on the extended Kalman filter in comparison with gradient-based learning.File | Dimensione | Formato | |
---|---|---|---|
IJACSP23.pdf
accesso aperto
Tipologia:
Documento in versione editoriale
Dimensione
1.43 MB
Formato
Adobe PDF
|
1.43 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.