The way in which animals and humans learn new motor skills through free exploratory movements sequences solely governed by success or failure outcomes is not yet fully understood. Recent advances in machine learning techniques for continuous action spaces led us to construct a motor learning model investigate how animals progressively enhance the efficiency of their behaviors through numerous trials and errors. This study conducts a comprehensive comparison between deep learning models and experimental data from monkey behavior. Notably, we show that the progressive refinement of motor sequences, as they are observed in the animals, do not require the implementation of a complete model of their environment. Rather, it merely requires the capacity to anticipate both movement costs and final reward a few steps ahead in the future following a value iteration principle. Furthermore, the systematic deviations exhibited by the monkeys with respect to the computational model inform us on the presence of individual preferences in either minimizing the duration or the energy consumption, and also on the involvement of alternative ”cognitive” strategies.

Non-instructed Motor Skill Learning in Monkeys: Insights from Deep Reinforcement Learning Models.

Laurene Carminatti;
2024-01-01

Abstract

The way in which animals and humans learn new motor skills through free exploratory movements sequences solely governed by success or failure outcomes is not yet fully understood. Recent advances in machine learning techniques for continuous action spaces led us to construct a motor learning model investigate how animals progressively enhance the efficiency of their behaviors through numerous trials and errors. This study conducts a comprehensive comparison between deep learning models and experimental data from monkey behavior. Notably, we show that the progressive refinement of motor sequences, as they are observed in the animals, do not require the implementation of a complete model of their environment. Rather, it merely requires the capacity to anticipate both movement costs and final reward a few steps ahead in the future following a value iteration principle. Furthermore, the systematic deviations exhibited by the monkeys with respect to the computational model inform us on the presence of individual preferences in either minimizing the duration or the energy consumption, and also on the involvement of alternative ”cognitive” strategies.
File in questo prodotto:
File Dimensione Formato  
2023.12.04.569889v1.full.pdf

accesso aperto

Tipologia: Documento in Pre-print
Dimensione 9.12 MB
Formato Adobe PDF
9.12 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11567/1220803
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact