This study investigates motor skill learning in a specific situation in which the only information provided on performance consists of a reward signal. Subjects performed a point-to-point movement. After the end of the movement we displayed a score based on the distance from a hidden via-point - the maximum score corresponds to the via-point belonging to the trajectory path. The score was the sole information available on task performance and subjects got no additional clues of how to make it high. The task is highly redundant, as infinite trajectories are compatible with the maximum score. We specifically looked at how subjects explore the task space and how they exploit task redundancy to maximize their performance. We also compared the experimental data with the solution predicted by an optimal feedback control model. The main findings were that (i) movement outcomes are largely determined by the subject-specific history of exploration; (ii) during learning, path variability gradually decreased; (iii) for the majority of subjects the point of minimum variability gradually gets closer to the point at minimum distance from the hidden via-point; and (iv) overall, subjects don't converge to the 'optimal control' solution.

Quantifying performance and variability in reward-based motor skill learning

TAMAGNONE, IRENE;CASADIO, MAURA;SANGUINETI, VITTORIO
2013-01-01

Abstract

This study investigates motor skill learning in a specific situation in which the only information provided on performance consists of a reward signal. Subjects performed a point-to-point movement. After the end of the movement we displayed a score based on the distance from a hidden via-point - the maximum score corresponds to the via-point belonging to the trajectory path. The score was the sole information available on task performance and subjects got no additional clues of how to make it high. The task is highly redundant, as infinite trajectories are compatible with the maximum score. We specifically looked at how subjects explore the task space and how they exploit task redundancy to maximize their performance. We also compared the experimental data with the solution predicted by an optimal feedback control model. The main findings were that (i) movement outcomes are largely determined by the subject-specific history of exploration; (ii) during learning, path variability gradually decreased; (iii) for the majority of subjects the point of minimum variability gradually gets closer to the point at minimum distance from the hidden via-point; and (iv) overall, subjects don't converge to the 'optimal control' solution.
2013
9781632660251
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11567/783416
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact