CUED Publications database

Deviation from the matching law reflects an optimal strategy involving learning over multiple timescales.

Iigaya, K and Ahmadian, Y and Sugrue, LP and Corrado, GS and Loewenstein, Y and Newsome, WT and Fusi, S (2019) Deviation from the matching law reflects an optimal strategy involving learning over multiple timescales. Nat Commun, 10. 1466.

Full text not available from this repository.

Abstract

Behavior deviating from our normative expectations often appears irrational. For example, even though behavior following the so-called matching law can maximize reward in a stationary foraging task, actual behavior commonly deviates from matching. Such behavioral deviations are interpreted as a failure of the subject; however, here we instead suggest that they reflect an adaptive strategy, suitable for uncertain, nonstationary environments. To show this, we analyzed the behavior of primates performing a dynamic foraging task. In such a nonstationary environment, learning on both fast and slow timescales is beneficial: fast learning allows the animal to react to sudden changes, at the price of large fluctuations (variance) in the estimates of task-relevant variables. Slow learning reduces these fluctuations, but at the cost of a bias that causes systematic behavioral deviations. Our behavioral analysis shows that the animals solved this bias-variance tradeoff by combining learning on both fast and slow timescales, suggesting that learning on multiple timescales can be a biologically plausible mechanism for optimizing decisions under uncertainty.
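
The idea of combining fast and slow learning can be illustrated with a minimal sketch. It is not the model fitted in the paper: the class name, the two time constants, the mixing weight, and the ratio-based choice rule are all assumptions chosen for illustration. Each option's reward income is tracked by two exponential moving averages, one with a short time constant (responsive but noisy) and one with a long time constant (stable but biased toward the long-run average), and the two are mixed with a fixed weight.

```python
from dataclasses import dataclass


@dataclass
class TwoTimescaleEstimator:
    """Reward-income estimate for one option, mixing a fast and a slow integrator.

    All names and default values here are hypothetical, not taken from the paper.
    """
    tau_fast: float = 5.0    # time constant in trials (assumed value)
    tau_slow: float = 500.0  # time constant in trials (assumed value)
    w_slow: float = 0.3      # weight given to the slow estimate (assumed value)
    fast: float = 0.0
    slow: float = 0.0

    def update(self, reward: float) -> None:
        # Exponential moving average: move each estimate 1/tau of the way
        # toward the newly observed reward.
        self.fast += (reward - self.fast) / self.tau_fast
        self.slow += (reward - self.slow) / self.tau_slow

    @property
    def value(self) -> float:
        # Weighted mix: more weight on `slow` lowers variance but adds bias.
        return (1.0 - self.w_slow) * self.fast + self.w_slow * self.slow


def choice_probability(a: TwoTimescaleEstimator, b: TwoTimescaleEstimator) -> float:
    """Probability of choosing option `a`, proportional to its estimated income."""
    total = a.value + b.value
    return 0.5 if total == 0 else a.value / total


if __name__ == "__main__":
    left, right = TwoTimescaleEstimator(), TwoTimescaleEstimator()
    # Ten trials in which only the left option is rewarded.
    for _ in range(10):
        left.update(1.0)
        right.update(0.0)
    print(left.fast, left.slow, choice_probability(left, right))
```

In this sketch, setting w_slow to 0 gives a purely fast learner and setting it to 1 gives a purely slow one; intermediate values trade responsiveness against stability, which is the bias-variance tradeoff described in the abstract.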

Item Type: Article
Uncontrolled Keywords: Animals; Appetitive Behavior; Behavior, Animal; Learning; Macaca mulatta; Male; Models, Theoretical; Reward; Time Factors; Uncertainty
Subjects: UNSPECIFIED
Divisions: Div F > Computational and Biological Learning
Depositing User: Cron Job
Date Deposited: 14 Aug 2020 21:27
Last Modified: 04 Mar 2021 04:00
DOI: 10.1038/s41467-019-09388-3