Accurately Explaining Exploratory Decisions in a Non-Stationary Bandit Problem: A Recovery Study of the Kalman Filter Model
Alexander O. Savi
10.6084/m9.figshare.1286872.v1
https://figshare.com/articles/journal_contribution/Accurately_Explaining_Exploratory_Decisions_in_a_Non_Stationary_Bandit_Problem_A_Recovery_Study_of_the_Kalman_Filter_Model/1286872
<p>Abstract: Daw, O’Doherty, Dayan, Seymour, and Dolan (2006) claim that a model combining the Kalman filter with the softmax rule can explain human decisions in a non-stationary four-armed bandit task. This paper evaluates whether the model's parameters can be recovered accurately while keeping the original conditions intact as far as possible. Three parameters could not be recovered, which indicates serious identification problems. We conclude that the model must be used with caution, and we include suggestions to improve recovery.</p>
<p>Included: Internship report and scripts (appendix).</p>
2015-01-12 10:27:58
Decision making
softmax rule
exploration-exploitation trade-off
identification problem
Statistics
Neuroscience and Physiological Psychology