Brain Advance Access originally published online on March 10, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Brain, Vol. 127, No. 4, 851-859, 2004
© 2004 Guarantors of Brain
doi: 10.1093/brain/awh100
Cortico-striatal contributions to feedback-based learning: converging data from neuroimaging and neuropsychology
1 Center for Molecular and Behavioral Neuroscience and 2 Department of Psychology, Rutgers University, Newark, 3 Department of Neurology, UMDNJ/Robert Wood Johnson Medical School, New Brunswick, New Jersey, and 4 Department of Psychology, UCLA, Los Angeles, California, USA
Correspondence to: Daphna Shohamy, Center for Molecular and Behavioral Neuroscience, Rutgers University, 197 University Avenue, Newark, NJ, USA E-mail: shohamy{at}axon.rutgers.edu
| Summary |
|---|
|
|
|---|
The striatum has been widely implicated in cognition, but a precise understanding of its role remains elusive. Here we present converging evidence for the role of the striatum in feedback-based learning. In a prior functional imaging study, healthy controls showed striatal activity during a feedback-based learning task, which was decreased when the same task was learned without feedback. In the present study, we show that individuals with striatal dysfunction due to Parkinsons disease are impaired on the feedback-based task, but not on a non-feedback version of the same task. Parkinsons patients and controls also used different learning strategies depending on feedback structure. This study provides direct behavioural evidence from humans that cortico-striatal systems are necessary for feedback-based learning on a cognitive task. These findings also link between learning impairments in Parkinsons disease and the physiological and computational evidence for the role of midbrain dopaminergic systems in feedback processing.
Key Words: cognition; learning and memory; basal ganglia; dopamine; Parkinsons disease
Abbreviations: MMSE= Mini-Mental State Examination; UPDRS = Unified Parkinsons Disease Rating Scale.
Received July 17, 2003. Revised December 11, 2003. Accepted December 12, 2003.
| Introduction |
|---|
|
|
|---|
Recent advances in understanding the neural bases of learning and memory have emphasized a critical role for cortico-striatal circuitry in supporting a habit or procedural learning system (Squire, 1994
Significant advances have been made in recent years into functional neurophysiological, neurochemical and neurocomputational characteristics of the striatum (Schultz et al., 1997
; Schultz, 2002
; Beiser and Houk, 1998
; Horvitz, 2000
). Collectively, these studies emphasize a role for dopaminergic projections to the striatum in modifying behavioural responses to environmentally salient stimuli based on response-contingent feedback (Ljunberg et al., 1992
; Schultz et al., 1997
; Hollerman et al., 2000
; Horvitz, 2000
; Schultz, 2002
). These findings suggest, therefore, that striatal disruption may lead to impaired learning when a task relies upon trial-by-trial feedback, but learning may be spared if the same task is learned in an observational manner, with no feedback.
Results from functional neuroimaging in humans support this idea. In a previous functional neuroimaging study of probabilistic classification learning, we found that the striatum was significantly more active during feedback-based learning than during observational learning with no feedback, despite the fact that performance levels were similar in both cases (Poldrack et al., 2001
). The same effect was also found in the midbrain dopaminergic regions. Because neuroimaging cannot establish the necessity of particular regions for task performance (see, for example, Poldrack, 2000
), it is critical to establish that patients with damage to striatal function are specifically impaired at feedback-based learning. Preliminary evidence for this claim can be found in previous neuropsychological studies, since many of the tasks where Parkinsons disease patients showed impaired learning did involve trial-by-trial feedback-based learning (e.g. Knowlton et al., 1996
; Myers et al., 2003
), while many of the tasks that were spared in Parkinsons disease patients did not (for discussion, see Reber and Squire, 1999
). However, those tasks differed in many ways besides the lack of feedback. Thus, although converging evidence points towards a role for the striatum in feedback-based learning, the specific necessity of the striatum for feedback-based learning remains to be established.
For example, in the study by Knowlton et al. (1996
), Parkinsons disease patients and amnesic patients were tested on a probabilistic classification learning task. The study found that while Parkinsons disease patients were impaired compared with control subjects, the amnesic patients initially performed as well as controls. On a post-test questionnaire, the individuals with amnesia failed to remember facts about the testing episode or to recognize visual stimuli that were used in the task; conversely, the Parkinsons disease patients could report facts related to the testing episode and the visual stimuli that appeared on the screen, despite having been unable to master the task. Thus, feedback per se was neither manipulated nor examined in that study. In fact, Knowlton et al. (1996
) assumed that the critical feature of the task related to the Parkinsons disease deficit was its probabilistic nature rather than the feedback-based nature of the learning. If so, then Parkinsons disease patients should presumably be impaired at learning the task even if the training involves non-feedback (observational) learningas long as the category structure is probabilistic.
However, an alternate interpretation of the results of Knowlton et al. (1996
) is that Parkinsons disease patients could learn some details of the task through observation (and hence show good performance on the questionnaire, reporting what they had seen during the experiment), but could not learn the category structures based on feedback. This latter interpretation would be consistent with the prior imaging study showing basal ganglia activation during feedback-based learning of a probabilistic categorization task, but not during non-feedback (observational) learning of the same task (Poldrack et al., 2001
). This would predict that Parkinsons disease patients impairment on probabilistic categorization might be ameliorated if the training involves non-feedback (observational) learning.
The purpose of the present study was to directly assess the role of feedback in learning and memory impairments in patients with Parkinsons disease. Patients and age-matched controls were required to learn a probabilistic classification learning task, similar to tasks previously found to be sensitive to striatal function in behavioural and imaging studies (Knowlton et al., 1996
; Poldrack et al., 2001
). Here, we compared performance on two versions of a probabilistic classification learning task: a feedback-based version and an observational version. In the feedback-based version, subjects were provided with trial-by-trial feedback based on their response to each trial. In the observational version, subjects were shown the stimuli together with the correct outcome, with no behavioural response and no feedback.
Consistent with evidence from electrophysiological and imaging studies, we predicted that Parkinsons disease patients would be impaired at the feedback-based version, but would perform as well as controls on the observational version. The role of cortico-striatal circuitry in feedback-based versus observational learning was further examined by investigating learning strategies in Parkinsons disease and control subjects in each version, using mathematical analyses of learning strategies described previously (see Gluck et al., 2002
). These strategy analyses allow a more detailed analysis of differences in how task performance is influenced by instructional conditions or brain disorders.
| Methods |
|---|
|
|
|---|
Participants
Participants included 28 individuals with a diagnosis of idiopathic Parkinsons disease, randomly assigned to either the feedback (n = 13) or the observational (n = 15) learning conditions. Patients for this study were recruited from the motor disorders clinic at Robert Wood Johnson University Hospital (New Jersey, USA), having met diagnostic criteria for Parkinsons disease as assessed by a neurologist (J.S.), and having given informed consent to participate.
All Parkinsons disease patients were in the mild to moderate stages of the disease, with scores on the HoehnYahr scale of motor function (Hoehn and Yahr, 1967
) that ranged from 1 to 3. Patients motor function was also rated according to the Unified Parkinsons Disease Rating Scale (UPDRS). All Parkinsons disease patients were non-demented, as indicated by scores >24 on the Mini-Mental State State Examination (MMSE; Folstein et al., 1975
). Parkinsons disease patients were also screened for clinical depression, as indicated by scores <15 on the Beck Depression Inventory (Beck et al., 1996
). All patients included in the study were treated with L-dopa, were stable on their medication doses for at least 3 months, and were responding well to the medication. Some patients were additionally being treated with dopamine receptor agonists (11 patients). No patients included in the study were treated with anti-cholinergics. Patients were tested within 3 h since their last dose of medication. Information about Parkinsons disease patients and controls is presented in Table 1.
|
Thwenty-eight healthy control participants were recruited and randomly assigned to either the feedback (n = 13) or the observational (n = 15) learning conditions. Controls were screened for the presence of any neurological disorder or history of psychiatric illness including depression. Controls and Parkinsons disease patients did not differ in terms of age, education or MMSE [ANOVA (analysis of variance) on age, education and MMSE as dependent variables, with group (Parkinsons disease or control) and condition (feedback or observational) as independent variables; all P > 0.05].
All participants signed statements of informed consent before participating in behavioural testing. All studies conformed to research guidelines established by Rutgers University, Robert Wood Johnson and the Federal Government.
Stimuli
Cues were features on a Mr PotatoHeadTM toy (hat, glasses, moustache or bow tie) and subjects were required to predict the preferred flavour of ice cream (vanilla or chocolate). Sample stimuli for each version are shown in Fig. 1.
|
Stimuli were created using a Mr Potato HeadTM set (Toy Story 2TM and Silly SuitcaseTM versions) (Playschool/Hasbro Inc., Pawtucket, RI, USA; items 2260/2289 and 2279). Each stimulus was based on a face to which eyes, ears and other features could be added. Stimuli were photographed directly into the computer using a digital camera; stimuli were then edited further using Adobe Photoshop to ensure consistent image size (2.95" high x 2" wide), resolution (100 pixels/inch) and alignment of the figure within the pictures.
All stimuli consisted of the basic Mr Potato HeadTM face with black eyes, red nose, white arms and green feet. The face had a visible hole and smiling surface texture where the mouth would appear. This basic face was altered by addition of one or more features: cue 1 = black hat, cue 2 = black moustache, cue 3 = red eyeglasses, cue 4 = white bow tie. Fourteen stimuli were devised following the scheme shown in Table 2. In Table2, each pattern is encoded via a numeric four-digit pattern corresponding to whether each of the four features (tie, glasses, moustache, hat) is present (1) or absent (0). Thus, pattern A = 0001 had the black hat, pattern B = 0010 had the black moustache, pattern C = 0011 had both hat and moustache, and so on. Background was a constant light brown with minimal visible shadows.
|
Once all stimuli were constructed, an additional two versions were made of each stimulus pattern: (i) one with the figure holding a vanilla ice cream cone in its right hand (appearing on the left of the picture); and (ii) one with the figure holding a chocolate ice cream cone in its left hand (appearing on the right of the picture). The ice cream cones were taken from the Lil Chefs Bakery Ice Cream Party set (Toys R Us, Paramus, NJ, USA; item 9326). The ice cream cones were photographed separately into the computer and then morphed onto the existing photographs using Adobe Photoshop, to ensure that the appearance of the Mr Potato HeadTM figure was identical in each version of the stimulus.
Two hundred trials were constructed from the 14 patterns. The two outcomes ("vanilla" and "chocolate") were equally probable, but each feature was independently associated with each outcome with a fixed probability as shown in Table 2: the probabilities of "vanilla" given feature 1 (hat), 2 (moustache), 3 (glasses) and 4 (bow tie) were 0.8, 0.6, 0.4 and 0.2, respectively. The probability of "chocolate" given each feature was correspondingly 0.2, 0.4, 0.6 and 0.8. Trials were then constructed to adhere to these probabilities. Table 2 shows the number of times each pattern occurred with each outcome. Thus, for example, feature 1 (hat) appears in seven patterns (AG), which together account for 100 trials; the outcome of vanilla occurs on 80 of these trials, so P("vanilla"/feature 1 present) = 0.8. The 200 trials defined in Table 2 were presented in a random, but fixed order, for all subjects.
Apparatus
The experiment was conducted on a Macintosh G3 or iBook computer with colour screen, programmed in the SuperCard language (Allegiant Technologies, San Diego, CA, USA). The keyboard was masked except for two keys labelled "vanilla" and "chocolate", which the subject used to enter responses. During the observation phase of the observational condition, the keyboard was masked except for one key, labelled NEXT, which the subject pressed to see the next stimulus.
Procedure
Feedback-based condition
The subject was seated in front of the computer at a comfortable viewing distance. Instructions appeared on the computer screen. The subject read these instructions aloud:
Welcome! In this game, you are working in an ice cream shop. Customers will come in and buy vanilla or chocolate ice cream cones. Each time a customer visits, try to guess whether he wants vanilla or chocolate. If you guess correctly, you will earn an extra $1 tip. Try to collect as many tips as you can. Good luck!
Participants were told that at first they would have to guess, but that they would gradually improve their performance.
On each of 200 training trials, the screen showed a PotatoHeadTM figure (without ice cream) along with the prompt: Which flavor do you think he wants? The subject responded by pressing one of the labelled keys; the word "vanilla" or "chocolate", corresponding to the subjects response, appeared below the prompt. At this point, the stimulus pattern was replaced by a picture of the same figure holding either a vanilla or chocolate ice cream cone. If the subjects guess was correct, the word "Correct" appeared at the bottom of the screen, a few coins were added to the image of the tip jar, and a sound of dropping coins was played through the computer speaker. If the guess was incorrect, the word "Incorrect" appeared at the bottom of the screen and the tip jar was unchanged. The figure with ice cream and the feedback remained on the screen during a one-second intertrial interval. If the subject did not respond within 2 s, a prompt appeared: Answer Now! If the subject did not respond within the next 3 s, the trial was terminated and the correct answer was shown.
Observational condition
The procedure was generally similar to that described for the feedback condition above. In the observational condition, training was broken into two phases: an observation phase and a test phase. Before starting the observation phase, the following instructions appeared on the screen:
Welcome to the ice cream parlor. You will see pictures of customers, along with their favorite flavor of ice cream either vanilla or chocolate. Pay attention closely. Later on, you will be asked to remember which flavor of ice cream each customer wants. When you are finished looking at each customer, press the NEXT key to see the next one. Good Luck!
On each trial, a Mr PotatoHeadTM figure appeared with his favourite ice cream (vanilla or chocolate) in hand. When the subject was ready to move to the next customer, they pressed the "NEXT" button on the keyboard. There was no time constraint; however, subjects spent on average 12 s for each observational trial, which was similar to the amount of time allotted for each trial in the feedback-based condition. The observation phase consisted of 100 trials. After the last observation trial, instructions appeared on the screen stating that a new phase was beginning, that subjects would now be shown customers without their ice cream, and that their job was to try and predict the correct flavour of ice cream for each customer. On each trial of the test phase, the screen showed a PotatoHeadTM figure (without ice cream) along with the prompt: Which flavor do you think he wants? The subject responded by pressing "vanilla" or "chocolate" on the keyboard. After selecting a response, subjects were shown the next trial, with no feedback. After the task was completed, subjects were told how many correct responses they had made overall.
Trial order for the 100 observation trials was identical to that used for the first 100 trials in the feedback condition. During the test phase, subjects were tested three times for each of the 14 patterns, for a total of 42 test trials; these trials were presented in a random, but fixed order, for all subjects.
Data collection
On each trial, the computer recorded the pattern, the subjects response and the actual outcome. The subjects response was defined as optimal if it matched the outcome that was most often associated with that pattern across the course of the experiment. For example, since pattern A = 0001 is most often associated with "vanilla" (see Table 2), a "vanilla" response is optimal for that patterneven though on a few trials the actual outcome is "chocolate". Following earlier studies by Knowlton et al. (1994
, 1996) and others, we accordingly defined a correct response as one that obeyed this optimal response rule, regardless of the actual outcome (i.e. whether the participant accurately predicted the outcome). Note that optimal response is undefined for patterns F = 0110 and I = 1001, which are equally often associated with each outcome.
For the feedback condition, percent correct scores were analysed in blocks of 50 trials. For the observational condition, percent correct scores were analysed for performance on the test phase, as described above.
Following prior studies (e.g. Poldrack et al., 2001
; Gluck et al., 2002
), subjects failing to reach a set criterion of 60% correct by the last block (feedback) or test phase (observational) were excluded from further analyses. Based on this criterion, three controls and five Parkinsons disease subjects failed the feedback-based task; four Parkinsons disease and six controls failed the observational task [these did not differ significantly;
2(1) = 0.002, P > 0.5].
Strategy analysis
Strategy analysis followed procedures described in Gluck et al. (2002
). To investigate response strategies, for the entire training session (200 training trials for the feedback-based and 42 trials for the observational condition), we generated response profiles based on how an ideal participant would respond on each trial if they had been following each strategy: multi-cue, one-cue or singleton (see details below). For each participant, we then calculated the degree to which each model fit the participants data, using a least-mean-square measure, with 0.0 indicating a perfect fit. Comparing all strategies examined, the model that most closely approximated a participants individual response profile was defined as the best-fit model for that participant. Because some participants may not be well fit by any pre-defined model, we excluded strategy analysis data from any participant who was not fit by any model within a tolerance of 0.1. Prior studies found that at least 95% of young control subjects were fit within this defined tolerance by one of the strategies described (Gluck et al., 2002
).
As described previously (Gluck et al., 2002
), we considered the following three classes of learning strategies:
(i) Multi-cue strategy: this is the optimal strategy for learning this task. Under this strategy, a participant should respond to each pattern of cues with the outcome most often associated with that pattern. This involves attending to the entire pattern (i.e. all four cues) present on each trial. A participant reliably following this strategy would be scored as making 100% correct optimal responses over the course of the experiment. In addition, we considered two sub-optimal strategies, in which participants focus on single cues or single patterns, rather than on all four cues:
(ii) One-cue strategy: using this strategy, a participant should respond to each pattern based on the presence or absence of a single cue, disregarding the other cues. For example, a participant might respond "vanilla" whenever cue 1 is present and "chocolate" otherwise, regardless of what other cues are present. A participant reliably following this strategy should generate 90% correct optimal responses. (Cue 4, which predicts chocolate with high accuracy, could also be used to generate 90% correct responses. Cues 2 or 3, which are associated less reliably with the two outcomes, could each be used to generate 67% correct responses.)
(iii) Singleton strategy: in this strategy, a participant should learn the outcomes associated with those patterns in which only a single cue appears. For example, a participant would learn that cues 1 and 2 each reliably predict "vanilla", while cues 3 and 4 reliably predict "chocolate". Thus, whenever cues 1 or 2 (alone or together) were present, participants responded "vanilla"; whenever cues 3 or 4 (alone or together) were present, participants responded "chocolate". However, whenever a combination of cues appeared which differed in association (e.g. pattern E with cues 2 and 4 present), responding was random. Note that in this strategy, the response to a pattern cannot be different than the sum of the responses to individual cues. Nevertheless, since patterns A, B, C, D, H and L occur with such high frequency during the experiment (accounting for 54% of all trials), a participant responding correctly to these patterns and randomly to the remaining patterns could achieve up to 77% correct over the course of the experiment.
| Results |
|---|
|
|
|---|
The results include analyses for those subjects who reached criterion performance of at least 60% correct by the last block of training, including 10 controls and eight Parkinsons disease patients for the feedback-based task, and nine controls and 11 Parkinsons disease on the observational task.
Overall learning
Figure 2 shows performance for Parkinsons disease patients and controls on the test phase of the observational task, compared with the corresponding block of the feedback-based task (trials 100150). Consistent with our prediction, Parkinsons disease patients were impaired at learning the feedback-based version, but were not impaired at learning the observational task. An ANOVA on performance by group (dependent variable) and condition (independent variables) revealed a significant main effect of group [F(1,33) = 12.9, P < 0.01] and a significant group x condition interaction [F(1,33) = 5.2, P < 0.05]. Post hoc Tukey analyses confirmed that this was due to a significant difference between Parkinsons disease and controls on the feedback-based task (P < 0.01), but not on the observational task (P = 0.8). Post hoc analyses of performance across conditions showed that Parkinsons disease patients were significantly worse on the feedback-based task compared with the observational task (P < 0.05), whereas there was no difference between the tasks for control subjects (P = 0.9). For comparison with previous studies, learning curves for the feedback-based condition are shown in Fig. 3.
|
|
Learning strategies
We found that subjects engaged in different types of learning strategies in the feedback-based version compared with the observational version. In the feedback-based version, all control subjects and all but two Parkinsons disease patients showed performance consistent with one of three models of probabilistic classification learning (multi-cue, one-cue or singleton strategies; for details see Gluck et al., 2002
Additionally, within the feedback-based task, there were differences between Parkinsons disease and controls in the kind of strategy used during learning as shown in Fig. 4. Specifically, while 50% of control subjects were fit by an optimal multi-cue strategy, only 16% of Parkinsons disease were fit by this strategy. Instead, most Parkinsons disease subjects (>60%) appeared to engage in a useful but sub-optimal singleton strategy. This was a significant difference [Yates corrected Chi-square,
2(1) = 3.88, P < 0.05].
|
Although in the observational task subjects were not fit by these same strategies, nonetheless, many subjects showed responses that appeared to be consistent with particular patterns of learning. Many subjects in each group made consistent responses (zero errors on certain patterns and 100% errors on other patterns), indicating that they may have formulated a specific rule-based strategy they followed in order to predict the outcomes for each pattern. Other subjects showed mixed responses with little or no consistency in responding to particular patterns. Figure 5 presents sample data from two subjects with similar overall levels of performance following each of these response strategies. To quantify the frequency of these strategies among the groups, we defined consistent responders as any subject who made uniform responses to at least nine patterns, and inconsistent responders as any subject who provided non-uniform responses to at least nine patterns. Using these measures, we found that approximately half the subjects in each group were consistent responders; there was no evidence for differences between Parkinsons disease and controls in terms of number of subjects showing consistent versus inconsistent response patterns [Chi-square comparison of number of subjects in each group who responded consistently with number of subjects who did not:
2(1) = 1.9, P > 0.1].
|
For comparison, we applied the same approach to the data from the feedback-based task, looking at consistent versus mixed responding to the last three encounters with each pattern. This analysis revealed that two Parkinsons disease patients and seven control subjects were defined as consistent responders. In line with what would be expected based on the strategy analyses, eight of those subjects defined as consistent responders were also best-fit by the optimal multi-cue strategy (one Parkinsons disease patient was not fit by any of the strategies).
| Discussion |
|---|
|
|
|---|
The results presented here provide behavioural evidence from humans that cortico-striatal systems are necessary for feedback-based learning on a cognitive task. The results present a direct confirmation of a prediction inspired by previous neuroimaging results from normal humans (Poldrack et al., 2001
The present findings are consistent with converging evidence suggesting that the basal ganglia process feedback-related information to modify learning. Electro physiological and anatomical studies show that cortico striatal synapses are modified based on dopaminergic signals from the SNc/VTA (substantia nigra pars compacta/ventral tegmental area), which carry stimulus-specific reward-related information (Calabresi et al., 1992
; Cepeda et al., 1993
; Wickens et al., 1996
; Reynolds et al., 2000
). These feedback-related signals are thought to be critical in modifying the organisms response to future encounters with the same stimulus (Wickens et al., 1996
; Reynolds et al., 2000
). In Parkinsons disease, patients suffer from profound loss of nigro-striatal dopamine neurons, disrupting striatal function (Agid et al., 1987
). Thus, the present findings support the idea that input from midbrain dopaminergic systems is critical in modifying behavioural responses based on trial-by-trial feedback.
The fact that Parkinsons disease patients showed no learning impairments on the observational version suggests that this type of learning is supported by brain systems which are spared in Parkinsons disease, and most likely by the medial temporal lobe. Our prior imaging studies found that while feedback-based learning primarily activated the striatum, learning this task with no feedback was associated with increased activation in the medial temporal lobe compared with feedback-based learning (Poldrack et al., 2001
). The finding in the present study that approximately half of the Parkinsons disease patients and controls learn this task by responding correctly to some patterns, but not others, is also reminiscent of the type of declarative learning strategies typically associated with the medial temporal lobe (Squire, 1994
).
The present findings are thus broadly consistent with prior studies demonstrating that individuals with Parkinsons disease are impaired at procedural learning, but are not impaired on an assessment of declarative memory for training events (such as the visual cues presented during training) (Knowlton et al., 1996
). Adopting a similar reasoning, one might suggest that in the present study, Parkinsons disease patients are impaired on the feedback-based version because it relies upon striatal-dependent procedural learning, while Parkinsons disease patients are spared on the observational version because this task relies upon striatal-independent declarative learning. Although this may be the case, the present study does not provide direct evidence to determine whether subjects were using procedural versus declarative learning. In fact, if the observational task were indeed learned by declarative rule-based mechanisms, all subjects might have been expected to show consistent responding to each stimulus during the test phase. However, we found that approximately half of the subjects in each group did not show such consistent responding, despite achieving overall similar levels of performance (see Fig. 4). Furthermore, recent studies have emphasized the difficulty in a priori defining a task as either procedural or declarative, since multiple strategies can be used to learn a probabilistic categorization tasksome of which may be more easily verbalized than others (Gluck et al., 2002
). Furthermore, although subjects typically can verbalize a learning strategy when asked, modelling response patterns revealed that the actual strategy used during learning is not typically related to subjects verbalized rules (Gluck et al., 2002
). One possibility, of course, is that healthy subjects normally make use of multiple parallel learning systems; in Parkinsons disease patients, where a feedback-based learning system may be damaged, learning must rely on alternate systems.
The present findings indicate that the Parkinsons disease patients perform significantly better when learning relies on observation rather than when the same information is trained in a feedback-based manner. Thus, one question is why the Parkinsons disease patients do not abandon the feedback-based learning strategies (which are impaired) and instead use the observational information that is presented during training to improve on the feedback-based task. The fact that the Parkinsons disease patients do not adopt observational strategies to learn the feedback-based task suggests that not only are they impaired at processing the feedback-related information, but that they can not modulate the use of feedback-based strategies when feedback-related information is presented during training. This may be related to a deficit in shifting or switching strategies, which is often attributed to cortico-striatal circuitry (Downes et al., 1989
; Owen et al., 1993
; Cools et al., 2001
).
The primary deficit in Parkinsons disease is a loss of dopaminergic neurons in the nigro-striatal pathways. However, this loss of dopamine in the striatum is also associated with disrupted frontal lobe function, as well as with disruption of other non-dopaminergic neurotransmitter systems, which are likely to contribute to the cognitive deficits found in Parkinsons disease patients. In addition, Parkinsons disease is commonly treated with L-dopa, which elevates dopamine levels in the brain and alleviates the motor symptoms of the disease. Studies examining the effect of L-dopa on cognitive deficits in Parkinsons disease have led to inconsistent results, with L-dopa either enhancing, having no effect, or impairing cognitive function, depending on task demands (e.g. Swainson et al., 2000
; Cools et al., 2001
). Cools et al. (2001
) have suggested that L-dopa may impair cognitive function by overdosing non-depleted fronto-striatal circuits. Since the patients in the present study were all tested while on-medication, future studies are necessary to determine the impact of L-dopa medication on feedback-based learning.
The present study manipulated the feedback-based structure of the learning task by eliminating all aspects of the feedback, including the reward that was associated with the outcome when it was correct. Therefore, in addition to feedback per se, the two versions of the task also differ in other respects. For example, in the observational condition, there is no requirement for explicit guessing of the predicted outcome, no decision-related motor response, and no reward associated with the correct outcome. Although collectively these aspects of the task are related to its feedback structure, future studies would be instrumental in dissociating the contribution of these feedback-related variables to the present results.
Conclusions
The present study suggests that the striatum plays an important role in feedback-based learning of cue-outcome associations. The findings presented here further suggest that prior inconsistencies, i.e. with Parkinsons disease patients impaired on some learning tasks but not others, can be explained by the specific task demands. In addition, the present study provides a physiological context for understanding the relationship between striatal disruption and the procedural learning deficits found in patients with Parkinsons disease.
| References |
|---|
|
|
|---|
Agid Y, Javoy-Agid F, Ruberg M. Biochemistry of neurotransmitters in Parkinsons disease. Mov Disord 1987; 2: 166230.
Beck AT, Steer RA, Brown GK. Manual for the Beck Depression Inventory. San Antonio (TX): Psychological Corporation; 1996.
Beiser DG, Houk JC. Model of cortical-basal ganglionic processing: encoding the serial order of sensory events. J Neurophysiol 1998; 79: 316888.
Bondi MW, Kaszniak AW. Implicit and explicit memory in Alzheimers disease and Parkinsons disease. J Clin Exp Neuropsychol 1991; 13: 33958.[ISI][Medline]
Calabresi P, Maj R, Pisani A, Mercuri NB, Bernardi G. Long term synaptic depression in the striatum: physiological and pharmacological characterization. J Neurosci 1992; 12: 422433.[Abstract]
Cepeda C, Buchwald NA, Levine MS. Neuromodulatory actions of dopamine in the neostriatum are dependent upon the excitatory amino acid receptor subtypes activated. Proc Natl Acad Sci USA 1993; 90: 957680.
Cools R, Barker RA, Sahakian BJ, Robbins TW. Enhanced or impaired cognitive function in Parkinsons disease as a function of dopaminergic medication and task demands. Cereb Cortex 2001; 11: 113643.
Downes JJ, Roberts AC, Sahakian BJ, Evenden JL, Morris RG, Robbins TW. Impaired extra-dimensional shift performance in medicated and unmedicated Parkinsons disease: evidence for a specific attentional dysfunction. Neuropsychologia 1989; 27: 132943.[CrossRef][ISI][Medline]
Eichenbaum HE, Cohen NJ. From conditioning to conscious recollection: memory systems of the brain. Oxford: Oxford University Press; 2001.
Folstein MF, Folstein SE, McHugh PR. Mini-mental state: a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 1975; 12: 18998.[CrossRef][ISI][Medline]
Gabrieli JD. Cognitive neuroscience of human memory. Annu Rev Psychol 1998; 49: 87115.[CrossRef][ISI][Medline]
Gluck MA, Shohamy D, Myers C. How do people solve the weather prediction task?: individual variability in strategies for probabilistic category learning. Learn Mem 2002; 9: 40818.
Heindel WC, Salmon DP, Shults CW, Walicke PA, Butters N. Neuropsychological evidence for multiple implicit memory systems: a comparison of Alzheimers, Huntingtons and Parkinsons disease patients. J Neurosci 1989; 9: 5827.[Abstract]
Hoehn MM, Yahr MD. Parkinsonism: onset, progression and mortality. Neurology 1967; 17: 42742.
Hollerman JR, Tremblay L, Schultz W. Involvement of basal ganglia and orbitofrontal cortex in goal-directed behavior. Prog Brain Res 2000; 126: 193215.[ISI][Medline]
Horvitz JC. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 2000; 96: 6516.
Jog MS, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM. Building neural representations of habits. Science 1999; 286: 17459.
Knowlton BJ, Squire LR, Gluck MA. Probabilistic classification learning in amnesia. Learn Mem 1994; 1: 10620.
Knowlton BJ, Mangels JA, Squire LR. A neostriatal habit learning system in humans. Science 1996; 273: 1399402.[Abstract]
Koivisto M, Portin R, Rinne JO. Perceptual priming in Alzheimers and Parkinsons diseases. Neuropsychologia 1996; 34: 44957.[CrossRef][ISI][Medline]
Ljungberg T, Apicella P, Schultz W. Responses of monkey dopamine neurons during learning of behavioral reactions. J Neurophysiol 1992; 67: 14563.
Myers CE, Shohamy D, Gluck M, Grossman S, Kluger A, Ferris S, et al. Dissociating hippocampal versus basal ganglia contributions to learning and transfer. J Cogn Neurosci 2003; 15: 18593.
Owen AM, Roberts AC, Hodges JR, Summers BA, Polkey CE, Robbins TW. Contrasting mechanisms of impaired attentional set-shifting in patients with frontal lobe damage or Parkinsons disease. Brain 1993; 116: 115975.
Poldrack RA. Imaging brain plasticity: conceptual and methodological issues. Neuroimage 2000; 12: 113.[CrossRef][ISI][Medline]
Poldrack RA, Clark J, Pare-Blagoev J, Shohamy D, Creso Moyano J, Myers C, et al. Interactive memory systems in the human brain. Nature 2001; 414: 54650.[CrossRef][Medline]
Reber PF, Squire LR. Intact learning of artificial grammars and intact category learning by patients with Parkinsons disease. Behav Neurosci 1999; 113: 23542.[CrossRef][ISI][Medline]
Reynolds JN, Wickens JR. Substantia nigra dopamine regulates synaptic plasticity and membrane potential fluctuations in the rat neostriatum, in vivo. Neuroscience 2000; 99: 199203.[CrossRef][ISI][Medline]
Robbins TW. Refining the taxonomy of memory. Science 1996; 273: 13534. [CrossRef][ISI][Medline]
Saint-Cyr JA, Taylor AE, Lang AE. Procedural learning and neostriatal dysfunction in man. Brain 1988; 111: 94159.
Schultz W. Getting formal with dopamine and reward. Neuron 2002; 36: 24163.[CrossRef][ISI][Medline]
Schultz W, Dayan P, Montague PR. A neural substrate of prediction and reward. Science 1997; 275: 15939.
Smith J, Siegert RJ, McDowall J, Abernethy, D. Preserved implicit learning on both the serial reaction time task and artificial grammar in patients with Parkinsons disease. Brain Cogn 2001; 45: 37891.[CrossRef][ISI][Medline]
Squire LR. Declarative and nondeclarative memory: multiple brain systems supporting learning and memory. In: Schacter DL, Tulving E, editors. Memory systems. Cambridge (MA): MIT Press; 1994. p. 20331.
Squire LR, Zola-Morgan S. Structure and function of declarative and non-declarative memory systems. Proc Natl Acad Sci USA 1996; 93: 1351522.
Swainson R, Rogers RD, Sahakian BJ, Summers BA, Polkey CE, Robbins TW. Probabilistic learning and reversal deficits in patients with Parkinsons disease or frontal or temporal lobe lesions: possible adverse effects of dopaminergic medication. Neuropsychologia 2000; 38: 596612.[CrossRef][ISI][Medline]
Vriezen ER, Moscovitch M. Memory for temporal order and conditional associative-learning in patients with Parkinsons disease. Neuropsychologia 1990; 28: 128393.[CrossRef][ISI][Medline]
Wickens JR, Begg AJ, Arbuthnott GW. Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro. Neuroscience 1996; 70: 1-5.
Witt K, Nuhsman A, Deuschl G. Intact artificial grammar learning in patients with cerebellar degeneration and advanced Parkinsons disease. Neuropsychologia 2002; 40: 153440.[CrossRef][ISI][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
V. Klucharev, A. Smidts, and G. Fernandez Brain mechanisms of persuasion: how 'expert power' modulates memory and attitudes Soc Cogn Affect Neurosci, August 7, 2008; (2008) nsn022v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. F. Doeller, J. A. King, and N. Burgess Parallel striatal and hippocampal systems for landmarks and boundaries in spatial memory PNAS, April 15, 2008; 105(15): 5915 - 5920. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Weiler, C. Bellebaum, and I. Daum Aging affects acquisition and reversal of reward-based associative learning Learn. Mem., April 1, 2008; 15(4): 190 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Lin, K. J Sullivan, A. D Wu, S. Kantak, and C. J Winstein Effect of Task Practice Order on Motor Skill Learning in Adults With Parkinson Disease: A Pilot Study Physical Therapy, September 1, 2007; 87(9): 1120 - 1131. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Balleine, M. R. Delgado, and O. Hikosaka The Role of the Dorsal Striatum in Reward and Decision-Making J. Neurosci., August 1, 2007; 27(31): 8161 - 8165. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Landmann, S. Dehaene, S. Pappata, A. Jobert, M. Bottlaender, D. Roumenov, and D. Le Bihan Dynamics of Prefrontal and Cingulate Activity during a Reward-Based Logical Deduction Task Cereb Cortex, April 1, 2007; 17(4): 749 - 759. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Cincotta and C. A. Seger Dissociation between Striatal Regions while Learning to Categorize via Feedback and via Observation. J. Cogn. Neurosci., February 1, 2007; 19(2): 249 - 265. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Seger and C. M. Cincotta Dynamics of Frontal, Striatal, and Hippocampal Systems during Rule Learning Cereb Cortex, November 1, 2006; 16(11): 1546 - 1555. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Seger The Basal Ganglia in Human Learning Neuroscientist, August 1, 2006; 12(4): 285 - 290. [Abstract] [PDF] |
||||
![]() |
K. Foerde, B. J. Knowlton, and R. A. Poldrack Modulation of competing memory systems by distraction PNAS, August 1, 2006; 103(31): 11778 - 11783. [Abstract] [Full Text] [PDF] |
||||
![]() |
|












