Abstract
Although the content of working memory (WM) can be decoded from the spatial patterns of brain activity in early visual cortex, how populations encode WM representations remain unclear. Here, we address this limitation by using a model-based approach that reconstructs the feature encoded by population activity measured with fMRI. Using this approach, we could successfully reconstruct the locations of memory-guided saccade goals based on the pattern of activity in visual cortex during a memory delay. We could reconstruct the saccade goal even when we dissociated the visual stimulus from the saccade goal using a memory-guided antisaccade procedure. By comparing the spatiotemporal population dynamics we find that the representations in visual cortex are stable, but can also evolve from a representation of a remembered visual stimulus to a prospective goal. Moreover, since the representation of the antisaccade goal cannot be the result of bottom-up visual stimulation, it must be evoked by top-down signals presumably originating from frontal and/or parietal cortex. Indeed, we find that trial-by-trial fluctuations in delay period activity in frontal and parietal cortex predicts the precision with which our model reconstructed the maintained saccade goal based on the pattern of activity in visual cortex. Therefore, the population dynamics in visual cortex encode WM representations and these representations can be sculpted by top-down signals from frontal and parietal cortex.
Introduction
Working memory (WM) enables the brain to maintain behaviorally relevant information over short periods of time while the source of the information is no longer available (Baddeley & Hitch, 1974; Curtis & D’Esposito, 2003). Much of the previous work investigating the neural mechanisms that support WM has focused on the role of the frontal cortex (Funahashi, Bruce, & Goldman-Rakic, 1989; Goldman-Rakic, 1994; Srimal & Curtis, 2008). However, the contents of WM (e.g., orientation, color, spatial frequency, motion) can be decoded from delay period activity in human visual cortex (Albers, Kok, Toni, Dijkerman, & de Lange, 2013; Ester, Serences, & Awh, 2009; Harrison & Tong, 2009; Riggall & Postle, 2012; Serences, Ester, Vogel, & Awh, 2009). Findings such as these support the sensory recruitment model of WM, whereby the same neural mechanisms that encode sensory information are also used to maintain WM representations (D’Esposito & Postle, 2015; Postle, 2006).
Because these decoding studies depend on the patterns of activity across hundreds of voxels, visual cortex appears to encode WM representations within large populations of neurons. Multivariate pattern analysis (MVPA) (Haxby et al., 2001) uses powerful supervised machine learning algorithms to classify distinct brain states (Haynes & Rees, 2006; Kamitani & Tong, 2005; Norman, Polyn, Detre, & Haxby, 2006). Although MVPA has successfully demonstrated there are different patterns of activation in visual cortex associated with different WM features, it is limited in constraining the number of inferences one can draw about why these patterns are different (Freeman, Brouwer, Heeger, & Merriam, 2011; Naselaris, Kay, Nishimoto, & Gallant, 2011; Serences & Saproo, 2012). In contrast to MVPA, which attempts to predict brain states given a pattern of activity, an inverted encoding model (IEM) uses a set of a priori assumptions that form a linking hypothesis about how these brain states give rise to different patterns (Brouwer & Heeger, 2009; Dumoulin & Wandell, 2008). For example, if one assumes that a voxel’s BOLD response during a spatial WM delay period is proportional to the responses of a set of information channels that tile visual space, the mapping between these channels and the BOLD responses can be used to predict the location of the memoranda in the visual field (Ester, Sprague, & Serences, 2015; Sprague, Ester, & Serences, 2014, 2016; Sprague & Serences, 2013). IEMs can reconstruct a full feature space, visual space in the above example, that can then be used for decoding a maintained stimulus feature (e.g., location).
Here, we model the patterns of neural population dynamics in early visual cortex to test several hypotheses. First, we ask if we can reconstruct the locations of memory-guided saccade goals based on the pattern of activity in visual cortex during a memory delay. Second, we ask if we can reconstruct the saccade goal even when we dissociate the location of the visual stimulus from the saccade goal using a memory-guided antisaccade procedure. Third, we test if the population dynamics evolve from a retrospective visual code to a prospective goal code. Fourth, since the representation of the antisaccade goal cannot be the result of bottom-up visual stimulation, it must be evoked by top-down signals, presumably originating from frontal cortex (Moore & Armstrong, 2003). To test this, we ask if trial-by-trial fluctuations in delay period activity in frontal cortex predict the precision with which our model reconstructs the maintained saccade goal based on the pattern of activity in visual cortex.
Methods
Subjects:
Using procedures approved by New York University, five healthy subjects (3 female; right-handed; 25–42 years old) gave informed consent and participated in the study. Each subject participated in six scanning sessions: one to obtain three high-resolution anatomies, one to measure retinotopy, and four to measure responses during memory-guided prosaccade and antisaccade tasks. Portions of the data were previously published in Saber et al. (2015).
Memory-Guided Saccade Tasks:
Subjects performed two memory-guided saccade tasks (Figure 1). Subjects were instructed to remember the location of a briefly presented visual stimulus and immediately plan a saccade to, in the case of memory-guided prosaccades, or away from the stimulus, in the case of memory-guided antisaccades. The stimulus, a high contrast flickering visual checkerboard (1° diameter, spatial frequency 2 cpd, 8 Hz flicker), appeared at one of eight locations separated by 30° angles at 10° eccentricity, avoiding vertical and horizontal meridians. Each trial began with 1.25s of central fixation followed by a brief presentation of the visual stimulus. Subjects maintained the planned prosaccade or antisaccade over a long and variable delay period (3–15s). Disappearance of the fixation along with a brief auditory tone prompted the memory-guided saccade. Subjects held their gaze until the correct location was presented as feedback and corrected any error in their gaze. Subjects then returned to fixation for the inter-trial interval (ITI, 10.5–15s). Partial trials were also included to help break the dependency of the delay on the target. These trials were aborted after the visual stimulus, signaled by a change in the color of the fixation spot. In four sessions, each with eight 322s scans, subjects performed 192 prosaccade and 192 antisaccade trials, blocked by condition.
Oculomotor Methods:
Eye position was recorded in the scanner at 1000 Hz (EyeLink 1000, SR Research, Ontario, Canada). We computed error in degrees of angle of the primary saccade as well as the final error after any quick corrections were made prior to the target feedback. All trials were inspected for noncompliance. Trials in which subjects made an eye movement to the wrong saccade target were rare (0.3% and 0.6% of all pro- and antisaccade trials, respectively). We excluded trials where gaze deviated from fixation greater than 2° during the delay-period or immediately following visual target (4.6% and 2.7% of pro- and anti-saccade trials, respectively).
MRI Acquisition:
All MRI data were acquired on a 3T Allegra head-only scanner (Siemens, Erlangen, Germany) using a head transmit coil (NM-011) and two surface receive coils (NOVA Medical, Wakefield, MA): (1) a four-channel phased array receive surface coil (NMSC-021) for retinotopic mapping, (2) a four-element phased parallel array (NMSC-011) for memory-guided saccade tasks. We acquired T2*-sensitive echo-planar images (repetition time (TR) 1.5s; echo time 30ms; flip angle 75°; 26 slices; 3×3×3mm voxels; 192×192mm field of view). Three T1-weighted MPRAGE scans were averaged and used for gray matter segmentation, cortical flattening, registration, and visualization for creating ROIs (see below). Functional scans were corrected for head motion and aligned across sessions, detrended and high-pass filtered with a cutoff frequency of 0.0167 Hz, and converted to percentage signal modulation.
Retinotopic Mapping:
We defined retinotopic visual areas (V1, V2, V3, V3A/B, IPS0) using a standard phase encoded approach (Wandell, Dumoulin, & Brewer, 2007). Polar angle components of the retinotopic maps were estimated with a high contrast flickering wedge that rotated (clockwise or counter-clockwise) around central fixation (subtended 90° of polar angle). Radial components were estimated with a high contrast flickering annulus expanding or contracting from fixation (subtended 4.2° radius). Each scan consisted of 10.5 cycles with a period of 24s, and lasted 252s. Each subject completed six to eight runs of the rotating wedge aperture and four runs of expanding/contracting annulus aperture. We used flattened cortical surface representations of each subject’s occipital and parietal cortices to visualize amplitude, coherence and phase maps. Visual area boundaries were drawn on retinotopic maps based on standard conventions (Larsson & Heeger, 2006; Wandell et al., 2007). We used voxels from these ROIs in our IEM described below.
Estimating Delay Period Activity:
We used a voxel-wise generalized linear model (GLM) to estimate the responses of each voxel to the stimulus encoding, memory retention interval (i.e., delay period), and memory-guided saccade intervals of each trial (Srimal & Curtis, 2008). Each epoch was modeled by the convolution of a canonical model of the hemodynamic impulse response function with a square wave (box-car regressor) whose duration was equal to the duration of the corresponding interval, detrended and high-pass filtered. Importantly, we estimated beta coefficients for each of the three trial epochs (stimulus, delay, response) for every trial independently. Modeling every event in each trial guaranteed that the beta coefficient corresponding to each epoch estimated the magnitude of BOLD activity specific to each epoch. As described below, we used the beta coefficients from the stimulus encoding epoch of the prosaccade task for model training and the beta coefficients from the delay periods of the prosaccade and antisaccade tasks for model testing.
Inverted encoding model (IEM):
To reconstruct the spatial representation of the retinotopic maps, we used an IEM (Sprague & Serences, 2013). The IEM assumes that the activity of each neuron is a linear combination of the outputs of a set of basis functions, or information channels, tuned to different stimulus values. Similarly, it assumes that the activity of each voxel, modeled as the GLM beta coefficients, is a weighted sum of the activity of all neurons in that voxel. As a result, the activity of each voxel can be equation ted as a linear combination of the output of these basis functions for a given stimulus. This can be shown mathematically as:
(1) |
Where B is a matrix that represents the GLM beta coefficients corresponding to an event of interest (i.e., stimulus encoding, delay period) for all voxels in a given ROI and for all trials. W is a matrix of regression weights and C contains the coefficients of the basis functions. The spatial representation of visual space is a linear combination of all basis functions, each determined by its corresponding coefficient in C. To calculate the channel coefficients in a given trial using equation (1), we use regression weights, W, that link the outputs of the basis functions to each voxel’s activity. Thus, we can rewrite equation (1) as:
(2) |
Where Btrain in (2) is a subset of the data used for training. However, during training we do not have access to the real contribution of each basis function to the population activity and thus we need to use a set of hypothetical channel coefficients, Ctrain. For each trial, we calculate these hypothetical coefficients as the overlap between each basis function and that part of the visual field covered by the stimulus in that trial. Using the regression weights, , and the population activity, Btest, in a given trial, we can calculate each channel’s coefficients as:
(3) |
To avoid circularity in our analysis, we cross-validated our model by using the GLM derived beta coefficients corresponding to the stimulus encoding epoch of the pro-saccade task to calculate the regression weights, . We then used these regression weights to estimate the channel coefficients, Ctest in (3), given the population activity, Btest, during the delay period in the pro-saccade and both encoding and delay periods in the anti-saccade task. For the stimulus encoding epoch of the pro-saccade task, we used a leave-one-out method to train and test the model. For each trial in the dataset, we took that trial out and used the rest of trials to calculate the regression weights, . We then used these regression weights to estimate the channel coefficients, Ctest in (3), given the population activity, Btest,, corresponding to the excluded trial. Thus, adding all basis functions, each multiplied by its corresponding channel coefficient in Ctest, gives us the reconstructed spatial representation of visual space in that trial. Figure 1C shows a schematic of how our model works.
Based on the spatial selectivity of voxels in each ROI in early visual cortex, we constructed a set of basis functions consisting of 2-D Gaussian filters with equal FWHMs distributed uniformly along a circle corresponding to a 10° eccentricity (i.e., eccentricity for the visual targets and saccade goals). The number and width of these basis functions were chosen in a way to both avoid rank-deficiency (caused by too much overlap between channels) and obtain smooth reconstructions. To make sure that using basis functions restricted to the stimulated ring does not bias the reconstructed spatial representation, we also tested a set of basis functions tiling the whole visual field. Since we did not find any meaningful differences in the reconstructed visuospatial maps, here we only report the results from the basis functions restricted to the actual eccentricity used in the experiment.
Statistical analysis:
To evaluate the differences in behavioral performance between pro- and anti-saccade tasks, we measured three parameters for each trial; the primary and final saccade accuracies and the reaction time. We then, for each parameter, performed a permutation test between the measurements in each task, and a random subset of concatenated measurements in both tasks.
To verify that the IEM can reconstruct the WM content uniformly well across different saccade target locations, we plotted the model’s reconstructed locations versus the true target locations. Within each topographic map, we calculated the peak of the reconstructed heat map averaged across all trials for each saccade target location and plotted this against the true target locations. Then, we estimated the slope of lines fit to the these points, where the closer the slope of a line was to one indicated a more precise readout across different locations. Moreover, to evaluate the model performance on a trial-by-trial basis, we defined the readout precision at each trial as where error is the difference between the angular position of the peak in the reconstructed retinotopic map and the actual location in the corresponding trial. We then, measured the skewness of the distribution of precisions. The precision varies between 0 (maximum error) and 1 (minimum error). Thus, the higher the trial-by-trial model performance, the more skewed the precisions distribution towards 1. We measured the significance of the skewness towards 1, by comparing the skewnes corresponding to the original and randomized order of data through a permutation test. Moreover, we identified correctly decoded trials based on the distance between the IEM readout peak and the actual target location in each trial. We then, through a permutation test, compared the proportion of correctly decoded trials from the original and randomized order of data in each ROI. To do so, we shuffled the trial orders 1000 times, for each subject and each ROI, and reconstructed the visual-spatial maps each time. Then, we ran a student t-test between the proportion of correctly decoded trials corresponding to the original order and shuffled.
To measure the spatiotemporal interaction of the reconstructed representation of population activity under different conditions, we fit a 2-D Gaussian to the reconstructed visual-spatial maps and measured the gain, width and precision of the fit. We then used a 2-way ANOVA to test the significance of the interaction between each of these parameters at two time epochs, the encoding and delay periods, and two locations, the stimulus location and saccade goal, which is the same on pro-saccade, but different on anti-saccade trials. We measured the Pearson’s correlation for each voxel between the delay period GLM beta coefficient across trials and the trial-by-trial success of our model’s decoding of memorized space. To identify all voxel clusters across the whole brain that significantly predict the model performance in an ROI, we used, independently for each voxel, a permutation test to compare the z-transformation of above correlations corresponding to the original and randomized order of trials. We then used these clusters to compare between subjects the brain areas the activity of which can significantly predict the model precision in each of the five ROIs. We identified two clusters of voxels from two subjects as the same if 1) the distance between the boundaries of two clusters was not larger than two voxels in the Talaraich TT_N27 brain space, and 2) both clusters were in the same brain area according to Freesurfer’s cortical labels (aparc.a2009s.annot) from the automatic cortical parcellation.
Results
Performance equated on memory guided pro- and anti-saccades
Figure 1B depicts the behavioral results for both pro and ant-saccade trials. Overall, 85% and 91% of all memory-guided pro-saccade and anti-saccade response times (SRTs), respectively, fell within the normal range of visually guided saccades (i.e., 200–500 ms), and SRT did not significantly differ between the conditions (permutation test, P=0.17). The accuracy of memory-guided pro-saccades and anti-saccades was also indistinguishable (permutation test, P=0.56). Moreover, the final eye position after small corrective saccades (Mackey, Devinsky, Doyle, Golfinos, & Curtis, 2016; Mackey, Devinsky, Doyle, Meager, & Curtis, 2016), but before feedback were not significantly different between the two conditions (permutation test, P=0.53). Thus, we were able to compare neural differences unconfounded by behavioral differences between the pro-saccade and anti-saccade conditions.
Modeled population activity in visual cortex reconstructs a representation of WM
We collected BOLD fMRI signal while the subject performed memory guided pro- and anit-saccade tasks (Figure 1A). In both tasks, each trial can be divided into four epochs; central fixation, stimulation (stimulus encoding), delay period, and response. We mainly focused on stimulus encoding and delay period as these are the intervals that the WM content is being encoded and maintained. We used an IEM (Sprague & Serences, 2013), described in detail in Methods, to reconstruct the spatial representation of neural population activity within each retinotopic map during the encoding and delay periods independently for both pro-saccade and anti-saccade trials (Figure 1C).
In the memory-guided pro-saccade task, the subject is instructed to make a saccade towards the stimulus location after a variable delay period. We hypothesized that the activity patterns of neural population ensembles encode and maintain the stimulus location. Utilizing an IEM, we reconstructed the spatial representation of population activity in the five identified retinotopic areas. To average over all possible stimulus locations, we rotated the estimated channel coefficients, aligning all trials to one location, 45° in the upper right quadrant. Figure 2 shows the reconstruction results for a sample subject (Figure 2A) and for the average of all subjects (Figure 2B) during both encoding and delay epochs. The lines on the polar plot depict the relative channel coefficients across the circular space of the model. The heat maps in the background show the reconstructed spatial representation of the neural population activity, which is the linear combination of the information channels each multiplied by its corresponding coefficients. In all retinotopic areas we identified, the IEM model yielded a high fidelity reconstruction of the stimulus location during both time periods. This suggests that the population activity in these retinotopically organized cortical areas encodes and maintain the stimulus location.
Next, we quantified the accuracy and precision of the model in each retinotopic area by estimating the trial-by-trial performance (Figure 3). For each trial, we estimated the accuracy by comparing the peak of the reconstructed visual space within each retinotopic area with the true stimulus location, separately at the encoding and delay epochs. To do this over all stimulus locations, we regressed the decoded locations, averaged over all trials with the same stimulus location, against the true locations. In all five retinotopic areas, we found a tight coupling between reconstructed location and true location as demonstrated by regression slopes that were near 1 (range across ROIs [0.94, 1.05] with largest P = 1.6×10−6). The top row of Figure 3 depicts the relation between the decoded location and true location in a sample retinotopic area (V3) during both encoding and delay epochs. We defined the precision of the model as the inverse of the decoded errors across trials, where smaller decoding error approached a precision of 1. The distribution of precision over trials was highly skewed toward 1 for each of the retinotopic areas (range across ROIs [−1.36, −0.51] with largest P = 1.2×10−13; note negative numbers indicate skewness towards 1). The middle row of Figure 3 depicts the distribution of precision in a sample retinotopic area (V3) during the encoding and delay epochs. Moreover, using various thresholds we asked how often did the model correctly decoded the stimulus location within a confidence interval of 15°, 30°, or 45° of the true location. The bottom row of Figure 3 depicts the percentage of correctly decoded trials at different confidence intervals. For all retinotopic areas, the percentage of correctly decoded trials was significantly above chance even at the lowest interval of 15° (largest P = 2×10−118 for all confidence intervals). Overall, our model of the population activity in these retinotopically organized cortical areas accurately and precisely represented the location of the encoded and maintained stimulus location.
Dissociating WM representations of past visual stimulation from future saccade goals
In the memory-guided pro-saccade task, the subject was instructed to plan and make a saccade towards the stimulus location. Therefore, we do not know if the population response encodes the visual target or the saccade goal because they are the same. To dissociate these two possible causes, we used a memory guided anti-saccade task in which the saccade goal is dissociated from the stimulus location (Figure 1A). We used the same IEM as in the pro-saccade task to reconstruct the spatial representation of neural populations in each of the retinotopic areas during encoding and delay time epochs. To avoid any circularity in the reconstruction, we used the data corresponding to the encoding phase of the pro-saccade task to calculate regression weights for the reconstruction at encoding and delay periods of the anti-saccade task.
Figure 4 depicts the reconstructed retinotopic maps for both stimulus encoding and delay epochs for a sample subject and the average over all subjects. As with the prosaccade data, the lines on the polar plot depict the relative channel coefficients across the circular space of the model. Here, we aligned all trials such that the location of the visual stimulus was at 45° and the goal of the antisaccade was at 135°. In all retinotopic areas we identified, the IEM model reconstruction yielded two peaks, one at the location of the stimulus and another at the location of the saccade goal. Remarkably, this suggests that the population activity in these retinotopically organized cortical areas encode and maintain locations that were never visually stimulated, and therefore, must be the result of top-down stimulation.
As with the prosaccade data, we quantified the accuracy and precision of the model in each retinotopic area by estimating the trial-by-trial performance (Figure 5). Focusing solely on the delay period epoch, we compared separately both the decoded versus true location of the encoded stimulus and the decoded versus true location of the antisaccade goal. As depicted in the top row of Figure 5 for sample area V3, the true locations of the stimulus and antisaccade target were well predicted by the reconstructed locations as evidenced by regression slopes that were near 1 (range across ROIs [0.93, 1.12], largest P = 1.0×10−4). In each of the retinotopic areas, except V2, the distribution of precision over trials was highly skewed toward 1 when decoding the stimulus (range across ROIs [−0.5, −0.24] with largest P = 0.003) and antisaccade target (range across ROIs [−1.0, −0.4] with largest P = 0.009) locations. For V2, the average skewness was −0.4 (P = 0.17). In a sample retinotopic area (V3), the middle row of Figure 5 depicts the distribution of precision of decoding the stimulus and antisaccade target locations based on the pattern of activity in the delay period. Next, we calculated the percentage of correctly decoded stimulus locations and correctly decoded antisaccade target locations within a boundary of 15°, 30°, or 45° of the true location. The bottom row of Figure 5 depicts the percentage of correctly decoded trials at different boundaries. For all retinotopic areas, the percentage of correctly decoded stimulus and antisaccade target locations was significantly above chance even at the lowest interval of 15° (largest P = 8.7×10−56 for all boundaries). Overall, our model of the population activity in these retinotopically organized cortical areas accurately and precisely represented the location of both the past stimulus location and future antisaccade goal.
Population dynamics track the evolution of the shifting prioritized location
In order to perform the antisaccade task, one must first encode the location of the visual stimulus, then compute, plan and maintain the location of the antisaccade target. As the trial evolves, the prioritized location therefore shifts from the stimulus location to the saccade goal. Consistent with this idea, and as can be seen in Figure 4, it appears as if the reconstruction of the visual stimulus is greater during the encoding epoch than the delay period epoch. It also appears as if the reconstruction of the antisaccade target is greater during the delay than then encoding epoch. Building on this observation, we quantified the gain of the stimulus and saccade target locations in the reconstructed visual space during the encoding and delay period epochs. To this end, we fit 2-D Gaussian functions to the peak locations in the reconstructions at both the stimulus and antisaccade target locations. In Figure 6, we plot the average gain at the stimulus and target locations during the encoding and delay epochs for each retinotopic area. As can be seen, there is a strongly significant reconstruction gain interaction between epoch (encoding/delay) and location (stimulus/saccade target) in all retinotopic areas (two-way ANOVA for V1, V2, V3, V3AB, and IPS0: smallest F(1,99) = 597, P = 10−81). This pattern supports the hypothesis that the spatiotemporal pattern of the population activity tracks the prioritized location as it evolves from the visual stimulus to the antisaccade goal.
Delay period activity in frontal and parietal cortex sculpts the population activity in early retinotopic areas improving the precision of readout
We demonstrated that the pattern of population activity in visual cortex represents the antisaccade goal. This representation is not the result of bottom-up visual stimulation, and therefore, must be instantiated by top-down feedback signals. In order to identify the source of these top-down signals, we searched for brain areas whose trial-by-trial fluctuations in the magnitude of delay period activity predicted the precision with which our model could accurately reconstruct the remembered location. We first, for each retinotopic area, created a vector of values that represented the model accuracy, inversely proportional to the error of the reconstructed WM location. Second, using a generalized linear model (GLM) we estimated, for all voxels in the brain, the delay period activity for each trial. Then, we calculated the correlation between decoding accuracy and delay period activity of each voxel. In individual subjects, we found clusters of voxels whose delay period activity significantly predicted the decoding accuracy of the antisaccade goal from the pattern of activity in area V3 (Figure 7A). In the sample subject shown, we find significant clusters in dorsolateral prefrontal cortex, superior precentral sulcus, dorsal parietal cortex, and lateral occipital cortex. Despite some individual differences in the exact locations of significant voxel clusters within brain areas, we found consistent overlap in these four brain areas (Figure 7B). The histograms in Figure 7 depict the distribution of voxel correlations within each of the four brain areas, as well as their significance thresholds determined by permutation testing (Methods). In addition, Table 1 contains a list of all clusters of voxels in the brain, containing more than four contiguous voxels with overlap in at least four out of five subjects, whose activity is significantly correlated with the decoding accuracy in areas V2 and V3. We focus here on areas V2 and V3 because the correlations were particularly strong and consistent, compared to the correlations with decoding from areas V1, V3AB and IPS0, which was less consistent. The patterns of correlations identify both attentional and motor planning networks (Corbetta & Shulman, 2002) as potential sources for the modulatory top-down control signals prioritizing the saccade target location in the population activity in early visual cortex.
Table 1.
Clusters with activity predicting model precision in area V3 | |||||
---|---|---|---|---|---|
Hemisphere | Label | X | Y | Z | Volume (voxels) |
Right | G_occipital_middle | −36 | 77 | −1 | 614 |
Right | G_front_inf-Opercular | −45 | −11 | 8 | 568 |
Right | S_parieto_occipital | 20 | 70 | 27 | 565 |
Left | S_temporal_sup | 32 | 66 | 9 | 445 |
Right | Lat_Fis-post | −32 | 30 | 21 | 307 |
Right | S_central | −31 | 24 | 43 | 278 |
Right | S_postcentral | −17 | 47 | 48 | 220 |
Right | G_and_S_cingul-Ant | −6 | −28 | 28 | 188 |
Right | G_occipital_middle | −47 | 74 | 2 | 176 |
Right | S_front_sup | −17 | −5 | 46 | 176 |
Right | S_intrapariet_and_P_trans | −31 | 42 | 46 | 156 |
Right | S_front_sup | −16 | −41 | 38 | 138 |
Left | G_front_sup | 2 | 5 | 61 | 118 |
Right | G_front_middle | −30 | 37 | 57 | 108 |
Right | G_occipital_middle | −36 | 77 | −1 | 614 |
Clusters with activity predicting model precision in area V2 | |||||
Right | S_central | −34 | 24 | 47 | 627 |
Right | S_calcarine | −23 | 57 | 3 | 500 |
Right | S_circular_insulasup; | −35 | −6 | 9 | 450 |
Right | S_calcarine | −19 | 35 | 0 | 336 |
Right | G_front_middle | −35 | −34 | 22 | 398 |
Left | S_circular_insula_sup | 28 | −11 | 12 | 236 |
Left | G_oc-temp_lat-fusifor | 25 | 66 | −15 | 226 |
Right | G_and_S_cingul-Ant | −19 | −22 | 19 | 222 |
Right | S_postcentral | −27 | 38 | 39 | 218 |
Left | S_front_inf | 41 | −26 | 24 | 210 |
Left | G_and_S_cingul-Ant | 13 | −27 | 20 | 158 |
Right | S_oc-temp_lat | −39 | 52 | −13 | 149 |
Right | S_central | −34 | 24 | 47 | 627 |
Discussion
To summarize, we used model-based fMRI to reconstruct the locations of memory-guided saccade goals based on the pattern of activity in visual cortex during a memory delay. We could reconstruct the saccade goal even when we dissociated the location of the visual target from the saccade goal using a memory-guided antisaccade procedure. During these trials the spatiotemporal population dynamics were informative, as they indicated an evolution from the past remembered visual stimulus to the future prospective goal. Early in the trial the visual target location was primarily encoded in the population reconstruction. As the trial evolved in time, the population began to primarily encode the location of the saccade goal in the opposite hemifield. Therefore, the representation of the antisaccade goal cannot be the result of bottom-up visual stimulation, and must be caused by top-down signals presumably originating from frontal and/or parietal cortex. In support of this hypothesis, we found that trial-by-trial fluctuations in delay period activity in frontal and parietal cortex predicts the precision with which our model reconstructed the maintained saccade goal based on the pattern of activity in visual cortex. Therefore, the population dynamics in visual cortex encode WM representations and these representations can be sculpted by top-down signals from frontal and parietal cortex (Curtis & D’Esposito, 2003; Sreenivasan, Curtis, & D’Esposito, 2014).
Based on probabilistic population coding theory, we believe that neural populations do not encode sensory features – in our task the location of the saccade goal – but instead encode a probability distribution over sensory features (Ma, Beck, Latham, & Pouget, 2006; Zemel, Dayan, & Pouget, 1998). In this framework, the population characteristics, such as how the encoding of feature space is distributed across the topographic population and how a single neuron is tuned to variation over the feature space, determine the population dynamics. Using similar assumptions about the dynamics of populations of neurons, we built our IEM to decode spatial information encoded in the dynamics of populations of voxels in early visual cortex during WM. Using this model, we could successfully reconstruct a spatial representation of the saccade goal from the activity of neural ensembles in early visual cortex. The success of the model provides a strong linking hypothesis that connects decoding to population-level mechanisms. Specifically, we assumed an underlying neural architecture based on the retinotopic organization of the voxels within visual maps, and modeled each voxel’s response with a set of basis functions that tiled visual space. The success of spatial WM decoding depended on the fact that the precise structure of our model’s basis set matched the representation of the feature space distributed across the neural population. Computational models of microcircuits describe how WM representations are maintained by the balance between recurrent local excitation among similarly tuned neurons and long-range inhibition (Lundqvist, Compte, & Lansner, 2010; M. Wang et al., 2013; X. J. Wang, 1999). Two key predictions from these models are relevant. First, WM representations are encoded in the collective response of populations of neurons whose tuning varies along the stimulus dimension. Second, persistent activity in the subset of neurons tuned to the remembered feature maintains the representation over time. In human visual cortex, evidence exists that support both of these predictions. The current study and other human neuroimaging studies have been successful at reconstructing representations of the population response encoding WM features (Ester, Anderson, Serences, & Awh, 2013; Ester et al., 2015; Sprague et al., 2014, 2016; Sprague & Serences, 2013; van Bergen, Ma, Pratte, & Jehee, 2015). Moreover, neural activity persists in early visual cortex during WM delays in humans, measured with BOLD imaging (Geng, Ruff, & Driver, 2009; Saber, Pestilli, & Curtis, 2015), and monkeys, measured with electrophysiology (Super, Spekreijse, & Lamme, 2001; van Kerkoerle, Self, & Roelfsema, 2017). Although the computational models were originally devised to model microcircuits in prefrontal cortex, these results together open the possibility that similar mechanisms may support WM in early visual cortex.
During the memory-guided antisaccade trials, when we dissociated the location of the visual target from the saccade goal, the spatiotemporal population dynamics first encoded the visual target then encoded the saccade goal. Therefore, the representation of the antisaccade goal in early visual cortex must be the result of top-down signals, and not some lingering visual response. Moreover, we recently demonstrated that fMRI activity persists in the specific parts of retinotopic maps that represent the location of memory guided saccade goals (Saber et al., 2015). Whether these persistent BOLD responses in visual cortex reflect increases in spiking or only affect subthreshold synaptic activity remains unclear, but evidence for both exist (Steinmetz & Moore, 2014; Super et al., 2001; van Kerkoerle et al., 2017; Zaksas & Pasternak, 2006). For example, despite the absence of persistent spiking of neurons in monkey areas V4 and MT, feedback signals from FEF increases the response gain of neurons, but only in the presence of visual stimulation (Merrikhi et al., 2017). Other studies confirm that bottom-up signals in visual cortex gate the effects of top-down feedback signals originating from the monkey FEF (Ekstrom, Roelfsema, Arsenault, Bonmassar, & Vanduffel, 2008; Moore & Armstrong, 2003; Moore & Fallah, 2004). In our study of human visual cortex using fMRI, we find that top-down signals alone are sufficient to induce a spatially specific pattern of topographic activity in visual cortex, and gating by bottom-up signals is unnecessary. Overall, our results are consistent with the sensory-motor recruitment model of WM that posits that the same mechanisms that evolved to create internal representations of sensory events are also used to maintain WM representations (Curtis & D’Esposito, 2003; Pasternak & Greenlee, 2005; Postle, 2006; Theeuwes, Olivers, & Chizk, 2005) and add to a growing list of studies demonstrating that WM representations are encoded in the patterns of activity in sensory cortices (Albers et al., 2013; Christophel, Hebart, & Haynes, 2012; Ester et al., 2013; Ester et al., 2009; Harrison & Tong, 2009; Lee, Kravitz, & Baker, 2013; Riggall & Postle, 2012; Sprague et al., 2014; Sprague & Serences, 2013)
Finally, we tested the hypothesis that the source of the top-down signals that prioritize the visual and saccade goal locations originate in the frontal and/or parietal cortex. The trial-by-trial fluctuations in delay period activity in both frontal and parietal cortex predicted the precision with which our model of visual cortex reconstructed the maintained saccade goal. Notably, delay period activity near the junction of the sPCS and the superior frontal sulcus, the location of the putative human homologue of monkey area FEF, predicted the precision of encoding of the WM representation in visual cortex. Feedback signals during WM originating in FEF cause spatially selective increases in the response gain of neurons, and also expand and shift the receptive fields of V4 and MT neurons towards the memorized location (Merrikhi et al., 2017). Previous studies established the importance of the human sPCS for spatial WM. During spatial WM delays, neural activity in the human sPCS persists (Curtis, Rao, & D’Esposito, 2004; Jerde, Merriam, Riggall, Hedges, & Curtis, 2012; Schluppeck, Curtis, Glimcher, & Heeger, 2006; Srimal & Curtis, 2008; Tark & Curtis, 2009). Moreover, transcranial magnetic stimulation (Mackey & Curtis, 2017) and cortical resections (Mackey, Devinsky, Doyle, Meager, et al., 2016) to sPCS impact the precision of memory-guided saccades. Given that the sPCS is necessary for spatial WM, activity persists during WM maintenance, and it is topographically organized (Kastner et al., 2007; Mackey, Winawer, & Curtis, 2017), we propose that it maintains representations of spatial priority (Jerde et al., 2012; Sprague & Serences, 2013). Furthermore, priority maps may be the basis for top-down signals that sculpt the population dynamics in visual cortex in favor of neurons with receptive fields that match the locus of priority, providing a mechanism by which association cortex and sensory cortex interact to support WM.
Acknowledgements:
This study was supported by R01-EY016407 to CEC. We thank the NYU Center for Brain Imaging’s Pablo Velasco, Keith Sanzenbach and Valerio Luccio for help with data collection, John Serences and Thomas Sprague for help with the inverted encoding model, and Kartik Sreenivasan for helpful comments on the manuscript.
Footnotes
Conflict of interest: The authors declare no competing financial interests.
References
- Albers AM, Kok P, Toni I, Dijkerman HC, & de Lange FP (2013). Shared representations for working memory and mental imagery in early visual cortex. Current biology : CB, 23(15), 1427–1431. [DOI] [PubMed] [Google Scholar]
- Baddeley AD, & Hitch G (1974). Working memory. Psychology of learning and motivation, 8, 47–89. [Google Scholar]
- Brouwer GJ, & Heeger DJ (2009). Decoding and reconstructing color from responses in human visual cortex. The Journal of neuroscience : the official journal of the Society for Neuroscience, 29(44), 13992–14003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christophel TB, Hebart MN, & Haynes JD (2012). Decoding the contents of visual short-term memory from human visual and parietal cortex. The Journal of neuroscience : the official journal of the Society for Neuroscience, 32(38), 12983–12989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbetta M, & Shulman GL (2002). Control of goal-directed and stimulus-driven attention in the brain. Nature reviews. Neuroscience, 3(3), 201–215. [DOI] [PubMed] [Google Scholar]
- Curtis CE, & D’Esposito M (2003). Persistent activity in the prefrontal cortex during working memory. Trends Cogn Sci, 7(9), 415–423. [DOI] [PubMed] [Google Scholar]
- Curtis CE, Rao VY, & D’Esposito M (2004). Maintenance of spatial and motor codes during oculomotor delayed response tasks. J Neurosci, 24(16), 3944–3952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- D’Esposito M, & Postle BR (2015). The cognitive neuroscience of working memory. Annual review of psychology, 66, 115–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dumoulin SO, & Wandell BA (2008). Population receptive field estimates in human visual cortex. NeuroImage, 39(2), 647–660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ekstrom LB, Roelfsema PR, Arsenault JT, Bonmassar G, & Vanduffel W (2008). Bottom-up dependent gating of frontal signals in early visual cortex. Science, 321(5887), 414–417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ester EF, Anderson DE, Serences JT, & Awh E (2013). A neural measure of precision in visual working memory. Journal of cognitive neuroscience, 25(5), 754–761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ester EF, Serences JT, & Awh E (2009). Spatially global representations in human primary visual cortex during working memory maintenance. The Journal of neuroscience : the official journal of the Society for Neuroscience, 29(48), 15258–15265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ester EF, Sprague TC, & Serences JT (2015). Parietal and Frontal Cortex Encode Stimulus-Specific Mnemonic Representations during Visual Working Memory. Neuron, 87(4), 893–905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freeman J, Brouwer GJ, Heeger DJ, & Merriam EP (2011). Orientation decoding depends on maps, not columns. The Journal of neuroscience : the official journal of the Society for Neuroscience, 31(13), 4792–4804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Funahashi S, Bruce CJ, & Goldman-Rakic PS (1989). Mnemonic coding of visual space in the monkey’s dorsolateral prefrontal cortex. Journal of neurophysiology, 61(2), 331–349. [DOI] [PubMed] [Google Scholar]
- Geng JJ, Ruff CC, & Driver J (2009). Saccades to a remembered location elicit spatially specific activation in human retinotopic visual cortex. Journal of cognitive neuroscience, 21(2), 230–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldman-Rakic PS (1994). Working memory dysfunction in schizophrenia. The Journal of neuropsychiatry and clinical neurosciences, 6(4), 348–357. [DOI] [PubMed] [Google Scholar]
- Harrison SA, & Tong F (2009). Decoding reveals the contents of visual working memory in early visual areas. Nature, 458(7238), 632–635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haxby JV, Gobbini MI, Furey ML, Ishai A, Schouten JL, & Pietrini P (2001). Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science, 293(5539), 2425–2430. [DOI] [PubMed] [Google Scholar]
- Haynes JD, & Rees G (2006). Decoding mental states from brain activity in humans. Nature reviews. Neuroscience, 7(7), 523–534. [DOI] [PubMed] [Google Scholar]
- Jerde TA, Merriam EP, Riggall AC, Hedges JH, & Curtis CE (2012). Prioritized maps of space in human frontoparietal cortex. The Journal of neuroscience : the official journal of the Society for Neuroscience, 32(48), 17382–17390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kamitani Y, & Tong F (2005). Decoding the visual and subjective contents of the human brain. Nature neuroscience, 8(5), 679–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kastner S, DeSimone K, Konen CS, Szczepanski SM, Weiner KS, & Schneider KA (2007). Topographic maps in human frontal cortex revealed in memory-guided saccade and spatial working-memory tasks. Journal of neurophysiology, 97(5), 3494–3507. [DOI] [PubMed] [Google Scholar]
- Larsson J, & Heeger DJ (2006). Two retinotopic visual areas in human lateral occipital cortex. The Journal of neuroscience : the official journal of the Society for Neuroscience, 26(51), 13128–13142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee SH, Kravitz DJ, & Baker CI (2013). Goal-dependent dissociation of visual and prefrontal cortices during working memory. Nature neuroscience, 16(8), 997–999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lundqvist M, Compte A, & Lansner A (2010). Bistable, irregular firing and population oscillations in a modular attractor memory network. PLoS Comput Biol, 6(6), e1000803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma WJ, Beck JM, Latham PE, & Pouget A (2006). Bayesian inference with probabilistic population codes. Nature neuroscience, 9(11), 1432–1438. [DOI] [PubMed] [Google Scholar]
- Mackey WE, & Curtis CE (2017). Distinct contributions by frontal and parietal cortices support working memory. Sci Rep, 7(1), 6188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackey WE, Devinsky O, Doyle WK, Golfinos JG, & Curtis CE (2016). Human parietal cortex lesions impact the precision of spatial working memory. Journal of neurophysiology, 116(3), 1049–1054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackey WE, Devinsky O, Doyle WK, Meager MR, & Curtis CE (2016). Human Dorsolateral Prefrontal Cortex Is Not Necessary for Spatial Working Memory. The Journal of neuroscience : the official journal of the Society for Neuroscience, 36(10), 2847–2856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackey WE, Winawer J, & Curtis CE (2017). Visual field map clusters in human frontoparietal cortex. Elife, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merrikhi Y, Clark K, Albarran E, Parsa M, Zirnsak M, Moore T, et al. (2017). Spatial working memory alters the efficacy of input to visual cortex. Nature communications, 8, 15041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore T, & Armstrong KM (2003). Selective gating of visual signals by microstimulation of frontal cortex. Nature, 421(6921), 370–373. [DOI] [PubMed] [Google Scholar]
- Moore T, & Fallah M (2004). Microstimulation of the frontal eye field and its effects on covert spatial attention. Journal of neurophysiology, 91(1), 152–162. [DOI] [PubMed] [Google Scholar]
- Naselaris T, Kay KN, Nishimoto S, & Gallant JL (2011). Encoding and decoding in fMRI. NeuroImage, 56(2), 400–410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Norman KA, Polyn SM, Detre GJ, & Haxby JV (2006). Beyond mind-reading: multi-voxel pattern analysis of fMRI data. Trends in cognitive sciences, 10(9), 424–430. [DOI] [PubMed] [Google Scholar]
- Pasternak T, & Greenlee MW (2005). Working memory in primate sensory systems. Nature reviews. Neuroscience, 6(2), 97–107. [DOI] [PubMed] [Google Scholar]
- Postle BR (2006). Working memory as an emergent property of the mind and brain. Neuroscience, 139(1), 23–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riggall AC, & Postle BR (2012). The relationship between working memory storage and elevated activity as measured with functional magnetic resonance imaging. The Journal of neuroscience : the official journal of the Society for Neuroscience, 32(38), 12990–12998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saber GT, Pestilli F, & Curtis CE (2015). Saccade planning evokes topographically specific activity in the dorsal and ventral streams. The Journal of neuroscience : the official journal of the Society for Neuroscience, 35(1), 245–252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schluppeck D, Curtis CE, Glimcher PW, & Heeger DJ (2006). Sustained activity in topographic areas of human posterior parietal cortex during memory-guided saccades. The Journal of neuroscience : the official journal of the Society for Neuroscience, 26(19), 5098–5108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serences JT, Ester EF, Vogel EK, & Awh E (2009). Stimulus-specific delay activity in human primary visual cortex. Psychological science, 20(2), 207–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serences JT, & Saproo S (2012). Computational advances towards linking BOLD and behavior. Neuropsychologia, 50(4), 435–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sprague TC, Ester EF, & Serences JT (2014). Reconstructions of information in visual spatial working memory degrade with memory load. Curr Biol, 24(18), 2174–2180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sprague TC, Ester EF, & Serences JT (2016). Restoring Latent Visual Working Memory Representations in Human Cortex. Neuron, 91(3), 694–707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sprague TC, & Serences JT (2013). Attention modulates spatial priority maps in the human occipital, parietal and frontal cortices. Nat Neurosci, 16(12), 1879–1887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sreenivasan KK, Curtis CE, & D’Esposito M (2014). Revisiting the role of persistent neural activity during working memory. Trends in cognitive sciences, 18(2), 82–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Srimal R, & Curtis CE (2008). Persistent neural activity during the maintenance of spatial position in working memory. NeuroImage, 39(1), 455–468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steinmetz NA, & Moore T (2014). Eye movement preparation modulates neuronal responses in area V4 when dissociated from attentional demands. Neuron, 83(2), 496–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Super H, Spekreijse H, & Lamme VA (2001). A neural correlate of working memory in the monkey primary visual cortex. Science, 293(5527), 120–124. [DOI] [PubMed] [Google Scholar]
- Tark KJ, & Curtis CE (2009). Persistent neural activity in the human frontal cortex when maintaining space that is off the map. Nature neuroscience, 12(11), 1463–1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Theeuwes J, Olivers CN, & Chizk CL (2005). Remembering a location makes the eyes curve away. Psychological science, 16(3), 196–199. [DOI] [PubMed] [Google Scholar]
- van Bergen RS, Ma WJ, Pratte MS, & Jehee JF (2015). Sensory uncertainty decoded from visual cortex predicts behavior. Nature neuroscience, 18(12), 1728–1730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Kerkoerle T, Self MW, & Roelfsema PR (2017). Layer-specificity in the effects of attention and working memory on activity in primary visual cortex. Nature communications, 8, 13804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wandell BA, Dumoulin SO, & Brewer AA (2007). Visual field maps in human cortex. Neuron, 56(2), 366–383. [DOI] [PubMed] [Google Scholar]
- Wang M, Yang Y, Wang CJ, Gamo NJ, Jin LE, Mazer JA, et al. (2013). NMDA receptors subserve persistent neuronal firing during working memory in dorsolateral prefrontal cortex. Neuron, 77(4), 736–749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang XJ (1999). Synaptic basis of cortical persistent activity: the importance of NMDA receptors to working memory. The Journal of neuroscience : the official journal of the Society for Neuroscience, 19(21), 9587–9603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zaksas D, & Pasternak T (2006). Directional signals in the prefrontal cortex and in area MT during a working memory for visual motion task. The Journal of neuroscience : the official journal of the Society for Neuroscience, 26(45), 11726–11742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zemel RS, Dayan P, & Pouget A (1998). Probabilistic interpretation of population codes. Neural computation, 10(2), 403–430. [DOI] [PubMed] [Google Scholar]