1,987 publications from this institution
Biological systems leverage top-down feedback for visual processing, yet most artificial vision models succeed in image classification using purely feedforward or recurrent architectures, calling into question the functional significance of descending cortical pathways. Here, we trained convolutional recurrent neural networks (ConvRNN) on image classification in the presence or absence of top-down feedback projections to elucidate the specific computational contributions of those feedback pathways. We found that ConvRNNs with top-down feedback exhibited remarkable speed-accuracy trade-off and robustness to noise perturbations and adversarial attacks, but only when they were trained with stochastic neural variability, simulated by randomly silencing single units via dropout. By performing detailed analyses to identify the reasons for such benefits, we observed that feedback information substantially shaped the representational geometry of the post-integration layer, combining the bottom-up and top-down streams, and this effect was amplified by dropout. Moreover, feedback signals coupled with dropout optimally constrained network activity onto a low-dimensional manifold and encoded object information more efficiently in out-of-distribution regimes, with top-down information stabilizing the representational dynamics at the population level. Together, these findings uncover a dual mechanism for resilient sensory coding. On the one hand, neural stochasticity prevents unit-level co-adaptation albeit at the cost of more chaotic dynamics. On the other hand, top-down feedback harnesses high-level information to stabilize network activity on compact low-dimensional manifolds.
This dataset includes skin conductance response (SCR) measurements, CS and US information, keypress responses and keypress response times for each of 32 healthy unmedicated participants (16 males and 16 females aged 22.4+/-4.5 years) participating in a classical (Pavlovian) discriminant delay fear conditioning task. CS is a visual stimulus with variation in position on screen and color. Us is a 1s long white noise burst at 95dB presented over headphones. SOA between the CS and US is varied between participants to be 4, 10, or 16 s. The ITI is selected randomly on each trial from 14, 19, or 23 s