AFRT Neural Encoding of Visual Perception using Affine Transformations

Keywords
Loading...
Thumbnail Image
Issue Date
2023-01-01
Language
en
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
State of the art neural encoding models do not explicitly model neuron’s receptive fields, which is inefficient and increases the likelihood of overfitting. Even though this can be fixed using heavy regularisation, a more elegant solution is to explicitly learn the receptive fields of the biological neurons the model is trying to model. Enter AFRT, which takes a standard neural encoding architecture and adds an affine layer at the start. This layer spatially transforms the input by cropping and shifting the input image to fit a certain feature of the image that the neuron is sensitive to. Next, the feature model, AlexNet, extracts features from the selected part of the image, which are then converted into brain response space using a single linear layer. This simple architecture is used to investigate the learned receptive fields of neurons in various layers of AlexNet. Besides matching performance of typical neural encoding models, model parameters are far reduced and more interpretable. Finally, AFRT is used to reinforce the idea that feature complexity of neuron’s receptive fields increases down the visual pathway, and that this idea is mimicked in Deep Neural Networks.
Description
Citation
Faculty
Faculteit der Sociale Wetenschappen