Model metamers illuminate divergences between biological and artificial neural networks
Jenelle Feather, Guillaume Leclerc, Aleksander Ma ̨dry, Josh H McDermott, Massachusetts Institute of Technology, United States
Contributed Talks 3 Lecture
Grand Ballroom A-C
Sun, 28 Aug, 10:40 - 11:00 Pacific Time (UTC -7)
Deep neural network models of sensory systems are often proposed to learn representational transformations with invariances like those in the brain. To reveal these invariances we generated “model metamers” – stimuli whose activations within a model stage are matched to those of a natural stimulus. Metamers for state-of-the- art supervised and unsupervised neural network models of vision and audition were often completely unrecognizable to humans when generated from deep model stages, suggesting differences between model and human invariances. Targeted model changes, such as “adversarial training”, improved human-recognizability of model metamers, but did not eliminate the overall human-model discrepancy. However, metamer recognition dissociated from adversarial vulnerability, revealing a distinct failure mode of existing sensory models and providing a complementary benchmark for model assessment. The human-recognizability of a model’s metamers was well predicted by their recognizability by other models, suggesting that models learn idiosyncratic invariances in addition to those required by the task.