Mechanistic Interpretability
Select a category:
-
The shape and simplicity biases of adversarially robust ImageNet-trained CNNs
Peijie Chen*, Chirag Agarwal*, Anh Nguyen*
Links: pdf | code | project page
-
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space
Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski
Links: pdf | code | project page
-
Synthesizing the preferred inputs for neurons in neural networks via deep generator networks
Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune
Links: pdf | code | project page
-
Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks
Anh Nguyen, Jason Yosinski, Jeff Clune
Links: pdf | code | project page
-
Deep Neural Network are Easily Fooled: High Confidence Predictions for Unrecognizable Images
Anh Nguyen, Jason Yosinski, Jeff Clune
Links: pdf | code | project page
-
Understanding Neural Networks Through Deep Visualization
Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, and Hod Lipson
Links: pdf | code | project page