Mechanistic Interpretability
Select a category:
-
The shape and simplicity biases of adversarially robust ImageNet-trained CNNs
Peijie Chen*, Chirag Agarwal*, Anh Nguyen*
WHI 2020
Links: pdf | code | project page
-
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space
Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski
CVPR 2017
Links: pdf | code | project page
-
Synthesizing the preferred inputs for neurons in neural networks via deep generator networks
Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune
NeurIPS 2016
Links: pdf | code | project page
-
Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks
Anh Nguyen, Jason Yosinski, Jeff Clune
Visualization workshop ICML 2016
Links: pdf | code | project page
-
Deep Neural Network are Easily Fooled: High Confidence Predictions for Unrecognizable Images
Anh Nguyen, Jason Yosinski, Jeff Clune
CVPR 2015
Links: pdf | code | project page
-
Understanding Neural Networks Through Deep Visualization
Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, and Hod Lipson
ICML DL workshop 2015
Links: pdf | code | project page