Interpretation of Neural Networks is Fragile
1
• Interpretation of Neural Networks is Fragile
• Amirata Ghorbani, Abubakar Abid, James Zou
• Stanford University
• AAAI 2019
• arXiv:1710.10547
•
• DNN Adversarial Attack
•
•
• Adversarial Attack
• Attack Attack
2
3
Adversarial Attacks
• panda → gibbon
4
Adversarial Attacks Against Feature Importance
• Attack
Attack
•
5
• DNN
$\delta$, $L$, $\nabla_x$, $x_t$
6
• Feature importance
•
• 3
• Simple gradient method



• Integrated gradients, DeepLIFT, …
• 

• Sample Importance
•
• [Koh&Liang2017]
$I$, $S$, $\nabla_x$
7
$S_l(x_t)$, $x_t \in \mathbb{R}^d$, $l$, $z_i$, $z_t$
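The simple gradient method listed above can be illustrated with a small sketch. It assumes the common definition in which the importance of feature j is the absolute input gradient of the score, normalized to sum to 1. The two-layer tanh model and the names `score`, `grad_score`, `W1`, `W2` are hypothetical stand-ins for a real DNN, not anything taken from the slides.

```python
import math

def score(x, w1, w2):
    """Hypothetical score S(x) = sum_k w2[k] * tanh(w1[k] . x)."""
    return sum(a * math.tanh(sum(wi * xi for wi, xi in zip(row, x)))
               for a, row in zip(w2, w1))

def grad_score(x, w1, w2):
    """Analytic input gradient: dS/dx_j = sum_k w2[k] * (1 - tanh(h_k)^2) * w1[k][j]."""
    g = [0.0] * len(x)
    for a, row in zip(w2, w1):
        h = sum(wi * xi for wi, xi in zip(row, x))
        c = a * (1.0 - math.tanh(h) ** 2)
        for j, wj in enumerate(row):
            g[j] += c * wj
    return g

def simple_gradient_importance(x, w1, w2):
    """Assumed definition: |dS/dx_j| divided by the sum of |dS/dx_i| over all i."""
    mags = [abs(v) for v in grad_score(x, w1, w2)]
    total = sum(mags)
    return [m / total for m in mags] if total > 0 else mags

# Illustrative weights and test input (not from the slides).
W1 = [[1.0, -2.0, 0.5], [0.3, 0.8, -1.5]]
W2 = [1.0, -0.7]
x_t = [0.2, -0.1, 0.4]
importance = simple_gradient_importance(x_t, W1, W2)
```

For an image classifier the same quantity would normally be obtained by automatic differentiation rather than a hand-written gradient.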
• clip
$x_t$, $I(x_t; \mathcal{N})$, $x_t + \delta$, $I(x_t + \delta; \mathcal{N})$, $D(\cdot)$, $\delta$
8
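The symbols on this slide suggest an iterative procedure: perturb $x_t$ by $\delta$, measure the dissimilarity $D$ between $I(x_t; \mathcal{N})$ and $I(x_t + \delta; \mathcal{N})$, and clip the perturbation. The following is a minimal sketch of one such loop, not the authors' implementation. It assumes a signed-gradient step, an $\ell_\infty$ clip of radius `eps`, a top-k style objective, and a check that the sign of the score (standing in for the predicted label) is unchanged. The toy model, the finite-difference gradient, and every name and default value are hypothetical.

```python
import math

def score(x, w1, w2):
    """Hypothetical score S(x) = sum_k w2[k] * tanh(w1[k] . x)."""
    return sum(a * math.tanh(sum(wi * xi for wi, xi in zip(row, x)))
               for a, row in zip(w2, w1))

def importance(x, w1, w2):
    """Normalized absolute input gradient of the toy score."""
    g = [0.0] * len(x)
    for a, row in zip(w2, w1):
        h = sum(wi * xi for wi, xi in zip(row, x))
        c = a * (1.0 - math.tanh(h) ** 2)
        for j, wj in enumerate(row):
            g[j] += c * wj
    mags = [abs(v) for v in g]
    total = sum(mags) or 1.0
    return [m / total for m in mags]

def top_k(values, k):
    """Indices of the k largest entries."""
    return set(sorted(range(len(values)), key=lambda i: values[i], reverse=True)[:k])

def numeric_grad(f, x, h=1e-5):
    """Central finite differences; a real attack would use autodiff instead."""
    grad = []
    for j in range(len(x)):
        xp = list(x); xp[j] += h
        xm = list(x); xm[j] -= h
        grad.append((f(xp) - f(xm)) / (2 * h))
    return grad

def attack(x_t, w1, w2, k=1, eps=0.1, alpha=0.02, steps=50):
    base_top = top_k(importance(x_t, w1, w2), k)
    label = score(x_t, w1, w2) > 0  # sign of the score plays the role of the label

    def d(x):
        # Larger when importance mass leaves the original top-k features.
        imp = importance(x, w1, w2)
        return -sum(imp[i] for i in base_top)

    x, best, best_d = list(x_t), list(x_t), d(x_t)
    for _ in range(steps):
        g = numeric_grad(d, x)
        # Signed-gradient ascent step on the dissimilarity.
        x = [xi + alpha * ((gi > 0) - (gi < 0)) for xi, gi in zip(x, g)]
        # Clip back into the l_inf ball of radius eps around x_t.
        x = [min(max(xi, x0 - eps), x0 + eps) for xi, x0 in zip(x, x_t)]
        # Keep the best perturbation that leaves the label unchanged.
        if (score(x, w1, w2) > 0) == label and d(x) > best_d:
            best, best_d = list(x), d(x)
    return best, best_d

W1 = [[1.0, -2.0, 0.5], [0.3, 0.8, -1.5]]
W2 = [1.0, -0.7]
x_t = [0.2, -0.1, 0.4]
x_adv, d_adv = attack(x_t, W1, W2)
```

The step size and iteration count here are chosen for the toy input scale and are not the values reported on the later slides.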
• Feature Importance
• Top-k
•
• Targeted
•
• Mass-center
•
• Sample Importance
• N
9
$x_t$, $D$, $x_p$ $(p \in [1, N])$, $D(x_t, x_p)$
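The mass-center variant listed above can be made concrete with a short sketch. It assumes the mass center is the importance-weighted mean pixel coordinate of a 2D feature-importance map, and that the shift is the Euclidean distance between two such centers. The function names are hypothetical.

```python
import math

def mass_center(importance_map):
    """Importance-weighted mean (row, col) of a 2D map with non-negative entries."""
    total = sum(sum(row) for row in importance_map)
    if total == 0:
        raise ValueError("importance map has no mass")
    r = sum(i * v for i, row in enumerate(importance_map) for v in row) / total
    c = sum(j * v for row in importance_map for j, v in enumerate(row)) / total
    return (r, c)

def center_shift(map_a, map_b):
    """Euclidean distance between the mass centers of two maps."""
    (r1, c1), (r2, c2) = mass_center(map_a), mass_center(map_b)
    return math.hypot(r1 - r2, c1 - c2)

uniform = [[1, 1], [1, 1]]   # mass spread evenly over a 2x2 map
corner = [[0, 0], [0, 1]]    # all mass in the bottom-right pixel
shift = center_shift(uniform, corner)
```

A mass-center attack would search for a perturbation that makes this shift as large as possible.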
•
• Feature Importance Attack
• Attack
• Sample Importance Attack
• Attack
•
• Feature Importance
• ImageNet SqueezeNet
• CIFAR-10 (cf. Appendix A.)
•
•
• Sample Importance
• InceptionNet v3 ImageNet
• rose or sunflower
• 1000
• Validation acc. 97.5 %
• FI
• SI
• TopK
• FI 1000
• SI Top
• ※ Appendix E F Center Attack CenterShift
P = 100, α = 0.5
±ϵ
10
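The TopK measure mentioned on this slide can be sketched as follows, assuming it is the fraction of the k most important features of the original input that are still among the k most important after perturbation. The function names and the toy values are hypothetical; the slides use k = 1000 for feature importance on images.

```python
def top_k_indices(values, k):
    """Indices of the k largest entries (ties resolved by lower index first)."""
    return set(sorted(range(len(values)), key=lambda i: values[i], reverse=True)[:k])

def top_k_intersection(original, perturbed, k):
    """Fraction of the original top-k features that stay in the perturbed top-k."""
    if k <= 0:
        raise ValueError("k must be positive")
    kept = top_k_indices(original, k) & top_k_indices(perturbed, k)
    return len(kept) / k

# Toy importance vectors: the top-2 sets are {0, 1} and {1, 2}.
before = [0.5, 0.3, 0.1, 0.1]
after = [0.1, 0.5, 0.3, 0.1]
overlap = top_k_intersection(before, after, k=2)
```

A value near 1 indicates the interpretation barely moved, and a value near 0 indicates the most important features were almost entirely replaced.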
Feature Importance Targeted Attack
• 3 Targeted Attack
•
•
11
Feature importance Attack
• ImageNet 512
• Top 1000
•
• l∞
12
Sample importance Attack
• (a)
• (b) clip size
• (c) 2
ϵ l∞
ϵ
13
• 1
•
•
•
•
• g
• i feature
•
• feature
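$S(x; w)$
$\nabla_x S(x + \delta) - \nabla_x S(x) \simeq H\delta$, $x \in \mathbb{R}^d$, $H_{i,j} = \frac{\partial^2 S}{\partial x_i \partial x_j}$
$S(w \cdot x)$
$\nabla_x S = w$, $x$
$S = g(w \cdot x)$
$x \to x + \delta$
$\delta$, $w$, $\ell_1$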
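The formulas above can be connected by a short worked derivation for the model $S = g(w \cdot x)$. This is a sketch under the assumption that $g$ is twice differentiable. The choice $\delta = \epsilon\,\mathrm{sign}(w)$ is an assumption made here to show where an $\ell_1$ norm of $w$ can arise; it is not stated on the slide.

```latex
\begin{align*}
S(x) &= g(w \cdot x), \\
\nabla_x S(x) &= g'(w \cdot x)\, w, \\
H &= \nabla_x^2 S(x) = g''(w \cdot x)\, w w^{\top}, \\
\nabla_x S(x + \delta) - \nabla_x S(x) &\simeq H\delta = g''(w \cdot x)\,(w \cdot \delta)\, w, \\
\delta = \epsilon\, \mathrm{sign}(w) &\;\Rightarrow\; w \cdot \delta = \epsilon \lVert w \rVert_1 .
\end{align*}
```

For the purely linear case $S = w \cdot x$, the gradient is $w$ for every $x$, so $H = 0$ and the gradient does not change under perturbation.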
14
• Attack
•
• Fig6 2 NN
•
•
15
16

Lpixel paper reading group material: "Interpretation of neural network is fragile"