This document summarizes research on measuring uncertainty in neural networks in order to detect adversarial examples. It discusses common methods for generating adversarial examples, such as the basic iterative method, the fast gradient method, and the momentum iterative method, and explores uncertainty measures such as softmax variance and Bayesian neural networks with Monte Carlo dropout. The researchers evaluated these uncertainty metrics on the MNIST and Kaggle ASIRRA datasets and found that dropout-based measures achieved higher AUC for detecting adversarial examples than uncertainty measures derived from non-probabilistic neural networks.
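To make the dropout-based uncertainty measure concrete, the sketch below shows one common way it can be computed: keep dropout active at test time, run several stochastic forward passes, and use the variance of the softmax outputs as an uncertainty score. This is a minimal illustration assuming PyTorch and a hypothetical `model` that contains dropout layers; it is not taken from the paper's implementation.

```python
import torch
import torch.nn.functional as F

def mc_dropout_uncertainty(model, x, n_passes=50):
    """Predictive mean and softmax variance under Monte Carlo dropout."""
    model.train()  # keep dropout layers stochastic at inference time
    with torch.no_grad():
        # Collect softmax outputs from n_passes stochastic forward passes.
        probs = torch.stack(
            [F.softmax(model(x), dim=-1) for _ in range(n_passes)]
        )  # shape: (n_passes, batch, n_classes)
    mean_probs = probs.mean(dim=0)
    # Variance across passes, summed over classes, as a per-example uncertainty score.
    uncertainty = probs.var(dim=0).sum(dim=-1)
    return mean_probs, uncertainty
```

Scoring a mix of clean and adversarial inputs with this function and feeding the uncertainty values to, for example, `sklearn.metrics.roc_auc_score` would yield the kind of AUC comparison described above.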