Sources of Bias and Explanation

sources of
bias and explanation
Alan Dix
Computational Foundry
Swansea
http://alandix.com/academic/talks/PIT-2019-bias-and-explanation/

Tiree
Tiree Tech Wave
3-7 October
Computational Foundry
Swansea University

the foundry
building
mission
community

types of algorithms …
rules and regulations
ordinary code
classic AI
machine learning and neural nets
increasing
opacity

when things go wrong – deliberate
misuse
hacking
bad use
cyberwarfare – Stuxnet, etc.
autonomous weapons

when things go wrong – well meaning
accidents
autonomous car crashes
unintended consequences
bias (gender, ethnicity)
disproportionate social effects
https://www.bbc.co.uk/sounds/play/m00017s4 (report @ 1:41:00 in)

warns of the danger of gender and ethnic bias in
black-box machine learning systems
gives example: database queries using ID3
offers (partial) solution: Query-by-Browsing
and even some broader heuristics
inter alia …

Query-by-Browsing
creating scructable
internal representations

Query by Browsing
user chooses records of interest
 tick for those wanted
 cross for those not wanted
system infers query
web version uses rule induction
variant of Quinlan’s ID3
www.meandeviation.com/qbb

Query by Browsing
what it looks like
user asks
system to
make a query
system infers
SQL query
query results
highlighted

Query by Browsing
dual representation
query (intensional)
for precision
listing (extensional)
for understanding

Query by Browsing – how it works
examples
machine
learning
SQL query
cond
cond
decision
tree

it is not just about
being accurate
not just right
but also upright

learning
past bias in
training data
training
data
learnt
rules
objective
function
societal
bias in goals
‘best’ may
be biased

mimicking human behaviour and choices

pandering to human bias
(effective outcomes?)
• dating sites using ethnicity (CHI 2018!)
• young pretty waitresses sell more drinks
• Trump (reportedly) hiding black employees at
casino when certain rich customers arrived
• BBC (& others) paying male presenters more
because they are more popular

‘good’ business
but is it good?

reinforcing societal/cultural norms
at school
boys more likely to study STEM subjects
girls more likely to study humanities
so, on average, with no other information
gender is an (albeit poor) predictor
of communication skills
and engineering knowledge

as a society we choose
to use other (and better)
predictors

innate (but largely irrelevant) differences
men are (on average) larger and stronger
so gender is a Bayesian predictor of strength
this may explain gender differences in some jobs
but …
it does NOT justify employment discrimination

bias is not about
algorithmic correctness
it is about social choice

the choice of input features
often critical in
creating or controlling bias
more data not always better!

Note:
human reasoning is
poor at ignoring low quality cues
even when we have better ones

however …
not sufficient to remove explicit indicators:
gender/ethnicity/disability/religion
potential correlating factors e.g. clothing
algorithms need to actively avoid discrimination

and how do we know our
algorithms are OK?

Not just bias
safety – e.g. autonomous cars
democracy – e.g. social media, fake news
health and well being – e.g. soft-drink adverts
social issues – e.g. credit ratings

we need to ask
Why?
algorithmic transparency
c.f. court judgment

an AIX Kitbag
AI explainability
how to make sense of
black-box machine-learning algorithms

crucial insight …
human–human explanations
rarely utterly precise or reproducible
but are
sufficient to inspire confidence and trust

white-box black-box
grey-box
creating scructable
analysing and
understanding
from the outside
peeking within
understanding

but … this was all evident
25 years ago
why didn’t I do more?
if it is important
not sufficient to publish
you need to transform into
publicity and policy

white-box methods
creating scructable

WB0. choose a white box classifier!
training set
scrutable
rules
white-box
algorithm
unseen data white-box classifier outputs

WB1. black-box generation of white box
classifier
training set
scrutable
rules
black-box
algorithm
unseen data white-box classifier outputs

WB2. Adversarial examples for white-box
learning
case-base of
behaviour scrutable
rules
black-box
adversarial learning
white-box
learning

WB3. Simplification of rule set
scrutable
rules
black-box
learning
training
set
inscrutable
rules tweak

black-box methods
analysing and understanding
from the outside

BB1. exploration analysis for human
visualisation
black-box
learning
training
set
inscrutable
rules
lots of
examples
black-box
classifier
visualise
input-output

BB2. perturbation/exploration analysis for
key feature detection
black-box
learning
inscrutable
rules
randomly vary
feature values
black-box
classifier
hotspot
visualisation

BB3. perturbation analysis for central and
boundary cases
lots of
examples
black-box
classifier
central and
boundary
cases
user
visualisation
white-box
learning

BB3. close up
central cases
perturbations
do not change class
boundary cases
small perturbations
change class
penumbra
larger perturbations
change class

BB4. black-box oracle – white-box learning
input
examples
black-box
classifier
scrutable
rules
white-box
learning
input–output
pairs as
training set
output
classes

grey-box methods
peeking within

GB0a. sensitivity analysis – weights
perturb parameters in
the inscrutable rules
lots of
examples
black-box
classifier
hotspot analysis
on parameters

GB0b. sensitivity analysis – activation
input
example
black-box classifier
(low level)
extract
intermediate
activation
(high level)
perturb
activations
hotspot analysis
of nodes

GB0c. sensitivity analysis – algorithmic
apply black-box
algorithm
inverse
algorithm

GB1. high level model generation
input
examples
black-box
classifier
extract
intermediate
activation
scrutable
rules
white-box
learning
activations with
output class
as training set
output
classes

GB2. Clustering and comprehension of
low level
input
examples
black-box
classifier
extract
intermediate
activation
clusters
various
algorithms
activations
as input
MDS
SOM

GB3. triad distinctions
input
examples
(low level)
A
B
C
hotspot analysis
of nodes
compare

GB4. apply generatively
output to input
activation to input
output to activation
between layers

Sources of Bias and Explanation

More Related Content

What's hot (20)

Similar to Sources of Bias and Explanation (20)

More from Alan Dix (20)

Recently uploaded (20)

Sources of Bias and Explanation