SlideShare a Scribd company logo
The dark side of
deep learning
Milan, 11 November 2017
1
Just about any DL presentation...
2
The deep learning craze
[Business Insider, 2017]
IBM speech recognition is on the verge of super-human accuracy
[Time, 2017]
Are Computers Already Smarter Than Humans?
[LiveScience, 2016]
Artificial Intelligence Beats 'Most Complex Game Devised by
Humans'
[RECODE, 2017]
Intel is paying more than $400 million to buy deep-learning
startup Nervana Systems
3
4
5
I know there's a proverb which that says 'To err is human,' but a
human error is nothing to what a computer can do if it tries.
--- Agatha Christie
People worry that computers will get too smart and take over the
world, but the real problem is that they're too stupid and they've
already taken over the world.
--- Pedro Domingos
6
What about the limitations of
DL?
DL is not magic - it is an incredibly powerful tool for
extracting regularities from data according a given
objective.
Corollary #1: A DL program will be just as smart as the
data it gets.
Corollary #2: A DL program will be just as smart as the
objective it optimizes.
7
Something to worry about #1
Bias and discrimination
8
Word embeddings
Can convert words to vectors of numbers - at the
hearth of most NLP applications with deep learning
9
Embeddings are highly sexists!
Bolukbasi, T., Chang, K.W., Zou, J., Saligrama, V. and Kalai, A., 2016.
Quantifying and reducing stereotypes in word embeddings. arXiv preprint
10
Hundreds of papers were published before this was
openly discussed!
Bolukbasi, T., Chang, K.W., Zou, J.Y., Saligrama, V. and Kalai, A.T., 2016.
. In Advances in Neural Information Processing Systems (pp. 4349-4357).
This is because gender biases probably account for an
increase in testing accuracy.
Man is to computer programmer as woman is to
homemaker? Debiasing word embeddings
11
Recent years have brought extraordinary
advances in the technical domains of AI. Alongside such efforts,
designers and researchers from a range of disciplines need to
conduct what we call social-systems analyses of AI. They need to
assess the impact of technologies on their social, cultural and
political settings
--- There is a blind spot in AI research, Nature, 2016
12
Racism is definitely bad PR!
[New Statesman, 2016]The rise of the racist robots
13
Not just an economic problem
[an investigation] found that the proprietary algorithms widely
used by judges to help determine the risk of reoffending are
almost twice as likely to mistakenly flag black defendants than
white defendants [There is a blind spot in AI research]
14
Discrimination and fairness
[Google Research Blog]Attacking discrimination with smarter machine learning
15
Something to worry about #2
Adversarial attacks
16
Can we break neural networks?
17
Fooling neural networks
(Andrej Karpathy blog)Breaking linear classifiers on Imagenet
18
Universal perturbations!
Moosavi-Dezfooli, S.M., Fawzi, A., Fawzi, O. and Frossard, P., 2016. Universal adversarial
perturbations. arXiv preprint arXiv:1610.08401.
19
20
Something to worry about #3
Privacy
21
Anonymous data?
De Montjoye, Y.A., Radaelli, L. and Singh, V.K., 2015. Unique in the shopping mall: On the reidentifiability of credit card metadata.
Science, 347(6221), pp.536-539.
22
Given access to a black-box classifier, can we infer
whether a specific example was part of the training
dataset?
We can with shadow training:
Shokri, R., Stronati, M., Song, C. and Shmatikov, V., 2017, May. Membership inference attacks against machine learning models. In
2017 IEEE Symposium on Security and Privacy (SP),  (pp. 3-18). IEEE.
23
Privacy in distributed
environments
Hitaj, B., Ateniese, G. and Perez-Cruz, F., 2017. Deep Models Under the GAN: Information Leakage from Collaborative Deep
Learning. arXiv preprint arXiv:1702.07464.
24
Something to worry about #4
Security threats
25
https://www.youtube.com/watch?v=21HjF4A3WE4
26
Something to worry about #5
Hidden technical debt
27
DL is just a tiny component!
(NIPS 2015)
Hidden Technical Debt in Machine Learning Systems
28
(NIPS 2015)
Machine learning offers a fantastically powerful toolkit for
building useful complex prediction systems quickly. ... it is
dangerous to think of these quick wins as coming for free. ... it is
common to incur massive ongoing maintenance costs in real-
world ML systems. [Risk factors include] boundary erosion,
entanglement, hidden feedback loops, undeclared consumers,
data dependencies, configuration issues, changes in the external
world, and a variety of system-level anti-patterns.
Hidden Technical Debt in Machine Learning Systems
29
If you are in Rome, check out our
Meetup:
And our new association:
Italian Association for Machine Learning
30

More Related Content

PPTX
The Need for Deep Learning Transparency
PPTX
Has Internet of Things really happened?
PDF
Army looks to block data ‘poisoning’ in facial recognition, AI
PDF
Service Design Days 2017 - Keynote Jon Rogers (University of Dundee)
PPTX
Into the Future with Artificial Intelligence: Opportunities and Challenges
PDF
What is I2 Final-Approved
PPT
Strata who owns data - april 2011 - jonas
PDF
Let's Talk: fundamentals of conversational design
The Need for Deep Learning Transparency
Has Internet of Things really happened?
Army looks to block data ‘poisoning’ in facial recognition, AI
Service Design Days 2017 - Keynote Jon Rogers (University of Dundee)
Into the Future with Artificial Intelligence: Opportunities and Challenges
What is I2 Final-Approved
Strata who owns data - april 2011 - jonas
Let's Talk: fundamentals of conversational design

Similar to Simone Scardapane - The dark side of deep learning - Codemotion Milan 2017 (20)

PDF
Ronde C: Masterclass - Omgaan met ethische vraagstukken over digitalisering v...
PDF
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
PDF
Challenges and Solution for Artificial Intelligence in Cybersecurity of the USA
PDF
PDF
Applications of Artificial Intelligence & Associated Technologies
PPTX
[DSC Europe 23] Shahab Anbarjafari - Generative AI: Impact of Responsible AI
PPTX
AI - Exploring Frontiers
PPTX
Have we poisoned the internet for good?
PDF
Data Reliability Challenges with Spark by Henning Kropp (Spark & Hadoop User ...
PPTX
Don't Handicap AI without Explicit Knowledge
PDF
AI and disinfo (1).pdf
PDF
Principles of Artificial Intelligence & Machine Learning
PDF
PatternLanguageOfData
PDF
Student Presentation: AI & Government
PPTX
AI Presentation Y65 Class Dinner
PPTX
AIMA UCSC-SV Leadership_in_the_AI_era 20250628 v16.pptx
PPTX
Open Mining Education, Ethics & AI
PDF
[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int...
PPTX
ARTIFICIAL INTELLIGENCE
PDF
AI: The New Player in Cybersecurity (Nov. 08, 2023)
Ronde C: Masterclass - Omgaan met ethische vraagstukken over digitalisering v...
A primer on Artificial Intelligence (AI) and Machine Learning (ML)
Challenges and Solution for Artificial Intelligence in Cybersecurity of the USA
Applications of Artificial Intelligence & Associated Technologies
[DSC Europe 23] Shahab Anbarjafari - Generative AI: Impact of Responsible AI
AI - Exploring Frontiers
Have we poisoned the internet for good?
Data Reliability Challenges with Spark by Henning Kropp (Spark & Hadoop User ...
Don't Handicap AI without Explicit Knowledge
AI and disinfo (1).pdf
Principles of Artificial Intelligence & Machine Learning
PatternLanguageOfData
Student Presentation: AI & Government
AI Presentation Y65 Class Dinner
AIMA UCSC-SV Leadership_in_the_AI_era 20250628 v16.pptx
Open Mining Education, Ethics & AI
[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int...
ARTIFICIAL INTELLIGENCE
AI: The New Player in Cybersecurity (Nov. 08, 2023)
Ad

More from Codemotion (20)

PDF
Fuzz-testing: A hacker's approach to making your code more secure | Pascal Ze...
PDF
Pompili - From hero to_zero: The FatalNoise neverending story
PPTX
Pastore - Commodore 65 - La storia
PPTX
Pennisi - Essere Richard Altwasser
PPTX
Michel Schudel - Let's build a blockchain... in 40 minutes! - Codemotion Amst...
PPTX
Richard Süselbeck - Building your own ride share app - Codemotion Amsterdam 2019
PPTX
Eward Driehuis - What we learned from 20.000 attacks - Codemotion Amsterdam 2019
PPTX
Francesco Baldassarri - Deliver Data at Scale - Codemotion Amsterdam 2019 -
PDF
Martin Förtsch, Thomas Endres - Stereoscopic Style Transfer AI - Codemotion A...
PDF
Melanie Rieback, Klaus Kursawe - Blockchain Security: Melting the "Silver Bul...
PDF
Angelo van der Sijpt - How well do you know your network stack? - Codemotion ...
PDF
Lars Wolff - Performance Testing for DevOps in the Cloud - Codemotion Amsterd...
PDF
Sascha Wolter - Conversational AI Demystified - Codemotion Amsterdam 2019
PDF
Michele Tonutti - Scaling is caring - Codemotion Amsterdam 2019
PPTX
Pat Hermens - From 100 to 1,000+ deployments a day - Codemotion Amsterdam 2019
PPTX
James Birnie - Using Many Worlds of Compute Power with Quantum - Codemotion A...
PDF
Don Goodman-Wilson - Chinese food, motor scooters, and open source developmen...
PDF
Pieter Omvlee - The story behind Sketch - Codemotion Amsterdam 2019
PDF
Dave Farley - Taking Back “Software Engineering” - Codemotion Amsterdam 2019
PDF
Joshua Hoffman - Should the CTO be Coding? - Codemotion Amsterdam 2019
Fuzz-testing: A hacker's approach to making your code more secure | Pascal Ze...
Pompili - From hero to_zero: The FatalNoise neverending story
Pastore - Commodore 65 - La storia
Pennisi - Essere Richard Altwasser
Michel Schudel - Let's build a blockchain... in 40 minutes! - Codemotion Amst...
Richard Süselbeck - Building your own ride share app - Codemotion Amsterdam 2019
Eward Driehuis - What we learned from 20.000 attacks - Codemotion Amsterdam 2019
Francesco Baldassarri - Deliver Data at Scale - Codemotion Amsterdam 2019 -
Martin Förtsch, Thomas Endres - Stereoscopic Style Transfer AI - Codemotion A...
Melanie Rieback, Klaus Kursawe - Blockchain Security: Melting the "Silver Bul...
Angelo van der Sijpt - How well do you know your network stack? - Codemotion ...
Lars Wolff - Performance Testing for DevOps in the Cloud - Codemotion Amsterd...
Sascha Wolter - Conversational AI Demystified - Codemotion Amsterdam 2019
Michele Tonutti - Scaling is caring - Codemotion Amsterdam 2019
Pat Hermens - From 100 to 1,000+ deployments a day - Codemotion Amsterdam 2019
James Birnie - Using Many Worlds of Compute Power with Quantum - Codemotion A...
Don Goodman-Wilson - Chinese food, motor scooters, and open source developmen...
Pieter Omvlee - The story behind Sketch - Codemotion Amsterdam 2019
Dave Farley - Taking Back “Software Engineering” - Codemotion Amsterdam 2019
Joshua Hoffman - Should the CTO be Coding? - Codemotion Amsterdam 2019
Ad

Recently uploaded (20)

PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Tartificialntelligence_presentation.pptx
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
A Presentation on Artificial Intelligence
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Big Data Technologies - Introduction.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
cuic standard and advanced reporting.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Tartificialntelligence_presentation.pptx
Accuracy of neural networks in brain wave diagnosis of schizophrenia
“AI and Expert System Decision Support & Business Intelligence Systems”
Per capita expenditure prediction using model stacking based on satellite ima...
A Presentation on Artificial Intelligence
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Assigned Numbers - 2025 - Bluetooth® Document
Big Data Technologies - Introduction.pptx
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Spectral efficient network and resource selection model in 5G networks
Dropbox Q2 2025 Financial Results & Investor Presentation
Advanced methodologies resolving dimensionality complications for autism neur...
MYSQL Presentation for SQL database connectivity
cuic standard and advanced reporting.pdf
Approach and Philosophy of On baking technology
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Simone Scardapane - The dark side of deep learning - Codemotion Milan 2017

  • 1. The dark side of deep learning Milan, 11 November 2017 1
  • 2. Just about any DL presentation... 2
  • 3. The deep learning craze [Business Insider, 2017] IBM speech recognition is on the verge of super-human accuracy [Time, 2017] Are Computers Already Smarter Than Humans? [LiveScience, 2016] Artificial Intelligence Beats 'Most Complex Game Devised by Humans' [RECODE, 2017] Intel is paying more than $400 million to buy deep-learning startup Nervana Systems 3
  • 4. 4
  • 5. 5
  • 6. I know there's a proverb which that says 'To err is human,' but a human error is nothing to what a computer can do if it tries. --- Agatha Christie People worry that computers will get too smart and take over the world, but the real problem is that they're too stupid and they've already taken over the world. --- Pedro Domingos 6
  • 7. What about the limitations of DL? DL is not magic - it is an incredibly powerful tool for extracting regularities from data according a given objective. Corollary #1: A DL program will be just as smart as the data it gets. Corollary #2: A DL program will be just as smart as the objective it optimizes. 7
  • 8. Something to worry about #1 Bias and discrimination 8
  • 9. Word embeddings Can convert words to vectors of numbers - at the hearth of most NLP applications with deep learning 9
  • 10. Embeddings are highly sexists! Bolukbasi, T., Chang, K.W., Zou, J., Saligrama, V. and Kalai, A., 2016. Quantifying and reducing stereotypes in word embeddings. arXiv preprint 10
  • 11. Hundreds of papers were published before this was openly discussed! Bolukbasi, T., Chang, K.W., Zou, J.Y., Saligrama, V. and Kalai, A.T., 2016. . In Advances in Neural Information Processing Systems (pp. 4349-4357). This is because gender biases probably account for an increase in testing accuracy. Man is to computer programmer as woman is to homemaker? Debiasing word embeddings 11
  • 12. Recent years have brought extraordinary advances in the technical domains of AI. Alongside such efforts, designers and researchers from a range of disciplines need to conduct what we call social-systems analyses of AI. They need to assess the impact of technologies on their social, cultural and political settings --- There is a blind spot in AI research, Nature, 2016 12
  • 13. Racism is definitely bad PR! [New Statesman, 2016]The rise of the racist robots 13
  • 14. Not just an economic problem [an investigation] found that the proprietary algorithms widely used by judges to help determine the risk of reoffending are almost twice as likely to mistakenly flag black defendants than white defendants [There is a blind spot in AI research] 14
  • 15. Discrimination and fairness [Google Research Blog]Attacking discrimination with smarter machine learning 15
  • 16. Something to worry about #2 Adversarial attacks 16
  • 17. Can we break neural networks? 17
  • 18. Fooling neural networks (Andrej Karpathy blog)Breaking linear classifiers on Imagenet 18
  • 19. Universal perturbations! Moosavi-Dezfooli, S.M., Fawzi, A., Fawzi, O. and Frossard, P., 2016. Universal adversarial perturbations. arXiv preprint arXiv:1610.08401. 19
  • 20. 20
  • 21. Something to worry about #3 Privacy 21
  • 22. Anonymous data? De Montjoye, Y.A., Radaelli, L. and Singh, V.K., 2015. Unique in the shopping mall: On the reidentifiability of credit card metadata. Science, 347(6221), pp.536-539. 22
  • 23. Given access to a black-box classifier, can we infer whether a specific example was part of the training dataset? We can with shadow training: Shokri, R., Stronati, M., Song, C. and Shmatikov, V., 2017, May. Membership inference attacks against machine learning models. In 2017 IEEE Symposium on Security and Privacy (SP),  (pp. 3-18). IEEE. 23
  • 24. Privacy in distributed environments Hitaj, B., Ateniese, G. and Perez-Cruz, F., 2017. Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning. arXiv preprint arXiv:1702.07464. 24
  • 25. Something to worry about #4 Security threats 25
  • 27. Something to worry about #5 Hidden technical debt 27
  • 28. DL is just a tiny component! (NIPS 2015) Hidden Technical Debt in Machine Learning Systems 28
  • 29. (NIPS 2015) Machine learning offers a fantastically powerful toolkit for building useful complex prediction systems quickly. ... it is dangerous to think of these quick wins as coming for free. ... it is common to incur massive ongoing maintenance costs in real- world ML systems. [Risk factors include] boundary erosion, entanglement, hidden feedback loops, undeclared consumers, data dependencies, configuration issues, changes in the external world, and a variety of system-level anti-patterns. Hidden Technical Debt in Machine Learning Systems 29
  • 30. If you are in Rome, check out our Meetup: And our new association: Italian Association for Machine Learning 30