SlideShare a Scribd company logo
THE TESTING OF MACHINE
LEARNING AND ARTIFICIAL
INTELLIGENCE ALGORITHMS
Evan R. Giordanella
QUICK INTRODUCTION
• Python Engineer
• Tester
• QA Engineer
• Manager
• Quality Expert
• Mentor
• Coach
• Musician
• Cyclist
Who am I?
THE TEAM
• Where
• Comcast Advanced Advertising
• What we do
• Intelligent ad sales, planning, and execution
• The Team
• Build e2e testing systems: simulate system
behavior
• Experts on big/small picture code (many
languages, technologies, environments, etc)
MY FIRST MACHINE LEARNING
TESTING EXPERIENCE
NEW PRODUCT - NEW TEAM
• New product to “intelligently” deliver ads
• Machine learning techniques were implemented to predict
how many people will be watching an ad at any given moment
• New data science team for new product
• New test engineers for new product
NEW PRODUCT - NEW TEAM
• New product to “intelligently” deliver ads
• Machine learning techniques were implemented to predict
how many people will be watching an ad at any given moment
• New data science team for new product
• New test engineers for new product
My task was to assess whether this new
product intelligently delivered ads.
CHALLENGES
• Inexperience with ML applications
• Stochastic output
• Computationally intense
validation
• Communication barriers with the
data science team
• Statistics, math!?
• Data scientists’ confidence in
testing
• Buy-in from product organization
re: Machine Learning
• Buy-in from industry stakeholders
(slow moving local cable
businesses)
• New testing can be intimidating
WHAT IS MACHINE LEARNING?
• Field of Computer Science
• An application of artificial intelligence (AI) that analyzes data without the computer being
explicitly programmed to do so
WHAT IS MACHINE LEARNING?
• Field of Computer Science
• An application of artificial intelligence (AI) that analyzes data without the computer being
explicitly programmed to do so
SKILLS NEEDED FOR ML TESTING
There are a lot of them!
STATISTICS / MATH
• All testers needed to have a
foundational knowledge of data
analytics
• Characterization the data
• mean, standard deviation, variance
• Performance Metrics
• mean square error (MSE),
classification accuracy, loss, area
under the curve (AUC)
TERMINOLOGY
• We had to learn basic Machine Learning
terminology
• Testing and Training
• Regression and Classification
• Validation
• Features
• Dimensions
• Random Forest
TERMINOLOGY
• We had to learn basic Machine Learning
terminology
• Testing and Training
• Regression and Classification
• Validation
• Features
• Dimensions
• Random Forest
Speaking
the same
language is
important!
TOOLS AND METHODS WE USED
some worked and some didn’t…
TOOLS
•We wanted to use
same tools as Data
Science Team
•What tools do
computational
scientists
commonly use?
Python, Pandas,
SciPy, NumPy,
Docker
VISUALIZATION THEN
• How do we assess the
application?
• Visualization!
• 1st strategy was to plot model
performance over time using
excel
• Visualization very helpful
• Worked only in the
beginning
• This did not scale! Who
would have thought! Ha!
VISUALIZATION NOW
• 2nd strategy was to automate test and output visualization
• Created tools using Docker and D3 to graph results of various models using
various testing data sets
HOW
• Looked at how models were
implemented in order to find ways
to exploit them
• Examine the code
• Demonstrated how models would
behave given common data
problems
• missing values
• anomalies
• small vs large data
• real vs synthetic data
• Creates predictable data sets with
injected noise in an attempt to break
the algorithm
How we devised our tests
LESSONS LEARNED
• Need to speak the same language with those that implement
the ML/AI
• Conflicting technical nomenclature can create unnecessary
slow downs
• Need to interpret and present the results in a way that
stakeholders and PMs can digest
• Visualize when possible
• Know your audience!
• Knowing what fears your stakeholders have about ML can help
you build their confidence in it
• Do not be afraid!
THANK YOU
• Questions?
• evang@visibleworld.com

More Related Content

PPTX
Horizons 8th Grade STEAM Project Electrical Technology School Safety
PPTX
VivaFinalProject
PPTX
How to build a successfull software
PDF
Pstc 2018
PPTX
An Introduction to Machine Learning
PDF
The Machine Learning Workflow with Azure
PPT
Easily apply Quality Assurance and Testing in the ML Project
PPTX
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
Horizons 8th Grade STEAM Project Electrical Technology School Safety
VivaFinalProject
How to build a successfull software
Pstc 2018
An Introduction to Machine Learning
The Machine Learning Workflow with Azure
Easily apply Quality Assurance and Testing in the ML Project
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...

Similar to Evan Giordanella: The Testing of Machine Learning and Artificial Intelligence Algorithms (20)

PPTX
L1_intro.pptx
PDF
Engineering Intelligent Systems using Machine Learning
PDF
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
PPTX
AI hype or reality
PDF
1 machine learning demystified
PDF
The Data Science Process - Do we need it and how to apply?
PPTX
Introduction to Machine Learning
PDF
ML MODULE 1_slideshare.pdf
PDF
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
PDF
Artificial Intelligence (ML - DL)
PDF
Managing machine learning
PDF
Week 2 lecture
PPTX
Machine learning101 v1.2
 
PDF
General introduction to AI ML DL DS
PDF
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
PPTX
L15.pptx
PDF
Machine Learning Workshop, TSEC 2020
PDF
Intro to machine learning
PPTX
Machine Learning
PDF
Introduction to ML.pdf Supervised Learning, Unsupervised
L1_intro.pptx
Engineering Intelligent Systems using Machine Learning
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
AI hype or reality
1 machine learning demystified
The Data Science Process - Do we need it and how to apply?
Introduction to Machine Learning
ML MODULE 1_slideshare.pdf
infoShare AI Roadshow 2018 - Adam Karwan (Groupon) - Jak wykorzystać uczenie ...
Artificial Intelligence (ML - DL)
Managing machine learning
Week 2 lecture
Machine learning101 v1.2
 
General introduction to AI ML DL DS
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
L15.pptx
Machine Learning Workshop, TSEC 2020
Intro to machine learning
Machine Learning
Introduction to ML.pdf Supervised Learning, Unsupervised
Ad

More from Anna Royzman (20)

PDF
InnovateQASeattle2024_ArtemGolubev _ How Generative AI changed tools landscap...
PDF
Playwright: An Emerging Tool in Test Automation
PDF
TLC2018 Bertold Kolics: Funnels of Hiring Test Engineers
PDF
TLC2018 Dwayne Green: Let's Get Deliberate - for Managers
PDF
TLC2018 Gitte Klitgaard: Experience Hierarchy (Workshop)
PDF
TLC2018 Gitte Klitgaard: Imposter Syndrome
PDF
TLC2018 Justin Ison: Delivering Flawless UI - Challenges and Solutions
PDF
TLC2018 Melissa Tondi: Finding Efficiencies in Software Testing
PDF
TLC2018 Shyam Sunder: Legoizing Testing
PDF
TLC2018 Tanya Kravtsov: 10 Steps to CI, Testing and Delivery
PDF
TLC2018 Thomas Haver: The Automation Firehose - Be Strategic and Tactical
PDF
TLC2018 Thomas Haver: The Science of Testing
PDF
TLC2018 Thomas Haver: Transform with Enterprise Automation
PDF
TLC2018 Valeriy Burmistrov: Testing Leaders in 5-10 years
PPSX
The Three Pillars of Successful Test Leadership: Driving Projects, Process an...
PDF
Alexandra Schieren: It's Just...
PDF
Simon Peter Schrijver: Exploratory Testing Live
PDF
Simon Peter Schrjver: Pair Testing
PDF
Wing Wong: Quality Coaching
PDF
Tuhin Mitra: How I Automate My Negative Tests
InnovateQASeattle2024_ArtemGolubev _ How Generative AI changed tools landscap...
Playwright: An Emerging Tool in Test Automation
TLC2018 Bertold Kolics: Funnels of Hiring Test Engineers
TLC2018 Dwayne Green: Let's Get Deliberate - for Managers
TLC2018 Gitte Klitgaard: Experience Hierarchy (Workshop)
TLC2018 Gitte Klitgaard: Imposter Syndrome
TLC2018 Justin Ison: Delivering Flawless UI - Challenges and Solutions
TLC2018 Melissa Tondi: Finding Efficiencies in Software Testing
TLC2018 Shyam Sunder: Legoizing Testing
TLC2018 Tanya Kravtsov: 10 Steps to CI, Testing and Delivery
TLC2018 Thomas Haver: The Automation Firehose - Be Strategic and Tactical
TLC2018 Thomas Haver: The Science of Testing
TLC2018 Thomas Haver: Transform with Enterprise Automation
TLC2018 Valeriy Burmistrov: Testing Leaders in 5-10 years
The Three Pillars of Successful Test Leadership: Driving Projects, Process an...
Alexandra Schieren: It's Just...
Simon Peter Schrijver: Exploratory Testing Live
Simon Peter Schrjver: Pair Testing
Wing Wong: Quality Coaching
Tuhin Mitra: How I Automate My Negative Tests
Ad

Recently uploaded (20)

PPTX
Strategic Plan 2023-2024 Presentation.pptx
PDF
CHAPTER 14 Manageement of Nursing Educational Institutions- planing and orga...
PPTX
Empowering Project Management Through Servant Leadership - PMI UK.pptx
PPTX
Self-Awareness and Values Development presentation
PPTX
INTELLECTUAL PROPERTY LAW IN UGANDA.pptx
PDF
Human resources management is a best management
PDF
The Plan: Save the Palestinian Nation Now
PDF
The Untold Story of Swami Vijay Kumar Durai: Building PRS International
PPTX
Basics of Project Management for development of leadership skills in practice
PDF
Air India AI-171 Crash in Ahmedabad A Tragic Wake-Up Call.
PDF
1_Corporate Goverance presentation topic
PDF
Phillips model training for evaluation pdf
PPTX
The Sustainable Site: Boosting Productivity in Construction – Pipe Dream or P...
PPTX
Course Overview of the Course Titled.pptx
PDF
Timeless Leadership Principles from History’s Greatest Figures by Alfonso Ken...
PDF
CHAPTER 14 Manageement of Nursing Educational Institutions- planing and orga...
PDF
Leveraging Intangible Assets Through Campus Entrepreneurship and Tech Transfer
PPTX
Human resources management -job perception concept
PPTX
Improved_Leadership_in_Total_Quality_Lesson.pptx
PDF
Case study -Uber strategic plan and management
Strategic Plan 2023-2024 Presentation.pptx
CHAPTER 14 Manageement of Nursing Educational Institutions- planing and orga...
Empowering Project Management Through Servant Leadership - PMI UK.pptx
Self-Awareness and Values Development presentation
INTELLECTUAL PROPERTY LAW IN UGANDA.pptx
Human resources management is a best management
The Plan: Save the Palestinian Nation Now
The Untold Story of Swami Vijay Kumar Durai: Building PRS International
Basics of Project Management for development of leadership skills in practice
Air India AI-171 Crash in Ahmedabad A Tragic Wake-Up Call.
1_Corporate Goverance presentation topic
Phillips model training for evaluation pdf
The Sustainable Site: Boosting Productivity in Construction – Pipe Dream or P...
Course Overview of the Course Titled.pptx
Timeless Leadership Principles from History’s Greatest Figures by Alfonso Ken...
CHAPTER 14 Manageement of Nursing Educational Institutions- planing and orga...
Leveraging Intangible Assets Through Campus Entrepreneurship and Tech Transfer
Human resources management -job perception concept
Improved_Leadership_in_Total_Quality_Lesson.pptx
Case study -Uber strategic plan and management

Evan Giordanella: The Testing of Machine Learning and Artificial Intelligence Algorithms

  • 1. THE TESTING OF MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE ALGORITHMS Evan R. Giordanella
  • 2. QUICK INTRODUCTION • Python Engineer • Tester • QA Engineer • Manager • Quality Expert • Mentor • Coach • Musician • Cyclist Who am I?
  • 3. THE TEAM • Where • Comcast Advanced Advertising • What we do • Intelligent ad sales, planning, and execution • The Team • Build e2e testing systems: simulate system behavior • Experts on big/small picture code (many languages, technologies, environments, etc)
  • 4. MY FIRST MACHINE LEARNING TESTING EXPERIENCE
  • 5. NEW PRODUCT - NEW TEAM • New product to “intelligently” deliver ads • Machine learning techniques were implemented to predict how many people will be watching an ad at any given moment • New data science team for new product • New test engineers for new product
  • 6. NEW PRODUCT - NEW TEAM • New product to “intelligently” deliver ads • Machine learning techniques were implemented to predict how many people will be watching an ad at any given moment • New data science team for new product • New test engineers for new product My task was to assess whether this new product intelligently delivered ads.
  • 7. CHALLENGES • Inexperience with ML applications • Stochastic output • Computationally intense validation • Communication barriers with the data science team • Statistics, math!? • Data scientists’ confidence in testing • Buy-in from product organization re: Machine Learning • Buy-in from industry stakeholders (slow moving local cable businesses) • New testing can be intimidating
  • 8. WHAT IS MACHINE LEARNING? • Field of Computer Science • An application of artificial intelligence (AI) that analyzes data without the computer being explicitly programmed to do so
  • 9. WHAT IS MACHINE LEARNING? • Field of Computer Science • An application of artificial intelligence (AI) that analyzes data without the computer being explicitly programmed to do so
  • 10. SKILLS NEEDED FOR ML TESTING There are a lot of them!
  • 11. STATISTICS / MATH • All testers needed to have a foundational knowledge of data analytics • Characterization the data • mean, standard deviation, variance • Performance Metrics • mean square error (MSE), classification accuracy, loss, area under the curve (AUC)
  • 12. TERMINOLOGY • We had to learn basic Machine Learning terminology • Testing and Training • Regression and Classification • Validation • Features • Dimensions • Random Forest
  • 13. TERMINOLOGY • We had to learn basic Machine Learning terminology • Testing and Training • Regression and Classification • Validation • Features • Dimensions • Random Forest Speaking the same language is important!
  • 14. TOOLS AND METHODS WE USED some worked and some didn’t…
  • 15. TOOLS •We wanted to use same tools as Data Science Team •What tools do computational scientists commonly use? Python, Pandas, SciPy, NumPy, Docker
  • 16. VISUALIZATION THEN • How do we assess the application? • Visualization! • 1st strategy was to plot model performance over time using excel • Visualization very helpful • Worked only in the beginning • This did not scale! Who would have thought! Ha!
  • 17. VISUALIZATION NOW • 2nd strategy was to automate test and output visualization • Created tools using Docker and D3 to graph results of various models using various testing data sets
  • 18. HOW • Looked at how models were implemented in order to find ways to exploit them • Examine the code • Demonstrated how models would behave given common data problems • missing values • anomalies • small vs large data • real vs synthetic data • Creates predictable data sets with injected noise in an attempt to break the algorithm How we devised our tests
  • 19. LESSONS LEARNED • Need to speak the same language with those that implement the ML/AI • Conflicting technical nomenclature can create unnecessary slow downs • Need to interpret and present the results in a way that stakeholders and PMs can digest • Visualize when possible • Know your audience! • Knowing what fears your stakeholders have about ML can help you build their confidence in it • Do not be afraid!
  • 20. THANK YOU • Questions? • evang@visibleworld.com