Joe Keating
Ethical Data Science - With Big Data Comes Big Responsibility
Bank of Ireland - Analytics Connect
20.09.2018
The Origins of Data Science
1962
• John W. Tukey writes in “The Future of Data Analysis” that “data analysis is intrinsically an empirical science”.
• He later published “Exploratory Data Analysis” (1977), arguing that more emphasis needed to be placed on “using data to suggest hypotheses to test”.
1974
• Peter Naur publishes the “Concise Survey of Computer Methods”, including the following definition of data science:
• “The science of dealing with data, once they have been established - while the relation of the data to what they represent is delegated to other fields and sciences.”
1977
• The International Association for Statistical Computing (IASC) is established with the following mission:
• “to link traditional statistical methodology, modern computer technology, and the knowledge of domain experts in order to convert data into information and knowledge.”
1996
• Members of the International Federation of Classification Societies (IFCS) meet in Kobe, Japan, for their biennial conference.
• For the first time, the term “data science” is included in the title of the conference.
2009
• Nathan Yau writes in “Rise of the Data Scientist” that “We're seeing data scientists—people who can do it all—emerge from the rest of the pack”.
2011
• Harlan Harris writes in “Data Science, Moore’s Law, and Moneyball” that “What Data Scientists do has been very well covered, and it runs the gamut from data collection and munging, through application of statistics and machine learning and related techniques, to interpretation, communication, and visualization of the results.”
Where is Data Science today?
• At the coalface of digital transformation.
• The opportunity for achieving social good within the field of data science is huge.
• Data has become one of the most valuable commodities in the global economy.
Typical Application: Traditional Data Science
• Targeted advertising based on our online activity, whether it’s a YouTube pre-roll ad or a targeted article on Facebook.
Emerging Techniques: Deep learning
• Scales to much larger volumes and often gives better results. However, we don’t necessarily know how it works. You can’t open up the lid and look inside a black-box model.
What About Bias and Ethics?
• A central challenge in building a fair model is to quantify some notion of ‘fairness’.
• Group vs. Individual Fairness
  • Group fairness is the requirement that different groups of people should be treated the same on average.
  • Individual fairness is the requirement that individuals who are similar should be treated similarly.
• Sample Bias vs. Label Bias in your Data
  • Label bias occurs when the data-generating process systematically assigns labels differently for different groups (e.g. “Studies show that men with beards drink more” = label bias).
  • Sample bias occurs when the data-generating process samples from different groups in different ways (e.g. more people are tested for drink driving in Dublin than elsewhere = sample bias).
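The two fairness notions above can be made concrete with a small sketch. This is an illustration under stated assumptions, not a method taken from the slides: group fairness is approximated here by the gap in approval rates between groups, and individual fairness by flagging pairs of near-identical applicants whose scores differ widely. The function names, toy data and thresholds are all hypothetical.

```python
from itertools import combinations


def selection_rates(decisions, groups):
    """Fraction of positive decisions (1 = approved) for each group."""
    totals, positives = {}, {}
    for decision, group in zip(decisions, groups):
        totals[group] = totals.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + decision
    return {g: positives[g] / totals[g] for g in totals}


def group_fairness_gap(decisions, groups):
    """Largest difference in approval rate between any two groups (0 = same on average)."""
    rates = selection_rates(decisions, groups)
    return max(rates.values()) - min(rates.values())


def individual_fairness_violations(features, scores, max_distance, max_score_gap):
    """Index pairs of similar individuals whose scores differ by more than max_score_gap.

    'Similar' is assumed to mean an L1 feature distance of at most max_distance.
    """
    violations = []
    for i, j in combinations(range(len(scores)), 2):
        distance = sum(abs(a - b) for a, b in zip(features[i], features[j]))
        if distance <= max_distance and abs(scores[i] - scores[j]) > max_score_gap:
            violations.append((i, j))
    return violations


# Hypothetical toy data: eight loan decisions across two groups.
decisions = [1, 1, 1, 0, 1, 0, 0, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
print(selection_rates(decisions, groups))      # {'A': 0.75, 'B': 0.25}
print(group_fairness_gap(decisions, groups))   # 0.5

# Hypothetical applicants: the first two are nearly identical but scored very differently.
features = [(0.50, 0.30), (0.51, 0.30), (0.90, 0.10)]
scores = [0.80, 0.40, 0.35]
print(individual_fairness_violations(features, scores, 0.05, 0.10))  # [(0, 1)]
```

Which distance measure and which thresholds count as "similar" is a judgment call that depends on the task, which is why quantifying fairness is described above as the central challenge.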
What is an Example of Bias?
In the financial industry, biased data may produce results that violate the United States Equal Credit Opportunity Act (fair lending).
This law, enacted in 1974, prohibits credit discrimination based on race, color, religion, national origin, sex, marital status, age, or receipt of public assistance income.
While lenders take steps not to include such data in a loan decision, it may still be possible to infer race in some cases from a proxy such as a zip code.
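One way to check whether a field such as zip code acts as a proxy is to ask how much better the protected attribute can be guessed once the field is known. The sketch below is a minimal illustration with made-up data; `proxy_strength` is a hypothetical helper, not part of any library or of the Glantus platform.

```python
from collections import Counter, defaultdict


def proxy_strength(proxy_values, protected_values):
    """Return (baseline_accuracy, accuracy_with_proxy) for guessing the protected attribute.

    Baseline: always guess the most common protected value overall.
    With proxy: guess the most common protected value within each proxy value.
    """
    n = len(protected_values)
    baseline = Counter(protected_values).most_common(1)[0][1] / n

    by_proxy = defaultdict(Counter)
    for proxy, protected in zip(proxy_values, protected_values):
        by_proxy[proxy][protected] += 1
    correct = sum(counts.most_common(1)[0][1] for counts in by_proxy.values())
    return baseline, correct / n


# Hypothetical data: zip code is never used as a model input for race,
# yet it reveals group membership for most applicants.
zip_codes = ["10001", "10001", "10001", "20002", "20002", "20002", "30003", "30003"]
group = ["X", "X", "X", "Y", "Y", "Y", "X", "Y"]

baseline, with_zip = proxy_strength(zip_codes, group)
print(baseline)   # 0.5
print(with_zip)   # 0.875
```

A large jump from the baseline to the proxy-informed accuracy suggests that dropping the protected column alone does not remove the information from the data.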
The Impact - Consumer Financial Protection Bureau (CFPB)
The CFPB was created by the Dodd-Frank Act of 2010, a federal law that places regulation of the financial industry in the hands of the government.
How do we eliminate Bias?
• To mitigate bias, data scientists need to understand the data and its context before they even begin modelling algorithmic patterns.
• If bias is present in the data collected, the algorithm will carry it forward.
• Model outcomes will become more neutral if an algorithm is trained on data that is pre-processed to minimize bias.
There is no silver-bullet measurement that is guaranteed to detect unfairness. Choosing an appropriate definition of model fairness is task-specific, but it should always be underpinned by an ethical code of conduct and transparency.
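The slides do not say which pre-processing method is meant. As one possible illustration, the sketch below uses reweighing, a published pre-processing technique swapped in here as an assumption: each training record gets the weight P(group) × P(label) / P(group, label), so that group and label become statistically independent in the weighted data. All names and data are hypothetical.

```python
from collections import Counter


def reweighing_weights(groups, labels):
    """One weight per record: P(group) * P(label) / P(group, label)."""
    n = len(labels)
    group_counts = Counter(groups)
    label_counts = Counter(labels)
    joint_counts = Counter(zip(groups, labels))
    return [
        (group_counts[g] * label_counts[y]) / (n * joint_counts[(g, y)])
        for g, y in zip(groups, labels)
    ]


def weighted_positive_rate(groups, labels, weights, group):
    """Weighted share of positive labels (1) within one group."""
    numerator = sum(w * y for g, y, w in zip(groups, labels, weights) if g == group)
    denominator = sum(w for g, w in zip(groups, weights) if g == group)
    return numerator / denominator


# Hypothetical historical data: group A was approved 3 times in 4, group B once in 4.
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
labels = [1, 1, 1, 0, 1, 0, 0, 0]

weights = reweighing_weights(groups, labels)
# After reweighing, both groups have (approximately) the same weighted approval rate.
print(weighted_positive_rate(groups, labels, weights, "A"))
print(weighted_positive_rate(groups, labels, weights, "B"))
```

The weights would then be passed to a learner that accepts per-sample weights. This addresses only one fairness definition (equal average outcomes across groups), consistent with the point above that no single measure fits every task.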
“Data made simple”
Glantus provides a unique data platform, using our own IPR to enable organisations to discover and optimise the value of their business data.
How it Works.
1. Connect to Data from any Digital source and configure Validation, Classification and Transformation Rules, if necessary
2. Apply out-of-the-box Algorithms that are industry- and solution-specific to suit your needs, and/or simply create your own
3. Automate end-to-end Workflows to eliminate manual effort and add value to your business through proactive Actions
4. Empower the Business to self-serve their own Data needs
Solution to Apply the Rules of Fair Lending
The Glantus platform provides an automated, end-to-end solution to compile data and perform analytics to support the requirements mandated by Financial Regulation.
Data Preparation
• Connect to any and all digital systems and sources for aggregation and transformation in line with the regulatory requirements identified
• Automated generation of Universal Loan Identifiers (ULI) and check digits
• Automated validation and notifications based on user-defined rules
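As an illustration of the check-digit step, the sketch below follows the mod-97 procedure as I understand it from the HMDA rules (Regulation C): letters are replaced by numbers (A = 10 through Z = 35), two zeros are appended, and the check digits are 98 minus the remainder after division by 97. The function names are hypothetical and this is not the Glantus implementation; the base value is only a sample.

```python
import string


def _to_numeric(text):
    """Replace each letter with its number (A=10 ... Z=35); digits pass through unchanged."""
    parts = []
    for ch in text.upper():
        if ch in string.digits:
            parts.append(ch)
        elif ch in string.ascii_uppercase:
            parts.append(str(ord(ch) - ord("A") + 10))
        else:
            raise ValueError(f"unsupported character in identifier: {ch!r}")
    return "".join(parts)


def uli_check_digits(base):
    """Two check digits for a ULI base (institution identifier plus loan identifier)."""
    remainder = int(_to_numeric(base) + "00") % 97
    return f"{98 - remainder:02d}"


def is_valid_uli(uli):
    """A complete ULI is valid when its numeric form leaves remainder 1 modulo 97."""
    return int(_to_numeric(uli)) % 97 == 1


base = "10Bx939c5543TqA1144M999143X"   # sample base value
check = uli_check_digits(base)
print(check)                        # 38
print(is_valid_uli(base + check))   # True
print(is_valid_uli(base + "39"))    # False
```

Because validation is a single modulo operation, the same check can run as an automated rule on every incoming record before it reaches the analytics stage.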
Analytics
• Automate the consolidation of qualitative and quantitative data elements
• Produce Risk Scores at individual and cumulative levels across all businesses within the organization
• Examine activity by region (e.g., MSA) and branch (e.g., origination activity and loan approval rates) as a function of race, ethnicity, or gender across LOBs and entities
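The regional analysis described above can be sketched as a simple aggregation: compute loan approval rates per region and group, then flag regions where the gap between groups exceeds a chosen threshold. The record layout, region names and threshold below are assumptions made for illustration only.

```python
from collections import defaultdict


def approval_rates(applications):
    """Approval rate for each (region, group) pair.

    applications: iterable of (region, group, approved) tuples.
    """
    counts = defaultdict(lambda: [0, 0])  # (region, group) -> [approved, total]
    for region, group, approved in applications:
        counts[(region, group)][0] += int(approved)
        counts[(region, group)][1] += 1
    return {key: approved / total for key, (approved, total) in counts.items()}


def flag_disparities(rates, threshold):
    """Regions where the approval-rate gap between groups exceeds the threshold."""
    by_region = defaultdict(list)
    for (region, _group), rate in rates.items():
        by_region[region].append(rate)
    return sorted(
        region for region, values in by_region.items()
        if max(values) - min(values) > threshold
    )


# Hypothetical applications in two regions (e.g. two MSAs).
applications = (
    [("MSA-1", "A", True)] * 3 + [("MSA-1", "A", False)] * 1
    + [("MSA-1", "B", True)] * 1 + [("MSA-1", "B", False)] * 3
    + [("MSA-2", "A", True)] * 2 + [("MSA-2", "A", False)] * 2
    + [("MSA-2", "B", True)] * 2 + [("MSA-2", "B", False)] * 2
)

rates = approval_rates(applications)
print(rates[("MSA-1", "A")], rates[("MSA-1", "B")])   # 0.75 0.25
print(flag_disparities(rates, threshold=0.2))         # ['MSA-1']
```

A gap on its own does not prove discrimination, since legitimate credit factors may differ between groups; it is a prompt for closer review.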
Automation
• Automated digital communications based on the outcome of the Analytics:
  • Internal, External Systems
  • Email
  • SMS
• Pro-active Monitoring and Alerting based on Real-Time activity
Typical Use Case
Glantus Data Stream
• ERP Data: Master and Transactional Data feed
• Web Store Data: Mimic Orders streaming from web store
• Twitter feed monitoring for keywords
Workflow Modeler
Structured DB
Actions
• Tweet
• SMS
• eMail
• DB Update
Your data becomes your asset.
Contact us @ www.glantus.com

Editor's Notes

  • #2: Initial introductions.
  • #9: Focus on the fact that the end-to-end platform is unique (on a global level) and emphasize that this is our IPR.
  • #10: One single platform to Connect, Analyse and Optimise the visibility and use of data within the organisation.
  • #11: Organic Growth through Dublin, Gloucester and Katowice. New York through Acquisition. Dubai, Singapore and Sydney based on Alliance and Partnerships.