SlideShare a Scribd company logo
TRUSTWORTHY AI AND
OPEN SCIENCE
Beth Plale
Michael A and Laurie Burns McRobbie Professor of Computer Engineering
Beilstein Open Science symposium
October 06, 2021
Luddy School of Informatics, Computing, and Engineering
Data To Insight Center
Observations influenced by my role (2017-2020) in the
National Science Foundation working on agency policies
and practice in open science. Views expressed are
entirely my own.
Funding agency perspective on open science: how do
we bring visibility to the products of research (that we
fund)
NSF funds the collection and capture
of research data through projects
ranging from a few hundred thousand
dollars to tens of millions of dollars.
The data are maintained in a
landscape of solutions to meet the
needs of researchers.
Specialist repositories
- Organizational resources
Generalist repositories
- Organizational resources
Data Portals
- Low velocity data
- Employs cloud resources
- Employs data-compute proximity for analysis
Observation networks
- High velocity data
- Employs cloud resources
RESEARCH DATA LANDSCAPE
SAGE
NEON ARM
HPWREN
UWI
LTER, OOI
NEON
HydroShare
LTER
MGDS, IRIS
ICPSR
QDR
TAIR
MDF
IEDA
PDB
CCDC
DataVerse
Figshare
Dryad
Zenodo
IRs
Exemplar
systems
RESEARCH DATA LANDSCAPE
Data
timeliness
need
Researcher
depth of
expertise
Expectation
for level of
curation
Expectation
of data
longevity
Specialist repositories
- Organizational resources
Generalist repositories
- Organizational resources
Data Portals
- Low velocity data
- Employs cloud resources
- Employs data-compute proximity for analysis
Observation networks
- High velocity data
- Employs cloud resources
SAGE
NEON ARM
HPWREN
UWI
LTER, OOI
NEON
HydroShare
LTER
MGDS, IRIS
ICPSR
QDR
TAIR
MDF
IEDA
PDB
CCDC
DataVerse
Figshare
Dryad
Zenodo
IRs
RESEARCH DATA LANDSCAPE
Publisher’s
view of
landscape
(general
public
view as
well)
Optimization
for timeliness
of research
could
suggest
lower value
over time
Specialist repositories
- Organizational resources
Generalist repositories
- Organizational resources
Data Portals
- Low velocity data
- Employs cloud resources
- Employs data-compute proximity for analysis
Observation networks
- High velocity data
- Employs cloud resources
SAGE
NEON ARM
HPWREN
UWI
LTER, OOI
NEON
HydroShare
LTER
MGDS, IRIS
ICPSR
QDR
TAIR
MDF
IEDA
PDB
CCDC
DataVerse
Figshare
Dryad
Zenodo
IRs
Generalist–Aided
Deposit:
engages generalist
curators
Metadata:
generalist schema
Reuse potential:
moderate-low as
metadata is curated
but general
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
Specialist-DBMS
Deposit:
difficult so DB often
read-only
Metadata:
data dictionary + DB
schema
Reuse potential:
high potential as self
contained
Scope:
subdiscipline scope
Discovery:
known within
subdiscipline
Specialist–Aided
Deposit:
engages specialist
curators
Metadata:
specialized
schema
Reuse potential:
high due to
specialists
Scope:
discipline scope
Discovery:
known within
discipline
Specialist-Unaided
Deposit:
unaided deposit
Metadata:
specialized schema
Reuse potential:
moderate-high from
discipline focus of
metadata schema
Scope:
discipline scope
Discovery:
known within
discipline
Generalist-Unaided
Deposit:
unaided deposit
Metadata:
generalist schema
Reuse potential:
low as metadata is
minimal
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
i.e., institutional repositories
Generalist–Aided
Deposit:
engages generalist
curators
Metadata:
generalist schema
Reuse potential:
moderate-low as
metadata is curated
but general
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
Specialist-DBMS
Deposit:
difficult so DB often
read-only
Metadata:
data dictionary + DB
schema
Reuse potential:
high potential as self
contained
Scope:
subdiscipline scope
Discovery:
known within
subdiscipline
Specialist–Aided
Deposit:
engages specialist
curators
Metadata:
specialized
schema
Reuse potential:
high due to
specialists
Scope:
discipline scope
Discovery:
known within
discipline
Specialist-Unaided
Deposit:
unaided deposit
Metadata:
specialized schema
Reuse potential:
moderate-high from
discipline focus of
metadata schema
Scope:
discipline scope
Discovery:
known within
discipline
Generalist-Unaided
Deposit:
unaided deposit
Metadata:
generalist schema
Reuse potential:
low as metadata is
minimal
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
i.e., institutional repositories
Generalist–Aided
Deposit:
engages generalist
curators
Metadata:
generalist schema
Reuse potential:
moderate-low as
metadata is curated
but general
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
Specialist-DBMS
Deposit:
difficult so DB often
read-only
Metadata:
data dictionary + DB
schema
Reuse potential:
high potential as self
contained
Scope:
subdiscipline scope
Discovery:
known within
subdiscipline
Specialist–Aided
Deposit:
engages specialist
curators
Metadata:
specialized
schema
Reuse potential:
high due to
specialists
Scope:
discipline scope
Discovery:
known within
discipline
Specialist-Unaided
Deposit:
unaided deposit
Metadata:
specialized schema
Reuse potential:
moderate-high from
discipline focus of
metadata schema
Scope:
discipline scope
Discovery:
known within
discipline
Generalist-Unaided
Deposit:
unaided deposit
Metadata:
generalist schema
Reuse potential:
low as metadata is
minimal
Scope:
discipline agnostic
scope
Discovery:
broad name
recognition
i.e., institutional repositories
FEDERAL RESEARCH DATA SUMMARY
• Observation networks and data portals are a fixed part of the
landscape. They have a different role in open science than do
repositories
• Generalist repositories are easier to use than specialist
repositories
• Specialist repositories have higher reusability
• Generalist repositories have economies of scale
• If specialist repositories can leverage generalist repositories as
back ends it would reduce overall cost
OPEN SCIENCE ROLE IN AI
TRUSTWORTHINESS
“ON ARTIFICIAL
INTELLIGENCE, TRUST
IS A MUST, NOT A
NICE-TO-HAVE”
Margrethe Vestager, the European
Commission executive vice president
who oversees digital policy for the 27-
nation bloc
TRUST ó TRUSTWORTHINESS
TRUST
• An individual’s confidence in an
entity
• “I trust this web site”
TRUSTWORTHINESS
• An entity’s state of being
trustworthy or reliable
• An estimate of an object’s
worthiness to receive someone’s
trust
• Trustworthiness is difficult to
accurately quantify
Trustworthy AI and Open Science
Trustworthy AI and Open Science
INDIANA UNIVERSITY BLOOMINGTON
AI: Human-Machine Interaction
§ Fitness smartwatch, smart hearing aids
§ Co-bots, cyber-crews, digital twins
§ Integration of smart machines into human body in
form of computer-brain interfaces or cyborgs
AI: Autonomous and Semi-
Autonomous Actors
• Weapon systems
• Robots in deep sea and space
exploration
• Self driving cars
• Bots in financial trade
AI: Big Data / Big Compute
• Deep learning / Machine Learning /
Natural Language Processing
• Medical diagnosis, image recognition
Broad Categories
of AI
INDIANA UNIVERSITY BLOOMINGTON
AI: Human-Machine Interaction
§ Fitness smartwatch, smart hearing aids
§ Co-bots, cyber-crews, digital twins
§ Integration of smart machines into human body in
form of computer-brain interfaces or cyborgs
AI: Autonomous and Semi-
Autonomous Actors
• Weapon systems
• Robots in deep sea and space
exploration
• Self driving cars
• Bots in financial trade
AI: Big Data / Big Compute
• Deep learning / Machine Learning /
Natural Language Processing
• Medical diagnosis, image recognition
Broad Categories
of AI
Category with most
urgency in issues of
artificial moral agency
INDIANA UNIVERSITY BLOOMINGTON
AI: Human-Machine Interaction
§ Fitness smartwatch, smart hearing aids
§ Co-bots, cyber-crews, digital twins
§ Integration of smart machines into human body in
form of computer-brain interfaces or cyborgs
AI: Autonomous and Semi-
Autonomous Actors
• Weapon systems
• Robots in deep sea and space
exploration
• Self driving cars
• Bots in financial trade
AI: Big Data / Big Compute
• Deep learning / Machine Learning /
Natural Language Processing
• Medical diagnosis, image recognition
Broad Categories
of AI
Research needed in policy
and technical extensions
that lead to greater and
more measurable forms of
accountability
INDIANA UNIVERSITY BLOOMINGTON
INTERVENTION POINTS: ENHANCED
TRUSTWORTHINESS
Developer
ethics,
development
process norms
Societal influence:
public pressure,
legislation,
regulatory
oversight AI algorithmic
knowledge
exhibiting
higher levels
of
trustworthiness
Technological
manifestation:
verifiable claims,
explainability,
accountability
Trustworthy AI is AI that is designed, developed, and used in a
manner that is lawful, fair, unbiased, accurate, reliable,
effective, safe, secure, resilient, understandable, and with
processes in place to regularly monitor and evaluate the AI
system’s performance and outcomes
Lynne Parker, Deputy US Chief Technology Officer and Director of the National Artificial Intelligence Initiative Office
ML PROCESS
M. Veale et al., CHI 2018
Data
Training
data
Feature
extraction
Test data
Learning
algorithm
Trained
model
Predict
New
data
Explain-
ability
inquiries
dev ops
RESEARCH PRODUCTS
M. Veale et al., CHI 2018
Data
Training
data
Feature
extraction
Test data
Learning
algorithm
Trained
model
Predict
New
data
Explain-
ability
inquiries
dev ops
Open science contributes to trustworthy
AI (trusted products)
The research products of AI need to
include intermediate results and
explainability services
BETH PLALE
INDIANA UNIVERSITY
PLALE@INDIANA.EDU
TRUSTWORTHY

More Related Content

PDF
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
PPTX
PDF
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
PDF
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
PDF
Linking Open Government Data at Scale
PDF
McGeary Data Curation Network: Developing and Scaling
PPTX
Ziegler Open Data in Special Collections Libraries
PPTX
DataCite: the Perfect Complement to CrossRef
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Linking Open Government Data at Scale
McGeary Data Curation Network: Developing and Scaling
Ziegler Open Data in Special Collections Libraries
DataCite: the Perfect Complement to CrossRef

What's hot (20)

PPTX
Research Data Management
PPTX
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
PPT
Lifting the Lid on Linked Data
PPT
Information Extraction and Linked Data Cloud
PPT
CrossRef And The Pursuit Of Truthiness, STM Meeting, Frankfurt, Germany, Octo...
PDF
Organizational Identifiers - Crossref LIVE Hannover
PPT
LIBER Webinar: 23 Things About Research Data Management
PPTX
DataONE Education Module 10: Legal and Policy Issues
PDF
Connecting the dots: drug information and Linked Data
PDF
Pistoia alliance harmonizing fair data catalog approaches webinar
PPT
Fox-Keynote-Now and Now of Data Publishing-nfdp13
PPTX
Washington Linked Data Authority Service at University of Houston
PDF
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
PDF
Mendeley Data FAIR hackathon
PPTX
CrossRef at SciELO15 Conference 2013
PPTX
Experience from 10 months of University Linked Data
PPTX
The Dataverse Commons
PPTX
Working with data.open.ac.uk, the Linked Data Platform of the Open University
PPTX
The State of Linked Government Data
PPTX
Research Data Sharing: A Basic Framework
Research Data Management
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Lifting the Lid on Linked Data
Information Extraction and Linked Data Cloud
CrossRef And The Pursuit Of Truthiness, STM Meeting, Frankfurt, Germany, Octo...
Organizational Identifiers - Crossref LIVE Hannover
LIBER Webinar: 23 Things About Research Data Management
DataONE Education Module 10: Legal and Policy Issues
Connecting the dots: drug information and Linked Data
Pistoia alliance harmonizing fair data catalog approaches webinar
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Washington Linked Data Authority Service at University of Houston
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
Mendeley Data FAIR hackathon
CrossRef at SciELO15 Conference 2013
Experience from 10 months of University Linked Data
The Dataverse Commons
Working with data.open.ac.uk, the Linked Data Platform of the Open University
The State of Linked Government Data
Research Data Sharing: A Basic Framework
Ad

Similar to Trustworthy AI and Open Science (20)

PPTX
Repository Federation: Towards Data Interoperability
PPTX
Solving disciplines 20200207 v2
PPTX
Milano short 20190529 v1
PDF
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
PPTX
Machines are people too
PPTX
20211103 jim spohrer oecd ai_science_productivity_panel v5
PPTX
Hicss52 20190108 v3
PDF
Artificial Intelligence in Data Curation
PPTX
Building COVID-19 Museum as Open Science Project
 
PPTX
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
PPTX
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
PPTX
Spohrer SIRs 20230511 v16.pptx
PPTX
20220228 uc merced maglio_class v14
PPTX
Milano short 20190530 v2
PPTX
2021020 jim spohrer ai for_good_conference future_of_ai v4
PDF
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
PDF
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
PPTX
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
PDF
Carpenter "The Future of the Scholarly Record"
Repository Federation: Towards Data Interoperability
Solving disciplines 20200207 v2
Milano short 20190529 v1
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Machines are people too
20211103 jim spohrer oecd ai_science_productivity_panel v5
Hicss52 20190108 v3
Artificial Intelligence in Data Curation
Building COVID-19 Museum as Open Science Project
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Spohrer SIRs 20230511 v16.pptx
20220228 uc merced maglio_class v14
Milano short 20190530 v2
2021020 jim spohrer ai for_good_conference future_of_ai v4
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Carpenter "The Future of the Scholarly Record"
Ad

More from Beth Plale (11)

PDF
Open science as roadmap to better data science research
PDF
Capsule Computing: Safe Open Science
PDF
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
PDF
HathiTrust Research Center Secure Commons
PDF
Trust threads : Active Curation and Publishing in SEAD
PDF
Trust threads: Provenance for Data Reuse in Long Tail Science
PDF
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
PDF
Plale HathiTrust El Colegio de Mexico May2014
PDF
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
PDF
Big data and open access: a collision course for science
PPTX
HathiTrust Reserach Center Nov2013
Open science as roadmap to better data science research
Capsule Computing: Safe Open Science
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
HathiTrust Research Center Secure Commons
Trust threads : Active Curation and Publishing in SEAD
Trust threads: Provenance for Data Reuse in Long Tail Science
Case Study Big Data: Socio-Technical Issues of HathiTrust Digital Texts
Plale HathiTrust El Colegio de Mexico May2014
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Big data and open access: a collision course for science
HathiTrust Reserach Center Nov2013

Recently uploaded (20)

PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Computer network topology notes for revision
PDF
Introduction to Business Data Analytics.
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Database Infoormation System (DBIS).pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
Foundation of Data Science unit number two notes
PDF
Lecture1 pattern recognition............
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
Business Acumen Training GuidePresentation.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
IB Computer Science - Internal Assessment.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Computer network topology notes for revision
Introduction to Business Data Analytics.
Moving the Public Sector (Government) to a Digital Adoption
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Database Infoormation System (DBIS).pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Foundation of Data Science unit number two notes
Lecture1 pattern recognition............
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx

Trustworthy AI and Open Science

  • 1. TRUSTWORTHY AI AND OPEN SCIENCE Beth Plale Michael A and Laurie Burns McRobbie Professor of Computer Engineering Beilstein Open Science symposium October 06, 2021 Luddy School of Informatics, Computing, and Engineering Data To Insight Center
  • 2. Observations influenced by my role (2017-2020) in the National Science Foundation working on agency policies and practice in open science. Views expressed are entirely my own. Funding agency perspective on open science: how do we bring visibility to the products of research (that we fund)
  • 3. NSF funds the collection and capture of research data through projects ranging from a few hundred thousand dollars to tens of millions of dollars. The data are maintained in a landscape of solutions to meet the needs of researchers.
  • 4. Specialist repositories - Organizational resources Generalist repositories - Organizational resources Data Portals - Low velocity data - Employs cloud resources - Employs data-compute proximity for analysis Observation networks - High velocity data - Employs cloud resources RESEARCH DATA LANDSCAPE SAGE NEON ARM HPWREN UWI LTER, OOI NEON HydroShare LTER MGDS, IRIS ICPSR QDR TAIR MDF IEDA PDB CCDC DataVerse Figshare Dryad Zenodo IRs Exemplar systems
  • 5. RESEARCH DATA LANDSCAPE Data timeliness need Researcher depth of expertise Expectation for level of curation Expectation of data longevity Specialist repositories - Organizational resources Generalist repositories - Organizational resources Data Portals - Low velocity data - Employs cloud resources - Employs data-compute proximity for analysis Observation networks - High velocity data - Employs cloud resources SAGE NEON ARM HPWREN UWI LTER, OOI NEON HydroShare LTER MGDS, IRIS ICPSR QDR TAIR MDF IEDA PDB CCDC DataVerse Figshare Dryad Zenodo IRs
  • 6. RESEARCH DATA LANDSCAPE Publisher’s view of landscape (general public view as well) Optimization for timeliness of research could suggest lower value over time Specialist repositories - Organizational resources Generalist repositories - Organizational resources Data Portals - Low velocity data - Employs cloud resources - Employs data-compute proximity for analysis Observation networks - High velocity data - Employs cloud resources SAGE NEON ARM HPWREN UWI LTER, OOI NEON HydroShare LTER MGDS, IRIS ICPSR QDR TAIR MDF IEDA PDB CCDC DataVerse Figshare Dryad Zenodo IRs
  • 7. Generalist–Aided Deposit: engages generalist curators Metadata: generalist schema Reuse potential: moderate-low as metadata is curated but general Scope: discipline agnostic scope Discovery: broad name recognition Specialist-DBMS Deposit: difficult so DB often read-only Metadata: data dictionary + DB schema Reuse potential: high potential as self contained Scope: subdiscipline scope Discovery: known within subdiscipline Specialist–Aided Deposit: engages specialist curators Metadata: specialized schema Reuse potential: high due to specialists Scope: discipline scope Discovery: known within discipline Specialist-Unaided Deposit: unaided deposit Metadata: specialized schema Reuse potential: moderate-high from discipline focus of metadata schema Scope: discipline scope Discovery: known within discipline Generalist-Unaided Deposit: unaided deposit Metadata: generalist schema Reuse potential: low as metadata is minimal Scope: discipline agnostic scope Discovery: broad name recognition i.e., institutional repositories
  • 8. Generalist–Aided Deposit: engages generalist curators Metadata: generalist schema Reuse potential: moderate-low as metadata is curated but general Scope: discipline agnostic scope Discovery: broad name recognition Specialist-DBMS Deposit: difficult so DB often read-only Metadata: data dictionary + DB schema Reuse potential: high potential as self contained Scope: subdiscipline scope Discovery: known within subdiscipline Specialist–Aided Deposit: engages specialist curators Metadata: specialized schema Reuse potential: high due to specialists Scope: discipline scope Discovery: known within discipline Specialist-Unaided Deposit: unaided deposit Metadata: specialized schema Reuse potential: moderate-high from discipline focus of metadata schema Scope: discipline scope Discovery: known within discipline Generalist-Unaided Deposit: unaided deposit Metadata: generalist schema Reuse potential: low as metadata is minimal Scope: discipline agnostic scope Discovery: broad name recognition i.e., institutional repositories
  • 9. Generalist–Aided Deposit: engages generalist curators Metadata: generalist schema Reuse potential: moderate-low as metadata is curated but general Scope: discipline agnostic scope Discovery: broad name recognition Specialist-DBMS Deposit: difficult so DB often read-only Metadata: data dictionary + DB schema Reuse potential: high potential as self contained Scope: subdiscipline scope Discovery: known within subdiscipline Specialist–Aided Deposit: engages specialist curators Metadata: specialized schema Reuse potential: high due to specialists Scope: discipline scope Discovery: known within discipline Specialist-Unaided Deposit: unaided deposit Metadata: specialized schema Reuse potential: moderate-high from discipline focus of metadata schema Scope: discipline scope Discovery: known within discipline Generalist-Unaided Deposit: unaided deposit Metadata: generalist schema Reuse potential: low as metadata is minimal Scope: discipline agnostic scope Discovery: broad name recognition i.e., institutional repositories
  • 10. FEDERAL RESEARCH DATA SUMMARY • Observation networks and data portals are a fixed part of the landscape. They have a different role in open science than do repositories • Generalist repositories are easier to use than specialist repositories • Specialist repositories have higher reusability • Generalist repositories have economies of scale • If specialist repositories can leverage generalist repositories as back ends it would reduce overall cost
  • 11. OPEN SCIENCE ROLE IN AI TRUSTWORTHINESS
  • 12. “ON ARTIFICIAL INTELLIGENCE, TRUST IS A MUST, NOT A NICE-TO-HAVE” Margrethe Vestager, the European Commission executive vice president who oversees digital policy for the 27- nation bloc
  • 13. TRUST ó TRUSTWORTHINESS TRUST • An individual’s confidence in an entity • “I trust this web site” TRUSTWORTHINESS • An entity’s state of being trustworthy or reliable • An estimate of an object’s worthiness to receive someone’s trust • Trustworthiness is difficult to accurately quantify
  • 16. INDIANA UNIVERSITY BLOOMINGTON AI: Human-Machine Interaction § Fitness smartwatch, smart hearing aids § Co-bots, cyber-crews, digital twins § Integration of smart machines into human body in form of computer-brain interfaces or cyborgs AI: Autonomous and Semi- Autonomous Actors • Weapon systems • Robots in deep sea and space exploration • Self driving cars • Bots in financial trade AI: Big Data / Big Compute • Deep learning / Machine Learning / Natural Language Processing • Medical diagnosis, image recognition Broad Categories of AI
  • 17. INDIANA UNIVERSITY BLOOMINGTON AI: Human-Machine Interaction § Fitness smartwatch, smart hearing aids § Co-bots, cyber-crews, digital twins § Integration of smart machines into human body in form of computer-brain interfaces or cyborgs AI: Autonomous and Semi- Autonomous Actors • Weapon systems • Robots in deep sea and space exploration • Self driving cars • Bots in financial trade AI: Big Data / Big Compute • Deep learning / Machine Learning / Natural Language Processing • Medical diagnosis, image recognition Broad Categories of AI Category with most urgency in issues of artificial moral agency
  • 18. INDIANA UNIVERSITY BLOOMINGTON AI: Human-Machine Interaction § Fitness smartwatch, smart hearing aids § Co-bots, cyber-crews, digital twins § Integration of smart machines into human body in form of computer-brain interfaces or cyborgs AI: Autonomous and Semi- Autonomous Actors • Weapon systems • Robots in deep sea and space exploration • Self driving cars • Bots in financial trade AI: Big Data / Big Compute • Deep learning / Machine Learning / Natural Language Processing • Medical diagnosis, image recognition Broad Categories of AI Research needed in policy and technical extensions that lead to greater and more measurable forms of accountability
  • 19. INDIANA UNIVERSITY BLOOMINGTON INTERVENTION POINTS: ENHANCED TRUSTWORTHINESS Developer ethics, development process norms Societal influence: public pressure, legislation, regulatory oversight AI algorithmic knowledge exhibiting higher levels of trustworthiness Technological manifestation: verifiable claims, explainability, accountability
  • 20. Trustworthy AI is AI that is designed, developed, and used in a manner that is lawful, fair, unbiased, accurate, reliable, effective, safe, secure, resilient, understandable, and with processes in place to regularly monitor and evaluate the AI system’s performance and outcomes Lynne Parker, Deputy US Chief Technology Officer and Director of the National Artificial Intelligence Initiative Office
  • 21. ML PROCESS M. Veale et al., CHI 2018 Data Training data Feature extraction Test data Learning algorithm Trained model Predict New data Explain- ability inquiries dev ops
  • 22. RESEARCH PRODUCTS M. Veale et al., CHI 2018 Data Training data Feature extraction Test data Learning algorithm Trained model Predict New data Explain- ability inquiries dev ops
  • 23. Open science contributes to trustworthy AI (trusted products) The research products of AI need to include intermediate results and explainability services