University of Virginia
School of Data Science
Dean Philip E. Bourne
peb6a@virginia.edu
SDS and NASA
• The state of play in academic data science
• UVA response
• School formation
• Mission
• Our data science framework
• Examples of research
• School capabilities
• Opportunities for NASA/SDS Collaboration
Increased Demand over the Past Five Years
74%
Artificial Intelligence specialists
Top industries hiring this talent: Computer software, internet,
information technology and services, higher education,
consumer electronics
37%
Data Scientist
Top industries hiring this talent: Information technology and
services, computer software, internet, financial services, higher
education
33%
Data Engineer
Top industries hiring this talent: Information technology and
services, internet, computer software, financial services,
hospital and healthcare
The Rising Demand for Data Scientists
UVA School of Data Science Graduate Job Placement*
Year       2019  2018  2017  2016  2015
Placement  100%  100%  100%  98%   97%
*for graduates seeking employment
Roles
Machine Learning Engineer, Director of Data
Science, Deep Learning Research Scientist,
Senior Data Analyst, Data Science Developer,
Consultant, Product Data Analyst, Financial
Engineer, Engagement Manager & more
Industries
● Finance
● Government
● Healthcare & Medicine
● Professional Sports
● Commerce
● Media
● Higher Ed
● Technology
We Are Not Alone
But We Are Unique
A New School for a New Century
A School Without Walls
Mission
To be a national and international leader in responsible data science
emphasizing interdisciplinary collaboration which results in
furthering discovery, sharing knowledge, and societal benefit
One Representation of Data Science
The 4+1 Model
The model is based
on the core insight
that all definitions of
data science
assume a pipeline
and that this
pipeline forms a
parallel process
[From Raf Alvarado]
One Representation of Data Science
The 4+1 Model
• Value – assuring
societal benefit
• Design – communication of the
value of data
• Systems – the means
to communicate and
convey benefit
• Analytics – models
and methods
• Practice – where
everything happens
[From Raf Alvarado]
The 4+1 Model Interplay
[From Raf Alvarado]
• Value + Design = Openness,
responsibility
• Value + Analytics = Human-centered
AI, algorithmic bias
• Value + Systems =
sustainability, access,
environmental impact
• Design + Analytics = literate
programming, visualization
• Design + Systems =
dashboards, engineering
design
• Analytics + Systems = ML
engineering
A New School for a New Century
Where we are Today
Foundation
● Residential & Online Masters in Data
Science
● Presidential Fellows
● Undergrad minor Spring 2021
● PhD programs submitted in the fall
● Hiring & recruiting leading faculty
● Research & community projects underway
● New building plan – occupy 2023
Use case – Data Integration
Researcher and Assistant Professor of
Medicine Dr. Thomas Hartka, also a current
online Masters in Data Science student, is
combining two disparate data sets—electronic
health records and DMV crash data—to save
lives after motor vehicle crashes.
“I enrolled in the MSDS program
to expand my research on
automotive safety. I have already
used techniques from classes in
my work. I hope to expand my
research to real-time analytics to
improve emergency room care.”
— Dr. Thomas Hartka, UVA
School of Medicine
Use Cases
Machine Learning powered insights
Don Brown
Project 1 - DARPA
• Monitor data from host computer logs,
authentication attempts and network
traffic from multiple enterprises, and
subject this data to optimized ML
techniques capable of detecting
anomalies that signal an intrusion
• Develop deep neural network learning
methods that do not require enterprises
to send their data to a global repository
• Preserve privacy
Project 2 - DOD
• Exploit massive amounts of contextual
data and use other aspects of the
dynamic environment
• Leverage signal processing, data fusion,
visualization, human factors and cybersecurity
• Fusion processes developed in
the Predictive Technology
Laboratory took data from multiple
sources and combined them using
hierarchical models.
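The goal stated above of learning without sending enterprise data to a global repository is commonly approached with federated learning. The slides do not name the technique used in the project, so the sketch below is only an illustration under that assumption: it uses federated averaging on a tiny linear model rather than a deep neural network, and the function names, synthetic data, and hyperparameters are all hypothetical.

```python
import random

def local_update(w, b, data, lr=0.05, epochs=20):
    """Run gradient descent on one site's private data; the data never leaves this function."""
    n = len(data)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in data:
            err = (w * x + b) - y
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b

def federated_average(sites, rounds=50):
    """Share only model parameters: average local results weighted by sample count."""
    w, b = 0.0, 0.0
    total = sum(len(d) for d in sites)
    for _ in range(rounds):
        updates = [(local_update(w, b, d), len(d)) for d in sites]
        w = sum(params[0] * n for params, n in updates) / total
        b = sum(params[1] * n for params, n in updates) / total
    return w, b

def make_site(n, rng, true_w=2.0, true_b=-1.0):
    """Synthetic stand-in for one enterprise's data (hypothetical, for illustration only)."""
    data = []
    for _ in range(n):
        x = rng.uniform(-1, 1)
        data.append((x, true_w * x + true_b + rng.gauss(0, 0.01)))
    return data

rng = random.Random(0)
sites = [make_site(n, rng) for n in (30, 50, 80)]
w, b = federated_average(sites)
print(w, b)  # should land close to the generating values 2.0 and -1.0
```

Only the two parameters `w` and `b` cross site boundaries in each round; the raw records stay where they were generated, which is the property the bullet describes.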
Use Case Presidential Fellowship with NASA
Environmental Data
Jake Malcomb and Linnea Saby
• Analyze a massive geospatial data
set collected over a two-year period
from the International Space
Station, and then parsed by an
“extreme machine learning” tool that
aims to mimic the human brain.
• Tree core samples provide temporal
information about long-term tree
growth and physiology
• ML taps geospatial data to
understand forest ecosystems
• NASA ECOSTRESS and GEDI
provide the extraordinarily large
geospatial dataset
Furthering Discovery to Build a Better World
RESEARCH
Cybersecurity
Detecting broad-spectrum cyber
threats almost immediately after
they are launched through a $7.6
million Defense Advanced
Research Projects Agency
(DARPA) grant.
Environment
Using NASA data collected aboard the
International Space Station to examine
climate change in the Shenandoah
National Forest and beyond, and find
solutions
Health & Medicine
Securing high-performance computing
equipment and personnel to allow
collaboration across the university on brain
science research like Autism, Alzheimer’s,
mental health disorders, traumatic brain
injuries and more.
Business
Discovering what makes a job
interview successful for the
candidate and the recruiter, and
how to mitigate bias in the
recruiting process
Democracy
Investigating how terrorist groups recruit
women through propaganda and
examining risk and threat assessment for
extremist violence perpetrated by women.
Education
Helping economically disadvantaged,
underrepresented populations pursue
tailored educational workforce pathways
that have a higher probability of leading
them to success.
Applying Data Science Across Industries
“To tackle challenges in science and medicine.”
— Elizabeth Driskell, MSDS ‘20
“To inform public policy and government.”
— Bradley Katcher, MSDS ‘20
“I want to use data science to find a new way of
thinking.” — Alex Gromadzki, MSDS ‘21
“I want to use data science to solve complex business
problems.” — Ruslan Askerov, MSDS ‘21
“To address poverty and income inequality.”
— Arti Patel, MSDS ‘20
SDS Faculty Research
Data Science faculty member or affiliated faculty | Website | Research interests
Nada Basit | https://engineering.virginia.edu/faculty/nada-basit | Machine Learning, Bioinformatics, Data Mining, Pattern Recognition
Phil Bourne | https://engineering.virginia.edu/faculty/philip-e-bourne | Multiscale Modeling Using Data Science Techniques; Early Stage Drug Discovery and Drug Repurposing; Early Stage Drug Methods and Tools for Macromolecular
Don Brown | https://engineering.virginia.edu/faculty/donald-e-brown-phd | Data Fusion, Knowledge Discovery, and Simulation Optimization
Sallie Keller | https://biocomplexity.virginia.edu/sallie-keller | Social and decision informatics, statistical underpinnings of data science, and data access and confidentiality
Daniel Mietchen | https://tools.wmflabs.org/scholia/author/Q20895785 | Computational Biology, Biodiversity integrating research workflows with the World Wide Web through open licensing, open standards, and open collaboration via
Rafael Alvarado | http://transducer.ontoligent.com/ | Cultural Analytics and Machine Learning, Digital Humanities, Text Analysis
Heman Shakeri | https://www.hemanshakeri.com/ | Structure and function of interconnected networks, often expressed via graphs that comprise a set of nodes and a set of connections between them
Jonathan Kropko | https://facultydirectory.virginia.edu/faculty/jk8sd | Methods to examine historical data, to test theories of voting in U.S. presidential elections, and to handle nonresponse in surveys
Michael Porter | https://engineering.virginia.edu/faculty/michael-d-porter | Event prediction, pattern and anomaly detection, and data linkage; applications for Criminology, Transportation, Terrorism, Defense, Security, Forensics, Business
Mohammad Fallahi-Sichani | new hire | Designing and building new experimental and computational tools to enable the analysis, interpretation and rational modulation of multi-scale processes that
Jack Van Horn | https://scholar.google.com/citations?user=i9bGqbgAAAAJ&hl=en | Psychology and Data Science, Cognitive Neuroscience
Pete Alonzi | https://github.com/alonzi |
Vicente Ordonez | https://engineering.virginia.edu/faculty/vicente-ordonez-roman | Computer Vision, Natural Language Processing and Machine Learning
Tim Clark | https://scholar.google.com/citations?user=k-iwlCUAAAAJ&hl=en | Next generation approaches for biomedical communications and data integration, including semantically integrated data repositories, claims and
Gerard Learmonth | https://www.researchgate.net/profile/Gerard_Learmonth | Generation and testing of pseudorandom number generators; Abstract database design; Strategic applications of information systems and technology
Hongning Wang | http://www.cs.virginia.edu/~hw5x/ | Data mining, machine learning, and information retrieval, with a special emphasis on computational user behavior modeling
Stephen Adams | http://www.nsfcvdi.org/wordpress/cvdi_personnel/steven-adams-ph-d/ | Adaptive Decision Systems Lab at UVA and his research is applied to several domains including activity recognition, prognostics and health management for manufacturing
Aidong Zhang | https://engineering.virginia.edu/faculty/aidong-zhang | ML, Data mining, bioinformatics
Jundong Li | http://people.virginia.edu/~jl6qk/ | Data Mining, Machine Learning, Social Computing, and Deep Learning
Brian Wright | https://www.linkedin.com/in/brian-wright-ph-d-90063027/ |
2020 Capstone Projects
Org sponsor | Capstone project
Markel Corporation | Machine Learning Based Approaches to Predict Customer Churn for an Insurance Company
UVA Health System | Analyzing the Composition of Diabetes Patients and Impact of Seasonal and Climate Trends on Emergency Room Utilization in Central Virginia
Met Museum | Exploring Themes and Bias in Art using Machine Learning Image Analysis
Raytheon | Machine Learning for Real-Time Vehicle Detection in All-Electronic Tolling System
Capital One | Evaluating and Improving Attrition Models for the Retail Banking Industry
Babylon Farms | A Digital Green Thumb: Neural Networks to Monitor Hydroponic Plant Growth
S&P Global | An Exploration and Characterization of Financial Performance of Standard and Poor’s 500 Index Constituents Led By Female CEOs
UVA School of Medicine/McManus | Geographic Access to HIV Care
Corning | Natural Language Processing for Company Financial Communication Style
Westrock | Enhancing Promotion Decisions using Classification and Network-based Methods
Capital One - dual degree | Retailer’s Dilemma: Personalized Product Marketing to Maximize Revenue
LMI | Document Retrieval Using Deep Learning
X Mode Social | Applying Mobile Location Data to Improve Hurricane Evacuation Plans
Smart C-ville | The Deployment of a LoRaWAN-Based IoT Air Quality Sensor Network for Public Good
Fortive | Modeling Client Churn for Small Business-to-Business Firms
Politics | Lessons Learned: A Case Study in Creating a Data Pipeline using Twitter’s API
School of Architecture | Analyzing Pre-Trained Neural Network Behavior with Layer Activation Optimization
Biomedical Engineering | Deep Learning of Protein Structural Classes: Any Evidence for an ‘Urfold’?
Clarabridge | A Comparative Study of the Performance of Unsupervised Text Segmentation Techniques on Dialogue Transcripts
Growing the School
[Timeline figure. Milestones: M.S. in Data Science (Residential & Online); Undergraduate courses, increasing to 18 courses per AY; Ph.D. program; Undergraduate major; Building occupied; Exec. Ed. Year labels: 2020, 2020-2023, 2021, 2023. Team size (FTEs): 5, 40, 60, 80, 120.]
Why Responsible Data Science?
• A defining feature
• A partnership between STEM,
social sciences and the
humanities
• Where UVA has strength
SDS and NASA
• Course or short course, including NASA content
• Funded and collaborative research
• Faculty, Capstone, Presidential Fellowship
• Armed Forces Admits – MSDS, PhD
• Cybersecurity Joint Hire – Faculty
• Diversity partnership
• Secure facility
QUESTIONS?
peb6a@virginia.edu
@pebourne

