SlideShare a Scribd company logo
ß
Urban Data Science @ UW
2
“It’s a great time to be a data geek.”
-- Roger Barga, Microsoft Research
“The greatest minds of my generation are trying
to figure out how to make people click on ads”
-- Jeff Hammerbacher, co-founder, Cloudera
The Fourth Paradigm
1. Empirical + experimental
2. Theoretical
3. Computational
4. Data-Intensive
Jim Gray
7/13/2015 Bill Howe, UW 3
“All across our campus, the process of discovery will increasingly rely on
researchers’ ability to extract knowledge from vast amounts of data… In order
to remain at the forefront, UW must be a leader in advancing these
techniques and technologies, and in making [them] accessible to researchers
in the broadest imaginable range of fields.”
2005-2008
In other words:
• Data-driven discovery will be ubiquitous
• UW must be a leader in inventing the
capabilities
• UW must be a leader in translational
activities – in putting these capabilities to
work
• It’s about intellectual infrastructure (human capital) and software
infrastructure (shared tools and services – digital capital)
A 5-year, US$37.8 million cross-institutional
collaboration to create a data science environment
5
2014
7/13/2015 Bill Howe, UW 7
Data Science Kickoff Session:
137 posters from 30+ departments and units
8
PIs on Moore/Sloan effort
+ eScience Institute
Steering Committee
+ UW participants in
February 7 Data Science
poster session
Broad collaborations
Establish a virtuous cycle
• 6 working groups, each with
• 3-6 faculty from each institution
10
Assessing Community Well-Being
Third-Place Technologies
Optimization of King County Metro Paratransit
Computer Science & Engineering
Predictors of Permanent Housing for Homeless Families
Bill and Melinda Gates Foundation
Open Sidewalk Graph for Accessible Trip Planning
Electrical Engineering
11
1. Form a City/University collaboration within their respective
community memorialized in a Memorandum of
Understanding;
2. Appoint a representative from each partner responsible for
maintaining the collaboration;
3. Through the collaboration, identify and undertake at least
three research, development and deployment projects
within the coming year (by May 2016);
4. Participate as a founding member of the Metro Lab
Network through workshops and other knowledge sharing
activities (see Metro Lab Network SUMMARY).
Seattle crime map using open data, UW EE ugrad
Jay Feng
13
14
Charlie Catlett
OneBusAway:
Transit Traveler Information
Systems
Alan Borning
Dept of Computer Science and
Engineering
University of Washington
Design Use Build – University of
Washington
University of Washington
University of Washington
Usage
 Started as a grad student project by Brian
Ferris and Kari Watkins; became their PhD
dissertations
 Over 100,000 unique weekly users in Puget
Sound
 Deployments in Atlanta, Tampa, versions in
New York and Detroit; experimental
deployment in Washington DC
 Goal: OneBusAway Foundation to provide
long-term stability and support
University of Washington
18

More Related Content

PPTX
Data Science and Urban Science @ UW
PPTX
Big Data Talent in Academic and Industry R&D
PPTX
Data Science, Data Curation, and Human-Data Interaction
PPTX
Big Data Curricula at the UW eScience Institute, JSM 2013
PPTX
Science Data, Responsibly
PPTX
Data, Responsibly: The Next Decade of Data Science
PPTX
The Other HPC: High Productivity Computing
PPTX
Intro to Data Science Concepts
Data Science and Urban Science @ UW
Big Data Talent in Academic and Industry R&D
Data Science, Data Curation, and Human-Data Interaction
Big Data Curricula at the UW eScience Institute, JSM 2013
Science Data, Responsibly
Data, Responsibly: The Next Decade of Data Science
The Other HPC: High Productivity Computing
Intro to Data Science Concepts

What's hot (20)

PPTX
MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)
PPTX
Democratizing Data Science in the Cloud
PPTX
Tragedy of the (Data) Commons
PPTX
Machines are people too
PDF
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
PPTX
Tragedy of the Data Commons (ODSC-East, 2021)
PPTX
Knowledge Graph Semantics/Interoperability
PPTX
Thoughts on Knowledge Graphs & Deeper Provenance
PPTX
The Roots: Linked data and the foundations of successful Agriculture Data
PPTX
A Blind Date With (Big) Data: Student Data in (Higher) Education
PPTX
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
PPTX
The Challenge of Deeper Knowledge Graphs for Science
PPTX
Thinking About the Making of Data
PPTX
The Future(s) of the World Wide Web
PPT
Broad Data (India 2015)
PDF
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
PPTX
What Can Happen when Genome Sciences Meets Data Sciences?
PDF
Data stories
PPT
The Semantic Web: It's for Real
PDF
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
MMDS 2014: Myria (and Scalable Graph Clustering with RelaxMap)
Democratizing Data Science in the Cloud
Tragedy of the (Data) Commons
Machines are people too
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Tragedy of the Data Commons (ODSC-East, 2021)
Knowledge Graph Semantics/Interoperability
Thoughts on Knowledge Graphs & Deeper Provenance
The Roots: Linked data and the foundations of successful Agriculture Data
A Blind Date With (Big) Data: Student Data in (Higher) Education
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
The Challenge of Deeper Knowledge Graphs for Science
Thinking About the Making of Data
The Future(s) of the World Wide Web
Broad Data (India 2015)
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
What Can Happen when Genome Sciences Meets Data Sciences?
Data stories
The Semantic Web: It's for Real
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Ad

Similar to Urban Data Science at UW (20)

PPTX
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
PDF
Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...
PDF
New and Emerging Forms of Data
PPTX
Lauren Michael: The Missing Millions Democratizing Computation and Data ...
PPTX
Rdaeu russia_fg_1_july2014_final
PDF
Twist
PDF
Research Metadata Mechanics - Simon Porter
PDF
CSS-Intro-Lecture.pdf
PDF
African Open Science Platform: Pilot Phase
PDF
Panel: The Global Research Platform: An Overview
PPTX
Big data divided (24 march2014)
PDF
Researcher Reliance on Digital Libraries: A Descriptive Analysis
PDF
Download Complete The Data Journalism Handbook First Edition Liliana Bounegry...
PPTX
Data Science Meets Biomedicine, Does Anything Change
PPTX
Chapter 16
PPTX
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
PPTX
Ppt shark global forum session 3 2012 v4
PPTX
Web and Complex Systems Lab @ Kno.e.sis
PDF
2013 Melbourne Software Freedom Day talk - FOSS in Public Decision Making
PDF
The Data Journalism Handbook First Edition Liliana Bounegry
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
Crowdsourcing: A Geographic Approach to Identifying Policy Opportunities and ...
New and Emerging Forms of Data
Lauren Michael: The Missing Millions Democratizing Computation and Data ...
Rdaeu russia_fg_1_july2014_final
Twist
Research Metadata Mechanics - Simon Porter
CSS-Intro-Lecture.pdf
African Open Science Platform: Pilot Phase
Panel: The Global Research Platform: An Overview
Big data divided (24 march2014)
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Download Complete The Data Journalism Handbook First Edition Liliana Bounegry...
Data Science Meets Biomedicine, Does Anything Change
Chapter 16
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
Ppt shark global forum session 3 2012 v4
Web and Complex Systems Lab @ Kno.e.sis
2013 Melbourne Software Freedom Day talk - FOSS in Public Decision Making
The Data Journalism Handbook First Edition Liliana Bounegry
Ad

More from University of Washington (20)

PPTX
Database Agnostic Workload Management (CIDR 2019)
PPTX
Data Responsibly: The next decade of data science
PPTX
Thoughts on Big Data and more for the WA State Legislature
PPTX
The Other HPC: High Productivity Computing in Polystore Environments
PPTX
Big Data + Big Sim: Query Processing over Unstructured CFD Models
PPTX
Big Data Middleware: CIDR 2015 Gong Show Talk, David Maier, Bill Howe
PPTX
XLDB South America Keynote: eScience Institute and Myria
PPTX
Myria: Analytics-as-a-Service for (Data) Scientists
PPTX
eResearch New Zealand Keynote
PPTX
Data science curricula at UW
PPTX
Enabling Collaborative Research Data Management with SQLShare
PPTX
Virtual Appliances, Cloud Computing, and Reproducible Research
PPT
End-to-End eScience
PPT
HaLoop: Efficient Iterative Processing on Large-Scale Clusters
PPT
Query-Driven Visualization in the Cloud with MapReduce
PPT
Visual Data Analytics in the Cloud for Exploratory Science
PPT
A New Partnership for Cross-Scale, Cross-Domain eScience
PPT
Data-Intensive Scalable Science
PPT
Research Dataspaces: Pay-as-you-go Integration and Analysis
PPT
SQL is Dead; Long Live SQL: Lightweight Query Services for Long Tail Science
Database Agnostic Workload Management (CIDR 2019)
Data Responsibly: The next decade of data science
Thoughts on Big Data and more for the WA State Legislature
The Other HPC: High Productivity Computing in Polystore Environments
Big Data + Big Sim: Query Processing over Unstructured CFD Models
Big Data Middleware: CIDR 2015 Gong Show Talk, David Maier, Bill Howe
XLDB South America Keynote: eScience Institute and Myria
Myria: Analytics-as-a-Service for (Data) Scientists
eResearch New Zealand Keynote
Data science curricula at UW
Enabling Collaborative Research Data Management with SQLShare
Virtual Appliances, Cloud Computing, and Reproducible Research
End-to-End eScience
HaLoop: Efficient Iterative Processing on Large-Scale Clusters
Query-Driven Visualization in the Cloud with MapReduce
Visual Data Analytics in the Cloud for Exploratory Science
A New Partnership for Cross-Scale, Cross-Domain eScience
Data-Intensive Scalable Science
Research Dataspaces: Pay-as-you-go Integration and Analysis
SQL is Dead; Long Live SQL: Lightweight Query Services for Long Tail Science

Recently uploaded (20)

PPTX
C1 cut-Methane and it's Derivatives.pptx
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Microbiology with diagram medical studies .pptx
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PPTX
Fluid dynamics vivavoce presentation of prakash
PPTX
Science Quipper for lesson in grade 8 Matatag Curriculum
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
The scientific heritage No 166 (166) (2025)
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PPTX
Overview of calcium in human muscles.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
BIOMOLECULES PPT........................
PDF
Sciences of Europe No 170 (2025)
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
. Radiology Case Scenariosssssssssssssss
C1 cut-Methane and it's Derivatives.pptx
ECG_Course_Presentation د.محمد صقران ppt
Microbiology with diagram medical studies .pptx
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
Fluid dynamics vivavoce presentation of prakash
Science Quipper for lesson in grade 8 Matatag Curriculum
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Introduction to Cardiovascular system_structure and functions-1
The scientific heritage No 166 (166) (2025)
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
Overview of calcium in human muscles.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
BIOMOLECULES PPT........................
Sciences of Europe No 170 (2025)
lecture 2026 of Sjogren's syndrome l .pdf
POSITIONING IN OPERATION THEATRE ROOM.ppt
neck nodes and dissection types and lymph nodes levels
. Radiology Case Scenariosssssssssssssss

Urban Data Science at UW

  • 2. 2 “It’s a great time to be a data geek.” -- Roger Barga, Microsoft Research “The greatest minds of my generation are trying to figure out how to make people click on ads” -- Jeff Hammerbacher, co-founder, Cloudera
  • 3. The Fourth Paradigm 1. Empirical + experimental 2. Theoretical 3. Computational 4. Data-Intensive Jim Gray 7/13/2015 Bill Howe, UW 3
  • 4. “All across our campus, the process of discovery will increasingly rely on researchers’ ability to extract knowledge from vast amounts of data… In order to remain at the forefront, UW must be a leader in advancing these techniques and technologies, and in making [them] accessible to researchers in the broadest imaginable range of fields.” 2005-2008 In other words: • Data-driven discovery will be ubiquitous • UW must be a leader in inventing the capabilities • UW must be a leader in translational activities – in putting these capabilities to work • It’s about intellectual infrastructure (human capital) and software infrastructure (shared tools and services – digital capital)
  • 5. A 5-year, US$37.8 million cross-institutional collaboration to create a data science environment 5 2014
  • 6. 7/13/2015 Bill Howe, UW 7 Data Science Kickoff Session: 137 posters from 30+ departments and units
  • 7. 8 PIs on Moore/Sloan effort + eScience Institute Steering Committee + UW participants in February 7 Data Science poster session Broad collaborations
  • 8. Establish a virtuous cycle • 6 working groups, each with • 3-6 faculty from each institution
  • 9. 10 Assessing Community Well-Being Third-Place Technologies Optimization of King County Metro Paratransit Computer Science & Engineering Predictors of Permanent Housing for Homeless Families Bill and Melinda Gates Foundation Open Sidewalk Graph for Accessible Trip Planning Electrical Engineering
  • 10. 11 1. Form a City/University collaboration within their respective community memorialized in a Memorandum of Understanding; 2. Appoint a representative from each partner responsible for maintaining the collaboration; 3. Through the collaboration, identify and undertake at least three research, development and deployment projects within the coming year (by May 2016); 4. Participate as a founding member of the Metro Lab Network through workshops and other knowledge sharing activities (see Metro Lab Network SUMMARY).
  • 11. Seattle crime map using open data, UW EE ugrad Jay Feng
  • 12. 13
  • 14. OneBusAway: Transit Traveler Information Systems Alan Borning Dept of Computer Science and Engineering University of Washington Design Use Build – University of Washington
  • 16. University of Washington Usage  Started as a grad student project by Brian Ferris and Kari Watkins; became their PhD dissertations  Over 100,000 unique weekly users in Puget Sound  Deployments in Atlanta, Tampa, versions in New York and Detroit; experimental deployment in Washington DC  Goal: OneBusAway Foundation to provide long-term stability and support

Editor's Notes

  • #4: 3
  • #6: Institutional change rather than specific research projects
  • #7: Institutional change rather than specific research projects