SlideShare a Scribd company logo
Hacking for innovation to solve grand challenges
Dr. Sabine Brunswicker | Elizabeth Thompson | Matt Harris| Jia Lin Cheoh |
Satyam Mukherjee | other RCODI team members
July 27th, 2020
Research Center for
Open Digital Innovation (RCODI)
Covid-19 Data Science Challenges – Powered by IronHacks:
Info Session #3
Award Number: #1462044
Agenda
ā–Ŗ Background: Who we are
ā–Ŗ What is IronHacks - its past and its future!
ā–Ŗ The COVID-19 Data Science Challenges 2020
ā–Ŗ The COVID-19 Challenge: Summer 2020: Tasks, process, etc.
ā–Ŗ Q&A
Agenda
Connected
everything
Analytics/AI
/ML
Cloud
Open platforms for
collective innovation
Social
media
Mobile
Background
Digital technologies afford more distributed and open forms of organizing the design
and use of new products/services
RCODI aims to empower individuals to use digital technologies to solve grand
challenges
Background
We are group of researchers and practitioners with interdisciplinary skills
Dr. Sabine Brunswicker
Director, RCODI
Elizabeth Thompson
PhD Student
Researcher
Jia Lin Cheoh
PhD Student Researcher
Background
Get to know the core group of people behind IronHacks 2020
Matt Harris
Platform
Developer
Marlen Promann
PhD Student Researcher
Satyam Mukherjee
RedHat Fellow
Shatayu
Undergraduate Student
Researcher
About IronHacks:
What is it - The past and its future
In the past, we thought innovation is about special people walled off
behind the R&D walls of large companies...
A challenge
A digital innovation
Digital platforms offer new ways to give hands to individuals to tackle
difficult problems
Source: https://flic.kr/p/Wd54U
Ironhacks is a platform that allows people around the world
to use data to solve problems
AFFORDABLE
HOUSING
Build a website with
interactive
visualizations that
helps new students in
finding safe and
affordable housing
near their university.
HEALTHY
LIVING
Use open data and
develop an app that
helps citizens in
finding cheap and
seasonally fresh
vegetables from
local markets.
About IronHacks
The IronHacks tasks focus on societal challenges that matter to all of us
COVID-19
Build a statistical
model using social
movement,
distancing and
infection data to
predict COVID-19
impact.
About IronHacks
The IronHacks supports a new hacking experience -
it is not a traditional hack
Traditional ā€˜Hackathons’ The Purdue IronHacks
36 hrs of high energy; less creativity
~ 1 month: 5x high-energy hacking + time for
creativity
Proprietary code: no sharing with
others
Open access: shared and can build on others’
code and data models
Mentorship during the 36 hrs;
Feedback on final solution
5 iterations: hackers get valuable technical,
market, and user feedback from data science
experts; constant guidance
Constant physical presence and
interaction
Virtual presence and communication;
hackers can work on problems when they wish
INNOVATE WITH UNIQUE DATA: Use unique, real-world and actual datasets to
create novel, cool, and performative models and visualizations that tackle real
challenges
OUTPERFORM & LEARN FROM OTHERS: IronHacks is a competition, but you also
engage in community guided by support from our team
HACK VIRTUALLY AT SCALE: IronHacks offers a novel cloud-based environment that
allows you to perform data analysis at scale
GAIN & FAME IN MANY WAYS: Learn, improve your score, gain reputation and win
a prize.
About IronHacks
Multiphase: hack, breathe, learn, iterate, and win...
About IronHacks
In the past we focused on web-app development and mash-up designs using datahe
Key technologies/programming languages:
HTML, CSS, Javascript, D3.js
Key skills/performance focus
1) Web programming
2) Interactive mash-up design (map, charts,
user input)
3) Visualization quality & novelty
The IronHacks process is iterative; there are 3 to 5+ phases; at each phase
your progress will be evaluated
About IronHacks
*Note: This is a sample timeline
Registration
Cloud-based editor Forum
The IronHacks platform has a range of unique features to facilitate participants
during the hacking process...
About IronHacks
Dashboard
About IronHacks
In the platform, participants can utilize a no setup coding environment...
File structure
& templates
Editor to create source code
Hosted app (page view)
Easy submission & github integration
The platform is designed for scalability and integrated process supporting
automated scoring and expert rating
Scalability
Integrated process (no set up
environment)
User experience
Real-time feedback
Supports randomized field experiments
and trace data collection
About IronHacks
About IronHacks
Since 2015, we have been growing; we ran hacks in US, Colombia, and China
4
2015:
Pilot
2016:
Pilot
2016 2016 2017
22 26 68
200
Numbers represent participants per hack
2017 2018
Bubble represents the number of participants at
the IronHacks competition, globally.
108 130
2019
54
Empowering Citizens
Existing processes for open data contests/hacking primarily focus on triggering
competition and neglect the power of digital technologies
DIGITAL PLATFORMS
In our research we design
and examine digital
platforms with the goal to
increase the productivity of
programmers.
REAL WORLD
IMPACT:
Supporting programmers
to design models and
visualizations that
support real-world
decisionmaking.
CURRENT HACKING:
...neglect the constituting
role of digital platforms
that encourage collective
learning and augment
human creativity.
Brunswicker, S., Jensen, B., Song, Z., & Majchrzak, A. (2018). Transparency as design choice of open data
contests. Journal of the Association for Information Science and Technology, 69(10), 1205–1222.
https://doi.org/10.1002/asi.24033
Career
opportunities
IronHacks
Certificate
Prize
money
Class
Credit
About IronHacks
In the past, we were able to offer financial and non-financial incentives
Global
recognition
In the past, we worked with corporate and government sponsors; this allowed
individuals to gain reputation
About IronHacks
About IronHacks
Our past editor supported building webapps in the cloud
File structure
& templates
Editor to create source code
Hosted app (page view)
Easy submission & github integration
In the hack in spring 2019, the students faced a housing challenge: ā€œFind a safe and
affordable place to live in New York City (NYC)ā€
Imagine you are living in New York City (NYC) as a new student at NYU Stern School of Business and you
want help others that have no knowledge about the city to find an area to live that is safe, affordable
and close to the university.ā€
TASK: Develop a mashup that integrates, analyses and interactively displays open data in order to support new
students to evaluate and rank the 59 habitable districts in New York. Your mashup should allow the new students
to rank and compare the various districts using a diverse set of decision parameters. Your app should allow students to
find the top 10 districts in New York City based single parameters ranking and the top 3 districts based on average of
the three following parameters:
1. Safety
2. Distance to NYU Stern School of Business (Note: the distance must be from the center of the district to NYU Stern
School of Business).
3. Affordability (Note: affordability levels of a district are based on the maximum number of low income defined
by New York City).
ā€œFind me a safe and affordable district to live near the NYU Stern School of
Business, New York.ā€
About IronHacks
App 3
About IronHacks
About IronHacks
Today, the IronHacks platform offers a state of the art data science environment: Your
workspace will be a JupyterLab
Key technologies/programming languages:
Jupyter notebook, BigQuery, R, Python, SQL
Key skills/performance focus:
ā— Descriptive data analysis
ā— Data visualization for decision making
ā— Predictive modeling (Regressions,time
series, simulations)
JupyterLab 1.0: Jupyter’s Next-Generation Notebook Interface
ā— JupyterLab is a web-based interactive development
environment for Jupyter notebooks, code, and data
ā— JupyterLab support a wide range of workflows in data
science, scientific computing, and machine learning.
ā— JupyterLab is extensible and modular
About IronHacks
...so we are somewhat similar to Kaggle :-) ...
The COVID-19 Data Science Challenges
– Powered by IronHacks
The COVID-19 Hack - Powered by IronHacks
Predictions are essential during this COVID-19 pandemic; however, they might change
from week to week due to non-linear behavior
COVID-19 Challenges
Powered by IronHacks
Summer and Fall 2020
Sign-up today to predict the COVID-19
social and economic risks for citizens in
Indiana and other states of the US based
on actual data
The COVID-19 Hack – Powered by IronHacks
Registration opens
Tutorials and
Q&A start
Start of
competition
(min 60 ideally 100
committed
participants)
Submission 1 Submission 2 Submission 3 Submission 4
Score
release 1
Score
release 2
Score
release 3
The COVID-19 Hack – Powered by IronHacks
Both hacks will follow an iterative process
Award ceremony
In our hack, you will use a range of BIG data sets to predict visiting patterns at point of
interests/core places in different counties
Key datasets (subset of eligible ones)
1. Core Places Patterns
2. SafeGraph Places
Patterns
3. Social Distancing
Metrics
4. Transaction Data
1. Average
housing Price
per
Neighborhood
(county-level, historical)
(neighborhood level, yearly)
(POI level)
1. COVID-cases
2. COVID-deaths
(zip-code level,
daily updated)
The COVID-19 Hack – Powered by IronHacks
1. Unemployment claims
(zip-code level)
2. Skill-data
To make sure that you are not struggling with the data, we have
invested pro-processing and have created BigQuery datasets
In addition to providing you with access to pre-processed datasets, we will also
preinstall a large number of libraries
Libraries in R
1. BigRQuery
2. DBI
3. Tidyverse packages ( e.g. dplyr, tibble, stringr)
4. Lubridate
5. Data.table
6. Matrix
7. Zoo
8. Xts
9. Stats
10. Lmtest
11. Car
12. Sandwich
13. AER
14. DYNLM
15. ADLR
16. Forecast
17. Tidymodels
18. Lme4/lme- Linear and Non-linear mixed effects models
19. Caret
20. Ggvis
21. htmlwidgets
The COVID-19 Hack – Powered by IronHacks
1. Pandas
2. Numpy
3. Scipy
4. Scikit-Learn
5. Statsmodels
6. Matplotlib
7. Seaborn
8. Plotly
9. Bokeh
10. Pydot
11. ...
Libraries in Python
Your task will be about predictions using different pre-processed datasets; the task will
be revealed at the start of the hack
Example...
Develop and execute a Python or R code in a Jupyter notebook that predicts COVID-19
impact using the datasets available via BigQuery (e.g. spending data, mobility data):
1. Prediction using actual data: Build a model to predict the visitor count/foot traffic for
the coming week using a statistical model of your choice.
2. Visualization: Show your predictions so that end-users can understand the historic
and the future COVID-19 risk!
The COVID-19 Hack – Powered by IronHacks
You will submit your results in a csv file along with your Jupyter notebook and other
complementary information
The COVID-19 Hack – Powered by IronHacks
.ipynb
.ipynb
To score/rank the submission, we plan to judge the submissions in four categories; we
combine objective machine-readable metrics with human expert judging
Usefulness of
visualization
• Presents
predictions of top
crowded places in
a useful way
Performance Evaluation
Accuracy of
prediction
• Accuracy of
model in
predicting (Mean
Square Error)
Quality of modeling/
reproducibility
• Includes model
description/logic
• Reproducibility of
results
• Quality of code
The COVID-19 Hack – Powered by IronHacks
We will offer tutorials specific for the task after registration has opened; in addition, a
compilation of important online resources will help you to get ready for the hack
The COVID-19 Hack – Powered by IronHacks
Jupyter Lab
BigQuery &
Jupyter
Notebooks
SQL
Essentials
Tbd: Dates
& times
And lots of additional curated online links..
Top 3 Solutions
Improvement
Spirit
We will reward you in different categories, depending on your and your peers
performance
Brase, G. L. (2009). How different types of participant payments alter task performance. Judgment and
Decision Making, 4(5), 419.
In total, we plan to have 100 participants complete the hack; we plan for a prize budget of ~ 3000 USD
The COVID-19 Hack – Powered by IronHacks
0 to 25 percentile
25 to 50 percentile
50 to 75 percentile
75 to 100 percentile
Increasing
pay
Other Award
Categories
The COVID-19 Hack – Powered by IronHacks
There are a range of benefits
For Citizens:
A high-quality Covid-19
model based on real data and
key insights about metrics
like social distancing can help
keep families informed on
the best actions for them to
take for themselves and how
doing so can protect others!
For COVID-19 Research:
New insights into nature of COVID-19
from a social and economic
perspective
For Policymakers:
Policymakers can utilize IronHacks
results to help make critical
decisions on budgets, health
policy and other critical decisions
during challenging times.
The COVID-19 Hack – Powered by IronHacks
Here are some of our partners for the COVID-19: Data Science Challenges
The COVID-19 Hack – Powered by IronHacks
What should you do next?
Let’s do it
About RCODI:
We are suggesting a task that focuses on prediction of visitor counts in Indiana
ā€œWhen and where are the most crowded places in our counties?ā€
The scenario: Imagine you are part of a government task force to assess the COVID-19
risk emerging from ā€œcrowdingā€ at public places. Using data analysis/data science skills
you want to help the government to understand and predict the visitor account on a
weekly basis to support real-time decision-making.
Your tasks:
Develop and execute a Python or R code in a Jupyter notebook that presents the results
in numeric values and visualizations related to the following three subtasks:
1. Historic description: Where are the places with the greatest decrease/increase in
visitor count since March 2020 relative to a baseline of 2019?
2. Prediction for following week: Build a model to predict the visitor count/foot traffic
for the coming week using a statistical model of your choice.
3. Visualization: Show your predictions in a spatial-temporal way so that end-users can
understand the historic and the future COVID-19 risk!
The COVID-19 Hack - Powered by IronHacks
About Me and RCODI
I founded RCODI in 2014 as a university-wide center; today, it brings together about 20
researchers and fellows from different disciplines and countries
Advisory Board
Center Director
Core Faculty
Affiliated Faculty
Purdue
Affiliated Faculty in the
US, Europe, and Asia
Members (donations
and data transfer)
Academic, research
members
Performance metrics
• Intellectual contributions to
scholarship of digital
innovation (Publications)
• Research grant applications
and research expenditures
• Successful graduated
students in the area of
open digital innovation
• Undergraduate research
support/fellows
• International relationships
with academia and industry
• Endowments
• Real world impact and
advocacy
Connected
everything
Analytics/
Big Data
Cloud
Open platforms
for collective
innovation
Social
media
Mobile
About Me and RCODI
The phenomenon of open digital innovation motivates our research team
Open digital innovation describes open and collective forms of organizing the design
and use of new products, enabled through digital platforms/technologies.
About Me and RCODI
Scientifically we span the boundary of social sciences (including economics), data
science (focus network science) and information systems
Dr. Sabine
Brunswicker Director,
RCODI
Elizabeth Thompson
PhD Student
Researcher
Jia Lin Cheoh
PhD Student Researcher
IronHacks Team
Some of RCODI’s team members are involved in IronHacks 2020
Matt Harris
Platform
Developer
Marlen Promann
PhD Student Researcher
Satyam Mukherjee
RedHat Fellow
Shatayu
Undergraduate
Student Researcher

More Related Content

PPTX
Opening Up Innovation at NASA (NASA's Open Innovation Toolkit and Experience)
PPTX
Open Innovation: The Secret to Unlocking Cross Industry Innovation - Steve Ra...
PDF
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
PDF
First QuantUniversity Online Summit -2020
PDF
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
PDF
Understanding the New World of Cognitive Computing
PDF
Graph visualization options and latest developments
PDF
1. The Importance of Graphs in Government
Ā 
Opening Up Innovation at NASA (NASA's Open Innovation Toolkit and Experience)
Open Innovation: The Secret to Unlocking Cross Industry Innovation - Steve Ra...
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
First QuantUniversity Online Summit -2020
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Understanding the New World of Cognitive Computing
Graph visualization options and latest developments
1. The Importance of Graphs in Government
Ā 

What's hot (20)

PPTX
Citizen Science And a Manufacturing Revolution: Major trends research notes
PDF
Experiments With Knowledge Graphs in Fisheries & Oceans Canada
Ā 
PDF
BioIT World 2016 - HPC Trends from the Trenches
PDF
Applications of Machine Learning at USC
PDF
Translating the Human Analog to Digital with Graphs
Ā 
PDF
new_kitching_cv
PPTX
Top 5 Deep Learning and AI Stories - November 3, 2017
Ā 
PDF
The International Information Conference on Search, Data Mining and Visualiza...
PPT
The Archives Forum - The National Archives - 02 March 2011
PPTX
Ai open powermeetupmarch25th
Ā 
PPT
Querying Heterogeneous Datasets on the Linked Data Web
PDF
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
PDF
Big Data for Defense and Security
Ā 
PPT
Making sense of big data
PDF
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
PDF
Crowdsourcing Approaches to Big Data Curation for Earth Sciences
PPTX
National seminar on emergence of internet of things (io t) trends and challe...
PDF
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
PDF
The Big Data Value PPP: A Standardisation Opportunity for Europe
PDF
How it works- Data Science
Citizen Science And a Manufacturing Revolution: Major trends research notes
Experiments With Knowledge Graphs in Fisheries & Oceans Canada
Ā 
BioIT World 2016 - HPC Trends from the Trenches
Applications of Machine Learning at USC
Translating the Human Analog to Digital with Graphs
Ā 
new_kitching_cv
Top 5 Deep Learning and AI Stories - November 3, 2017
Ā 
The International Information Conference on Search, Data Mining and Visualiza...
The Archives Forum - The National Archives - 02 March 2011
Ai open powermeetupmarch25th
Ā 
Querying Heterogeneous Datasets on the Linked Data Web
IC-SDV 2019: The Economics of Artificial Intelligence and Machine Learning fo...
Big Data for Defense and Security
Ā 
Making sense of big data
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
Crowdsourcing Approaches to Big Data Curation for Earth Sciences
National seminar on emergence of internet of things (io t) trends and challe...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
The Big Data Value PPP: A Standardisation Opportunity for Europe
How it works- Data Science

Similar to IronHacks Live: Info session #3 - COVID-19 Data Science Challenge (20)

PDF
2018 23-03 purdue-unal kick-off-final-final
PDF
Building an Equitable Tech Future - By ThoughtWorks Brisbane
PPTX
Purdue unal iron hacks 2018 spring - award ceremony
PDF
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Ā 
PPTX
Sweden future of ai 20180921 v7
Ā 
PPTX
SEDAtalk February 2025: Impact and implications of GenerativeAI
Ā 
PPTX
SEDAtalk February 2025: Impact and implications of GenerativeAI
Ā 
PDF
Open Source Software Development by TLV Partners
PDF
Open source presentation
PPTX
20210128 jim spohrer ai house_fund v4
Ā 
PPTX
Open Source Insight: GitHub Finds 4M Flaws, IAST Magic Quadrant, 2018 Open So...
PPTX
Ai open powermeetupmarch25th
Ā 
PPTX
Ai open powermeetupmarch25th
Ā 
PDF
Clip Studio Paint EX Download (Latest 2025)
PDF
Final Cut Pro 11.1.0 for MacOS Free Download
PPTX
KUBRICK Graphs: A journey from in vogue to success-ion
Ā 
PDF
Deep Freeze Enterprise Crack FREE Download
PDF
Aiseesoft Video Converter Ultimate 10.9.6 Download
PDF
CCleaner Pro 6.33 Crack + Key Free Download 2025
PDF
ACS Summer Institute - Emerging Roles of Librarians - 14_0731
2018 23-03 purdue-unal kick-off-final-final
Building an Equitable Tech Future - By ThoughtWorks Brisbane
Purdue unal iron hacks 2018 spring - award ceremony
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Ā 
Sweden future of ai 20180921 v7
Ā 
SEDAtalk February 2025: Impact and implications of GenerativeAI
Ā 
SEDAtalk February 2025: Impact and implications of GenerativeAI
Ā 
Open Source Software Development by TLV Partners
Open source presentation
20210128 jim spohrer ai house_fund v4
Ā 
Open Source Insight: GitHub Finds 4M Flaws, IAST Magic Quadrant, 2018 Open So...
Ai open powermeetupmarch25th
Ā 
Ai open powermeetupmarch25th
Ā 
Clip Studio Paint EX Download (Latest 2025)
Final Cut Pro 11.1.0 for MacOS Free Download
KUBRICK Graphs: A journey from in vogue to success-ion
Ā 
Deep Freeze Enterprise Crack FREE Download
Aiseesoft Video Converter Ultimate 10.9.6 Download
CCleaner Pro 6.33 Crack + Key Free Download 2025
ACS Summer Institute - Emerging Roles of Librarians - 14_0731

More from Purdue RCODI (19)

PPTX
Berkeley-Teaching Engineers about OI_final.pptx
PPTX
Purdue unal iron hacks 2019 spring - award ceremony (1)
PPTX
FACT - A New Way to Get News
PDF
2018 purdue cupl IronHacks 2018_hack challenge_final
PDF
2018 purdue unal IronHacks 2018_hack challenge_final
PDF
Purdue IronHacks CUPL kickoff
PDF
2017 dec-1-purdue-unal gold iron hacks 2017 - award ceremony Final version
PDF
Purdue Gold IronHacks Fall 2017 Kick-off Presentation
PDF
October 9, 2017 Purdue-UNAL Kick-off Presentation
PDF
Open Data Hacking.Tech499 Class Info Session
PDF
August 29, 2017 Purdue Honors College Call-Out Presentation
PDF
Purdue UNAL Gold IronHacks May 2017 Award Ceremony
PDF
Purdue Gold IronHacks April 2017 Award Ceremony
PDF
Purdue IronHacks
PDF
Purdue IronHacks
PDF
Open Source and Open Innovation - Dr. Sabine Brunswicker - Red Hat Summit 2016
PPTX
Shaping the Future of Open Innovation
PDF
Purdue IronHacks
PPTX
The Purdue IronHacks
Berkeley-Teaching Engineers about OI_final.pptx
Purdue unal iron hacks 2019 spring - award ceremony (1)
FACT - A New Way to Get News
2018 purdue cupl IronHacks 2018_hack challenge_final
2018 purdue unal IronHacks 2018_hack challenge_final
Purdue IronHacks CUPL kickoff
2017 dec-1-purdue-unal gold iron hacks 2017 - award ceremony Final version
Purdue Gold IronHacks Fall 2017 Kick-off Presentation
October 9, 2017 Purdue-UNAL Kick-off Presentation
Open Data Hacking.Tech499 Class Info Session
August 29, 2017 Purdue Honors College Call-Out Presentation
Purdue UNAL Gold IronHacks May 2017 Award Ceremony
Purdue Gold IronHacks April 2017 Award Ceremony
Purdue IronHacks
Purdue IronHacks
Open Source and Open Innovation - Dr. Sabine Brunswicker - Red Hat Summit 2016
Shaping the Future of Open Innovation
Purdue IronHacks
The Purdue IronHacks

Recently uploaded (20)

PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Computer network topology notes for revision
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PDF
Foundation of Data Science unit number two notes
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Mega Projects Data Mega Projects Data
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Lecture1 pattern recognition............
PDF
Clinical guidelines as a resource for EBP(1).pdf
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Fluorescence-microscope_Botany_detailed content
Supervised vs unsupervised machine learning algorithms
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Moving the Public Sector (Government) to a Digital Adoption
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Computer network topology notes for revision
Business Acumen Training GuidePresentation.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Foundation of Data Science unit number two notes
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Mega Projects Data Mega Projects Data
Miokarditis (Inflamasi pada Otot Jantung)
STUDY DESIGN details- Lt Col Maksud (21).pptx
Reliability_Chapter_ presentation 1221.5784
Lecture1 pattern recognition............
Clinical guidelines as a resource for EBP(1).pdf

IronHacks Live: Info session #3 - COVID-19 Data Science Challenge

  • 1. Hacking for innovation to solve grand challenges Dr. Sabine Brunswicker | Elizabeth Thompson | Matt Harris| Jia Lin Cheoh | Satyam Mukherjee | other RCODI team members July 27th, 2020 Research Center for Open Digital Innovation (RCODI) Covid-19 Data Science Challenges – Powered by IronHacks: Info Session #3 Award Number: #1462044
  • 2. Agenda ā–Ŗ Background: Who we are ā–Ŗ What is IronHacks - its past and its future! ā–Ŗ The COVID-19 Data Science Challenges 2020 ā–Ŗ The COVID-19 Challenge: Summer 2020: Tasks, process, etc. ā–Ŗ Q&A Agenda
  • 3. Connected everything Analytics/AI /ML Cloud Open platforms for collective innovation Social media Mobile Background Digital technologies afford more distributed and open forms of organizing the design and use of new products/services RCODI aims to empower individuals to use digital technologies to solve grand challenges
  • 4. Background We are group of researchers and practitioners with interdisciplinary skills
  • 5. Dr. Sabine Brunswicker Director, RCODI Elizabeth Thompson PhD Student Researcher Jia Lin Cheoh PhD Student Researcher Background Get to know the core group of people behind IronHacks 2020 Matt Harris Platform Developer Marlen Promann PhD Student Researcher Satyam Mukherjee RedHat Fellow Shatayu Undergraduate Student Researcher
  • 6. About IronHacks: What is it - The past and its future
  • 7. In the past, we thought innovation is about special people walled off behind the R&D walls of large companies...
  • 8. A challenge A digital innovation Digital platforms offer new ways to give hands to individuals to tackle difficult problems
  • 9. Source: https://flic.kr/p/Wd54U Ironhacks is a platform that allows people around the world to use data to solve problems
  • 10. AFFORDABLE HOUSING Build a website with interactive visualizations that helps new students in finding safe and affordable housing near their university. HEALTHY LIVING Use open data and develop an app that helps citizens in finding cheap and seasonally fresh vegetables from local markets. About IronHacks The IronHacks tasks focus on societal challenges that matter to all of us COVID-19 Build a statistical model using social movement, distancing and infection data to predict COVID-19 impact.
  • 11. About IronHacks The IronHacks supports a new hacking experience - it is not a traditional hack Traditional ā€˜Hackathons’ The Purdue IronHacks 36 hrs of high energy; less creativity ~ 1 month: 5x high-energy hacking + time for creativity Proprietary code: no sharing with others Open access: shared and can build on others’ code and data models Mentorship during the 36 hrs; Feedback on final solution 5 iterations: hackers get valuable technical, market, and user feedback from data science experts; constant guidance Constant physical presence and interaction Virtual presence and communication; hackers can work on problems when they wish
  • 12. INNOVATE WITH UNIQUE DATA: Use unique, real-world and actual datasets to create novel, cool, and performative models and visualizations that tackle real challenges OUTPERFORM & LEARN FROM OTHERS: IronHacks is a competition, but you also engage in community guided by support from our team HACK VIRTUALLY AT SCALE: IronHacks offers a novel cloud-based environment that allows you to perform data analysis at scale GAIN & FAME IN MANY WAYS: Learn, improve your score, gain reputation and win a prize. About IronHacks Multiphase: hack, breathe, learn, iterate, and win...
  • 13. About IronHacks In the past we focused on web-app development and mash-up designs using datahe Key technologies/programming languages: HTML, CSS, Javascript, D3.js Key skills/performance focus 1) Web programming 2) Interactive mash-up design (map, charts, user input) 3) Visualization quality & novelty
  • 14. The IronHacks process is iterative; there are 3 to 5+ phases; at each phase your progress will be evaluated About IronHacks *Note: This is a sample timeline
  • 15. Registration Cloud-based editor Forum The IronHacks platform has a range of unique features to facilitate participants during the hacking process... About IronHacks Dashboard
  • 16. About IronHacks In the platform, participants can utilize a no setup coding environment... File structure & templates Editor to create source code Hosted app (page view) Easy submission & github integration
  • 17. The platform is designed for scalability and integrated process supporting automated scoring and expert rating Scalability Integrated process (no set up environment) User experience Real-time feedback Supports randomized field experiments and trace data collection About IronHacks
  • 18. About IronHacks Since 2015, we have been growing; we ran hacks in US, Colombia, and China 4 2015: Pilot 2016: Pilot 2016 2016 2017 22 26 68 200 Numbers represent participants per hack 2017 2018 Bubble represents the number of participants at the IronHacks competition, globally. 108 130 2019 54
  • 19. Empowering Citizens Existing processes for open data contests/hacking primarily focus on triggering competition and neglect the power of digital technologies DIGITAL PLATFORMS In our research we design and examine digital platforms with the goal to increase the productivity of programmers. REAL WORLD IMPACT: Supporting programmers to design models and visualizations that support real-world decisionmaking. CURRENT HACKING: ...neglect the constituting role of digital platforms that encourage collective learning and augment human creativity. Brunswicker, S., Jensen, B., Song, Z., & Majchrzak, A. (2018). Transparency as design choice of open data contests. Journal of the Association for Information Science and Technology, 69(10), 1205–1222. https://doi.org/10.1002/asi.24033
  • 20. Career opportunities IronHacks Certificate Prize money Class Credit About IronHacks In the past, we were able to offer financial and non-financial incentives Global recognition
  • 21. In the past, we worked with corporate and government sponsors; this allowed individuals to gain reputation About IronHacks
  • 22. About IronHacks Our past editor supported building webapps in the cloud File structure & templates Editor to create source code Hosted app (page view) Easy submission & github integration
  • 23. In the hack in spring 2019, the students faced a housing challenge: ā€œFind a safe and affordable place to live in New York City (NYC)ā€ Imagine you are living in New York City (NYC) as a new student at NYU Stern School of Business and you want help others that have no knowledge about the city to find an area to live that is safe, affordable and close to the university.ā€ TASK: Develop a mashup that integrates, analyses and interactively displays open data in order to support new students to evaluate and rank the 59 habitable districts in New York. Your mashup should allow the new students to rank and compare the various districts using a diverse set of decision parameters. Your app should allow students to find the top 10 districts in New York City based single parameters ranking and the top 3 districts based on average of the three following parameters: 1. Safety 2. Distance to NYU Stern School of Business (Note: the distance must be from the center of the district to NYU Stern School of Business). 3. Affordability (Note: affordability levels of a district are based on the maximum number of low income defined by New York City). ā€œFind me a safe and affordable district to live near the NYU Stern School of Business, New York.ā€ About IronHacks
  • 25. About IronHacks Today, the IronHacks platform offers a state of the art data science environment: Your workspace will be a JupyterLab Key technologies/programming languages: Jupyter notebook, BigQuery, R, Python, SQL Key skills/performance focus: ā— Descriptive data analysis ā— Data visualization for decision making ā— Predictive modeling (Regressions,time series, simulations) JupyterLab 1.0: Jupyter’s Next-Generation Notebook Interface ā— JupyterLab is a web-based interactive development environment for Jupyter notebooks, code, and data ā— JupyterLab support a wide range of workflows in data science, scientific computing, and machine learning. ā— JupyterLab is extensible and modular
  • 26. About IronHacks ...so we are somewhat similar to Kaggle :-) ...
  • 27. The COVID-19 Data Science Challenges – Powered by IronHacks
  • 28. The COVID-19 Hack - Powered by IronHacks Predictions are essential during this COVID-19 pandemic; however, they might change from week to week due to non-linear behavior
  • 29. COVID-19 Challenges Powered by IronHacks Summer and Fall 2020 Sign-up today to predict the COVID-19 social and economic risks for citizens in Indiana and other states of the US based on actual data The COVID-19 Hack – Powered by IronHacks
  • 30. Registration opens Tutorials and Q&A start Start of competition (min 60 ideally 100 committed participants) Submission 1 Submission 2 Submission 3 Submission 4 Score release 1 Score release 2 Score release 3 The COVID-19 Hack – Powered by IronHacks Both hacks will follow an iterative process Award ceremony
  • 31. In our hack, you will use a range of BIG data sets to predict visiting patterns at point of interests/core places in different counties Key datasets (subset of eligible ones) 1. Core Places Patterns 2. SafeGraph Places Patterns 3. Social Distancing Metrics 4. Transaction Data 1. Average housing Price per Neighborhood (county-level, historical) (neighborhood level, yearly) (POI level) 1. COVID-cases 2. COVID-deaths (zip-code level, daily updated) The COVID-19 Hack – Powered by IronHacks 1. Unemployment claims (zip-code level) 2. Skill-data To make sure that you are not struggling with the data, we have invested pro-processing and have created BigQuery datasets
  • 32. In addition to providing you with access to pre-processed datasets, we will also preinstall a large number of libraries Libraries in R 1. BigRQuery 2. DBI 3. Tidyverse packages ( e.g. dplyr, tibble, stringr) 4. Lubridate 5. Data.table 6. Matrix 7. Zoo 8. Xts 9. Stats 10. Lmtest 11. Car 12. Sandwich 13. AER 14. DYNLM 15. ADLR 16. Forecast 17. Tidymodels 18. Lme4/lme- Linear and Non-linear mixed effects models 19. Caret 20. Ggvis 21. htmlwidgets The COVID-19 Hack – Powered by IronHacks 1. Pandas 2. Numpy 3. Scipy 4. Scikit-Learn 5. Statsmodels 6. Matplotlib 7. Seaborn 8. Plotly 9. Bokeh 10. Pydot 11. ... Libraries in Python
  • 33. Your task will be about predictions using different pre-processed datasets; the task will be revealed at the start of the hack Example... Develop and execute a Python or R code in a Jupyter notebook that predicts COVID-19 impact using the datasets available via BigQuery (e.g. spending data, mobility data): 1. Prediction using actual data: Build a model to predict the visitor count/foot traffic for the coming week using a statistical model of your choice. 2. Visualization: Show your predictions so that end-users can understand the historic and the future COVID-19 risk! The COVID-19 Hack – Powered by IronHacks
  • 34. You will submit your results in a csv file along with your Jupyter notebook and other complementary information The COVID-19 Hack – Powered by IronHacks .ipynb .ipynb
  • 35. To score/rank the submission, we plan to judge the submissions in four categories; we combine objective machine-readable metrics with human expert judging Usefulness of visualization • Presents predictions of top crowded places in a useful way Performance Evaluation Accuracy of prediction • Accuracy of model in predicting (Mean Square Error) Quality of modeling/ reproducibility • Includes model description/logic • Reproducibility of results • Quality of code The COVID-19 Hack – Powered by IronHacks
  • 36. We will offer tutorials specific for the task after registration has opened; in addition, a compilation of important online resources will help you to get ready for the hack The COVID-19 Hack – Powered by IronHacks Jupyter Lab BigQuery & Jupyter Notebooks SQL Essentials Tbd: Dates & times And lots of additional curated online links..
  • 37. Top 3 Solutions Improvement Spirit We will reward you in different categories, depending on your and your peers performance Brase, G. L. (2009). How different types of participant payments alter task performance. Judgment and Decision Making, 4(5), 419. In total, we plan to have 100 participants complete the hack; we plan for a prize budget of ~ 3000 USD The COVID-19 Hack – Powered by IronHacks 0 to 25 percentile 25 to 50 percentile 50 to 75 percentile 75 to 100 percentile Increasing pay Other Award Categories
  • 38. The COVID-19 Hack – Powered by IronHacks There are a range of benefits For Citizens: A high-quality Covid-19 model based on real data and key insights about metrics like social distancing can help keep families informed on the best actions for them to take for themselves and how doing so can protect others! For COVID-19 Research: New insights into nature of COVID-19 from a social and economic perspective For Policymakers: Policymakers can utilize IronHacks results to help make critical decisions on budgets, health policy and other critical decisions during challenging times.
  • 39. The COVID-19 Hack – Powered by IronHacks Here are some of our partners for the COVID-19: Data Science Challenges
  • 40. The COVID-19 Hack – Powered by IronHacks What should you do next?
  • 43. We are suggesting a task that focuses on prediction of visitor counts in Indiana ā€œWhen and where are the most crowded places in our counties?ā€ The scenario: Imagine you are part of a government task force to assess the COVID-19 risk emerging from ā€œcrowdingā€ at public places. Using data analysis/data science skills you want to help the government to understand and predict the visitor account on a weekly basis to support real-time decision-making. Your tasks: Develop and execute a Python or R code in a Jupyter notebook that presents the results in numeric values and visualizations related to the following three subtasks: 1. Historic description: Where are the places with the greatest decrease/increase in visitor count since March 2020 relative to a baseline of 2019? 2. Prediction for following week: Build a model to predict the visitor count/foot traffic for the coming week using a statistical model of your choice. 3. Visualization: Show your predictions in a spatial-temporal way so that end-users can understand the historic and the future COVID-19 risk! The COVID-19 Hack - Powered by IronHacks
  • 44. About Me and RCODI I founded RCODI in 2014 as a university-wide center; today, it brings together about 20 researchers and fellows from different disciplines and countries Advisory Board Center Director Core Faculty Affiliated Faculty Purdue Affiliated Faculty in the US, Europe, and Asia Members (donations and data transfer) Academic, research members Performance metrics • Intellectual contributions to scholarship of digital innovation (Publications) • Research grant applications and research expenditures • Successful graduated students in the area of open digital innovation • Undergraduate research support/fellows • International relationships with academia and industry • Endowments • Real world impact and advocacy
  • 45. Connected everything Analytics/ Big Data Cloud Open platforms for collective innovation Social media Mobile About Me and RCODI The phenomenon of open digital innovation motivates our research team Open digital innovation describes open and collective forms of organizing the design and use of new products, enabled through digital platforms/technologies.
  • 46. About Me and RCODI Scientifically we span the boundary of social sciences (including economics), data science (focus network science) and information systems
  • 47. Dr. Sabine Brunswicker Director, RCODI Elizabeth Thompson PhD Student Researcher Jia Lin Cheoh PhD Student Researcher IronHacks Team Some of RCODI’s team members are involved in IronHacks 2020 Matt Harris Platform Developer Marlen Promann PhD Student Researcher Satyam Mukherjee RedHat Fellow Shatayu Undergraduate Student Researcher