Big Data and Open Data Reuse
by Nonprofits for the Creation of
Sustainable Social Services
Nonprofit Technology Conference, Austin TX
Wed March 4, 2015 10:30 AM
Schedule: http://sched.co/1z1r
Eval: 15NTCSessionEval?c=1208
Hashtag: #15NTCReuseData
Who We Are – TechSoup Global
March 4 2015 · Open Data Reuse by Nonprofits · #15NTCreusedata
TechSoup Global is a nonprofit serving
the nonprofit community worldwide.
We have built nonprofit sector capacity through
technology for 25 years.
We are working toward a time when every social benefit
organization on the planet has the technology,
resources, and knowledge they need to operate at their
full potential.
Who We Are
• Steve Nagoski - Data Scientist
• Michael Enos - Director of Community and Platform
Who You Are & What You Care About
How do we Sustainably Connect our Information & Insights?
• Stories of Success – Collaboration Panel
• Questions About Open Data & Sustainability
Use #15NTCreusedata & Question Cards & Q&A
Data Reuse by Nonprofits
• Big Data & Open Data Trends
• Open Data Concerns
• Case Study: Balkans Data Academy
• Case Studies: Digital Humanitarians
• Data Science and Machine Learning
• Case Study: Hunger Index
• Sustainability of Open Information Initiatives
“The purpose of computing is insight,
not numbers.”
-Richard Hamming, 1961
Data Trends – Long Term
“What a computer is to me is it’s the most
remarkable tool that we’ve ever come up with,
and it’s the equivalent of a bicycle for our minds.”
- Steve Jobs, 1990
Big Data Trends - Global
• # of orgs and governments operating “Data Driven” grows
every year, instrumenting & collecting broader data to
make smarter decisions
• Online connectivity:
─ 350B SMS Messages/mo
─ 1.5T App Messages/mo (WhatsApp)
─ 15B Tweets/mo
─ 30B unique Facebook shares/mo
─ 3B Internet Users worldwide (40%), growing 8% YoY
• Cloud Storage makes storing 100PB/org affordable
─ Facebook, Microsoft, Amazon, Twitter, Thousands more.
─ Millions in the next 2 years
• New Analysis Tools are Efficient at those sizes
Open Data Trends - Global
• 2013 : G8 signs Open Data Charter
• 2014 : G20 pledge:
─ advance open data as weapon against corruption
• 2014 : UN recognizes need for “Data Revolution”
Still a LONG way to go
• 8% of participating countries publish spending figures
• 6% publish open data on government contracts
• 3% publish open data on ownership of companies
• Many Open Data initiatives not yet sustaining, growing
─ OpenDataBarometer.org, Jan 2015
Open Data Trends - US
• White House hires first Chief Data Scientist @dpatil
• Obama keynotes O’Reilly Strata conference Feb 2015
─ “Understanding and Innovating with Data has the potential to
change the way we do almost anything for the better”
https://www.youtube.com/watch?v=vbb-AjiXyh0
• 135,000 open govt datasets available at Data.gov
─ Weather, Maps, Healthcare, Political Funding, Census
• Collaboration between NGOs (Why) & Data Scientists
(How) & Analysts/Engineers (What) to deliver stronger
insights
Open Data Concerns - US
• Privacy vs Accountability & Transparency
─ Most open data is anonymized for Privacy
▪ Census
▪ Public Services Usage Info
▪ Driving Traffic Patterns
─ Some must be detailed for Accountability
▪ Health Inspection Data for Restaurants
▪ Campaign Finance data for Politicians
─ Some we have committed to record for Accountability but
have not put collection/access systems in place
▪ Police Shootings and/or Deaths Records
▪ Public Access to Police Event Video
Open Data Concerns
• Misuse of Open Data and Misinterpretation
• Correlation != Causation
“The temptation to form
premature theories upon
insufficient data is the
bane of our profession.”
– Sherlock Holmes
“Torture the data, and it
will confess to anything.”
– Ronald Coase
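The "Correlation != Causation" warning can be made concrete with a small sketch. The two yearly series below are synthetic, invented purely for illustration (they are not from the talk), and the names `food_bank_visits` and `smartphone_sales` are hypothetical. Both simply rise over time, so their levels correlate strongly even though neither drives the other; comparing year-to-year changes instead removes the shared trend.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def yearly_changes(series):
    """Differences between consecutive values, which removes a linear trend."""
    return [later - earlier for earlier, later in zip(series, series[1:])]


# Synthetic data (assumption): two unrelated quantities that both grow each year.
food_bank_visits = [100, 110, 121, 130, 142, 150, 161, 170]
smartphone_sales = [20, 23, 24, 27, 28, 31, 32, 35]

r_levels = pearson(food_bank_visits, smartphone_sales)
r_changes = pearson(yearly_changes(food_bank_visits),
                    yearly_changes(smartphone_sales))

print(f"correlation of yearly levels:  {r_levels:+.2f}")
print(f"correlation of yearly changes: {r_changes:+.2f}")
```

With this made-up data the levels correlate almost perfectly while the yearly changes do not move together at all, which is the kind of check worth running before reading a causal story into two open datasets.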
Data Reuse by Nonprofits
• Big Data & Open Data Trends
• Open Data Concerns
• Case Study: Balkans Data Academy
• Case Studies: Digital Humanitarians
• Data Science and Machine Learning
• Case Study: Hunger Index
• Sustainability of Open Data Initiatives
Balkans Data Academy : Why / Why Not?
• 1 week Hackathon in Sarajevo Aug 2014
─ expose Bosnian election data to voters
• Project managed by TechSoup Foundation + Local Civic
Activists ZastoNe https://www.youtube.com/watch?v=BcxgAOCFppY
• Team – 15 people from 7 different Nonprofit Orgs w/
different skills + 1 common goal
• Set up framework for future Data Academies, expand
footprint, enable more local NGOs to expand project
Balkans Data Academy : What
• Outcomes – Success!
─ Database & API Created, Open Source Project - GitHub
─ Data now easy to reload and expand
─ Website Created
─ Introduction Video created
• Next Steps
─ Use for live data in October 2014 Election
─ Collaborate & Train to expand local nonprofit capabilities in
future Academies
Digital Humanitarians
Feb 2015, Dr. Patrick Meier
• The Rise of Digital Humanitarians
• The Rise of Big Crisis Data
• Crowd Computing Satellite & Aerial Imagery
• Artificial Intelligence applied to Disaster Response
• Verifying Big Crisis Data – Dealing with False Data
• Dictators vs Digital Humanitarians (Egypt, China, Iran)
http://iRevolution.net http://DigitalHumanitarians.com #DigitalJedis
Digital Humanitarians – Haiti Earthquake 2010
Digital Humanitarians – Philippines 2012
HDX – Ebola, West Africa, Feb 2015
Resistance to AI / Machine Learning
• Oct 2010: Crowdsourcerer vs Muggles
“How Harry Potter Explains Humanitarian Crowd-Sourcing”
What is Machine Learning + AI Today
• Predictive Modeling + Threshold Automation
• Abuse prevention in Financial Svcs, Social Media
– Spam
– Personal/Community Abuse
– Fraud
– AML - Anti Money Laundering
– ATO - Account Take Over detection
• Detecting False Data
• Stitching Many sources to get the truest picture
• Constantly Adjusting, Measuring, Improving
– Learning from False Positives and False Negatives; finding the most valuable Measures
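"Predictive Modeling + Threshold Automation" and "Learning from False Positives, Negatives" can be sketched in a few lines. Everything below is an illustrative assumption rather than a system described in the talk: the scores stand in for the output of some abuse or spam model, the labels are a made-up ground truth, and `confusion_counts` is a hypothetical helper name.

```python
def confusion_counts(scores, labels, threshold):
    """Count (true positives, false positives, false negatives, true negatives)
    when every item scoring at or above `threshold` is automatically flagged."""
    tp = fp = fn = tn = 0
    for score, is_abuse in zip(scores, labels):
        flagged = score >= threshold
        if flagged and is_abuse:
            tp += 1
        elif flagged:
            fp += 1
        elif is_abuse:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Synthetic model scores and hand-labelled truth (1 = abuse, 0 = legitimate).
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 0, 1, 1, 0, 1, 0, 0, 0]

for threshold in (0.25, 0.50, 0.85):
    tp, fp, fn, tn = confusion_counts(scores, labels, threshold)
    print(f"threshold {threshold:.2f}: TP={tp} FP={fp} FN={fn} TN={tn}")
```

Raising the threshold trades false positives for false negatives. Measuring both after each adjustment is the "constantly adjusting, measuring, improving" loop from the slide.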
Applying Machine Learning to #OpenData
• Counting Tents in Refugee Camp Satellite Images
• Stitching together area images from UAV cameras
• Translation Services for Global Responses
• Identifying unreliable/false posts in Social Media
• Smart Geolocation with minimal input metadata
Data Reuse by Nonprofits
• Big Data & Open Data Trends
• Open Data Concerns
• Case Study: Balkans Data Academy
• Case Studies: Digital Humanitarians
• Data Science and Machine Learning
• Case Study: Hunger Index
• Sustainability of Open Data Initiatives
Hunger Index - What problems are we trying to solve?
• Are Food Assistance Providers achieving our goals?
• How do we forecast and communicate the need for food?
• How can food assistance programs make better decisions
about programs and investments?
[Diagram: Total Meals Required, split into Meals Purchased, Food Assistance, and Missing Meals]
What is the Hunger Index?
• An aggregate measure of the need for food by the most
vulnerable members of a community.
• An index for comparing performance year-to-year and
region-to-region.
• A measure of how well we are serving those in need in
our community.
• Began in 2007 in Santa Clara and San Mateo Counties,
expanding to Alameda, Sonoma and Santa Cruz Counties
Hunger Index Methodology: Components
▪ Scope – Community, Income and Time Range
▪ TMR – Total Meals Required
▪ MP – Meals Purchased
▪ FAP – Food Assistance Provided
▪ TNF – Total Need for Food Assistance
▪ MM – Missing Meals
▪ HI – Hunger Index
• Counties
The Hunger Index
Hunger Index Methodology: Vulnerable Population
Scope
Geography
Time range
Income Demographics
http://www.census.gov/acs/www/
Hunger Index Methodology: TMR
TMR: Total Meals Required
• Households with Incomes < $50K
• Average Household Size
–Table B25010
–Santa Clara County 2010 = 2.94 persons/household
• Number of Meals per year =
1095/person/year
Hunger Index Example: TMR, Santa Clara County 2010
Annual Income         Households   Meals Required (millions)
0 thru $10,000            26,848    86.4
$10,000 to $20,000        38,863   125.1
$20,000 to $30,000        40,182   129.4
$30,000 to $40,000        38,351   123.5
$40,000 to $50,000        40,967   131.9
Total                    185,211   596.3
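The TMR table can be reproduced from the stated inputs: households per income bracket, the average household size of 2.94, and 1095 meals per person per year. The sketch below is one way to do that arithmetic; the variable and function names are illustrative, not taken from the original analysis.

```python
MEALS_PER_PERSON_PER_YEAR = 3 * 365  # 1095, as stated on the TMR slide
AVG_HOUSEHOLD_SIZE = 2.94            # Santa Clara County 2010, ACS table B25010

# Households per income bracket, copied from the example table.
HOUSEHOLDS = {
    "0 thru $10,000": 26_848,
    "$10,000 to $20,000": 38_863,
    "$20,000 to $30,000": 40_182,
    "$30,000 to $40,000": 38_351,
    "$40,000 to $50,000": 40_967,
}


def meals_required(households):
    """Meals per year needed by `households` households of average size."""
    return households * AVG_HOUSEHOLD_SIZE * MEALS_PER_PERSON_PER_YEAR


tmr_by_bracket = {bracket: meals_required(n) for bracket, n in HOUSEHOLDS.items()}
tmr_total = sum(tmr_by_bracket.values())

for bracket, meals in tmr_by_bracket.items():
    print(f"{bracket:>20}: {meals / 1e6:6.1f} million meals")
print(f"{'Total':>20}: {tmr_total / 1e6:6.1f} million meals")
```

Each bracket matches the table. The unrounded total comes to about 596.2 million, the figure used in the Final Calculations slide; the 596.3 in the table is what results from adding the already-rounded rows.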
Methodology: Meals Purchased (MP)
• From Consumer Expenditure Survey
– http://www.bls.gov/cex/csxstnd.htm
• No. of Households * Average Annual
Expenditure per household
• Important Correction: Subtract SNAP
purchases.
http://www.cdss.ca.gov/research/PG352.htm
• Divide by Cost of a Meal to get Meals
Purchased
http://www.cnpp.usda.gov/usdafoodcost-home.htm
Example MP Data: Santa Clara County 2010
Annual Income (000)   Households   Average Annual Expenditure on Food
0 thru $10                26,848   $3,189
$10 to $20                38,863   $3,413
$20 to $30                40,182   $4,008
$30 to $40                38,351   $4,883
$40 to $50                40,967   $5,515
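The Meals Purchased steps (households times average food expenditure, minus SNAP purchases, divided by the cost of a meal) can be sketched as below. The household and expenditure figures come from the table. The slides do not give the SNAP dollar amount or the cost per meal, so `example_snap_dollars` and `example_cost_per_meal` are placeholder values chosen only to make the sketch run, and its result is not the deck's 299.6M figure.

```python
# (households, average annual food expenditure in dollars) per income bracket,
# copied from the example table.
MP_ROWS = [
    (26_848, 3_189),
    (38_863, 3_413),
    (40_182, 4_008),
    (38_351, 4_883),
    (40_967, 5_515),
]


def total_food_spend(rows):
    """Total annual food spending in dollars across all brackets."""
    return sum(households * spend for households, spend in rows)


def meals_purchased(rows, snap_dollars, cost_per_meal):
    """Food spending net of SNAP purchases, converted to a number of meals."""
    return (total_food_spend(rows) - snap_dollars) / cost_per_meal


spend = total_food_spend(MP_ROWS)
print(f"Total food spending: ${spend:,}")

# Placeholder inputs (assumptions, NOT from the slides).
example_snap_dollars = 100_000_000
example_cost_per_meal = 2.50
mp = meals_purchased(MP_ROWS, example_snap_dollars, example_cost_per_meal)
print(f"Meals purchased with placeholder inputs: {mp / 1e6:.1f} million")
```

Subtracting SNAP here avoids double counting, since SNAP also appears as a source under Food Assistance Provided.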
Methodology: Food Assistance Provided (FAP)
• Data in different formats normalized to
meals
• Time range
• For SC and SM Counties
– Food Banks, SNAP, WIC, Government School Meal
Programs, Senior Nutrition, CACFP
Example FAP: Santa Clara County 2010
Source                      Meals (millions)
SNAP                         81.4
Second Harvest Food Bank     24.7
School meals                 21.3
WIC                          14.1
CACFP                         4.7
Other                         1.6
Total (FAP)                 147.8
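Once every source has been normalized to meals, the FAP total is a plain sum and each source's share follows directly. A minimal sketch using the figures from the table (the dictionary name is illustrative):

```python
# Meals provided per source, in millions, copied from the example table.
FAP_MILLIONS = {
    "SNAP": 81.4,
    "Second Harvest Food Bank": 24.7,
    "School meals": 21.3,
    "WIC": 14.1,
    "CACFP": 4.7,
    "Other": 1.6,
}

fap_total = sum(FAP_MILLIONS.values())
shares = {source: 100 * meals / fap_total for source, meals in FAP_MILLIONS.items()}

for source, meals in FAP_MILLIONS.items():
    print(f"{source:<26}{meals:6.1f}M  {shares[source]:5.1f}%")
print(f"{'Total (FAP)':<26}{fap_total:6.1f}M")
```

SNAP (CalFresh in California) accounts for a little over half of all assistance in this example.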
Final Calculations
TNF: Total Need for Food Assistance
TNF = TMR – MP
296.6M = 596.2M – 299.6M
MM: Missing Meals
MM = TNF – FAP
148.8M = 296.6M – 147.8M
HI: Hunger Index
HI = MM/TNF
0.502 or 50.2%
Example Final Calc: Santa Clara County 2010
Component                        Meals (millions)
TMR: Total Meals Required        596.2
MP: Meals Purchased              299.6
FAP: Food Assistance Provided    147.8
TNF: Total Need for Food         296.6
MM: Missing Meals                148.8
HI: Hunger Index                 0.502 (ratio)
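The three formulas (TNF = TMR – MP, MM = TNF – FAP, HI = MM/TNF) fit naturally into one small structure. The sketch below uses the example values in millions of meals; the class name `HungerIndexInputs` and its field names are illustrative, not part of the original methodology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HungerIndexInputs:
    """The three measured inputs, all in the same unit (here, millions of meals)."""
    tmr: float  # Total Meals Required
    mp: float   # Meals Purchased
    fap: float  # Food Assistance Provided

    @property
    def tnf(self):
        """Total Need for Food Assistance: TNF = TMR - MP."""
        return self.tmr - self.mp

    @property
    def mm(self):
        """Missing Meals: MM = TNF - FAP."""
        return self.tnf - self.fap

    @property
    def hi(self):
        """Hunger Index: the share of the need that goes unmet, HI = MM / TNF."""
        return self.mm / self.tnf


santa_clara = HungerIndexInputs(tmr=596.2, mp=299.6, fap=147.8)
print(f"TNF = {santa_clara.tnf:.1f}M")
print(f"MM  = {santa_clara.mm:.1f}M")
print(f"HI  = {santa_clara.hi:.3f}")
```

Because HI is a ratio of two meal counts, it can be compared across years and counties of different sizes, which is the stated purpose of the index.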
Findings and Implications
Analysis
– Compare against previous year
– Look for major shifts in components
– Trends
Collateral benefits
– Understanding of need
• Who, where, when
– Understanding of Food Assistance
• Who, where, when
– Use of data in other contexts
– How is the population, demographics and economics changing over time
Findings and Implications
How many households are vulnerable and
how much food do they need to be healthy?
Year     Households   Meals Needed
2010        173,000   564 million
2011        185,000   596 million
Growth           7%   5.7%
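The growth row is a year-over-year percentage change. A short sketch that recomputes it from the two rows above (the helper name `pct_growth` is illustrative):

```python
def pct_growth(old, new):
    """Percentage change from `old` to `new`."""
    return 100 * (new - old) / old


households_growth = pct_growth(173_000, 185_000)
meals_growth = pct_growth(564, 596)  # millions of meals

print(f"Vulnerable households: {households_growth:.1f}% growth")
print(f"Meals needed:          {meals_growth:.1f}% growth")
```

The household figure comes out just under 7%, which the slide rounds to 7%.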
Findings and Implications
[Chart: Santa Clara County 2011, 596 Million Meals, 185,000 households. Segments (millions of meals): Purchased 300; Food Assistance 148.8; Missing Meals 147.8]
[Chart: Food Assistance in Santa Clara 2011. CalFresh 55%; Food Bank 17%; School meals 14%; WIC 10%; Other 4%]
Total Food Assistance: 149 million meals
Santa Clara County Hunger Index
[Chart: Food Assistance Provided and Missing Meals, 2009, 2010, 2011 (axis 0 to 350). Data labels: 109.5, 136.6, 147.8 and 110.4, 137.1, 148.8]
Santa Clara County Hunger Index 2011
• Hunger Index indicates agencies still struggling to
catch up.
• Vulnerable households increased by more than
7% and need grew by over 8%
• Food Assistance grew by just over 8%.
• Most growth: CalFresh and WIC
• 149 million meals missing last year – enough to
feed 136,000 people for one year, more than the
population of Santa Clara.
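The bullets above can be checked against the chart values for 2009 to 2011. The sketch below assumes the first row of chart labels (109.5, 136.6, 147.8) is Food Assistance Provided and the second (110.4, 137.1, 148.8) is Missing Meals, following the legend order; that mapping is an inference from the extracted slide, and all names are illustrative.

```python
MEALS_PER_PERSON_PER_YEAR = 1095  # 3 meals a day, from the TMR methodology

# year -> (Food Assistance Provided, Missing Meals), in millions of meals.
# Series assignment is assumed from the chart legend order.
CHART = {
    2009: (109.5, 110.4),
    2010: (136.6, 137.1),
    2011: (147.8, 148.8),
}


def pct_growth(old, new):
    """Percentage change from `old` to `new`."""
    return 100 * (new - old) / old


# Total need is assistance plus missing meals, so HI = MM / (FAP + MM).
hi_by_year = {year: mm / (fap + mm) for year, (fap, mm) in CHART.items()}

fap_growth = pct_growth(CHART[2010][0], CHART[2011][0])
need_growth = pct_growth(sum(CHART[2010]), sum(CHART[2011]))
people_fed = 149_000_000 / MEALS_PER_PERSON_PER_YEAR

for year, hi in hi_by_year.items():
    print(f"{year}: HI = {hi:.3f}")
print(f"Food assistance growth 2010 to 2011: {fap_growth:.1f}%")
print(f"Need (TNF) growth 2010 to 2011:      {need_growth:.1f}%")
print(f"149 million meals feed about {people_fed:,.0f} people for a year")
```

Under that assumption the index stays near 0.50 in all three years: assistance grew by about 8%, but need grew slightly faster, which is why the agencies are described as still catching up.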
What does the Hunger Index tell us?
• Households are spending less on food and
using more food assistance
• It will be a challenge for food assistance
programs to keep up
• We need to continue to work together to make
a difference
Data Reuse by Nonprofits
• Big Data & Open Data Trends
• Open Data Concerns
• Case Study: Balkans Data Academy
• Case Studies: Digital Humanitarians
• Data Science and Machine Learning
• Case Study: Hunger Index
• Sustainability of Open Data Initiatives
Sustainability of Open Data Initiatives
• Sustainability through Collaboration
• Collaboration Panel – share Successes
• Q&A on Open Data opportunities to Panel
• Questions from #15NTCreusedata


Editor's Notes

  • #6: “The Father of Information Theory”
  • #10: A federal bill for collecting information on shootings, shooting deaths, and deaths in police custody passed in 2000 and expired in 2006 without collecting valuable information. The bill was re-passed in 2014, but there are still concerns about sustainability.
  • #14: End video at 1:01 with spoken term “House of Peoples”
  • #15: End video at 1:01 with spoken term “House of Peoples”