SlideShare a Scribd company logo
Disaster Data Informatics
for Situation Awareness
                Ashutosh Jadhav
               ashutosh@knoesis.org


   Ohio Center of Excellence in Knowledge Enabled
                Computing (Kno.e.sis)
         Wright State University, Dayton, OH
Research Problem
Disaster Data Informatics for
Situation Awareness
Expedite decision making process in the disaster situation by
identifying useful/actionable information from social media

1. Informativeness Analysis
   a. Identify information rich tweet messages (filtering noisy tweets)
       based on variety of analysis

2. Classifying information rich messages
   a. People at the disaster site, suffering people asking for help
   b. Global response about the disaster (opinions, comments, news
      etc.)

3. Expedite decision making process and situational awareness
   a. Considering (2.a) understand needs at disaster site
   b. Make connection resource-->needs
Informativeness Analysis
Motivation: Information
Overload
●

●   5,500 tweets per seconds during japanese earthquake and tsunami

***Within a minute of the quake, there were more than
40,000 earthquake-related Tweets. The micro-blogging site
said it hit about 5,500 Tweets per second on the quake......
             -The New York Times

     How to find useful and actionable
information quickly from such huge stream
         of incoming event data?
Multidimensional
 Event Analysis
Data generated at the             Data generated
    Dimensions                disaster location              around the world
   MultidimensionalNGO
            Affected people, data
Who generates the data?
(People)    volunteers
                                                            People not directly
                                                            involved in the diaster

                          Reports about                     -Opinions, concerns,
                            - current situation,            sympathy, desire for help
What data is                - needs for resources,
generated?                  - medical & other               -Sharing of related news,
(Content)                 emergencies                       blogs and other
                          - complains etc.                  multimedia


                          - Social media (Twitter, FB)      Majorly through social
How the data is                                             media (Twitter, Facebook,
                          - SMS and Web reports to
generated?                                                  blogs, etc)
                          involved NGOs and
(Network)
                          government organization

                          - Seeking for help                Sharing personal view-
Why data is generated?
(Intention)               - Inform current situation, needs points on the disaster
                          etc                               related incidents

When data is generated    After the disaster, in recovery   Mostly after the disaster
(Time)                    and rebuild phase
Research problem

                How can we identify
   useful/ informative (actionable) information
                 that can be used to
expedite decision making & situational awareness
             in the disaster situation?
Approach
Informativeness Analysis
- Definition
● Useful/actionable information in the disaster situation
  that can help for better and faster situation awareness
Examples messages
We need tent, cover, rice. Uneted Nation never Help us since the
earthquake, we live in Carre-four, Lapot street,

if women and children are victim of rape or other agressions in provisionnal
shelter, what number can we call to have fast assistance.

We are still under the sheets. We do not have: Tents, prelates, sanitary
articles and household etc. Bastien the city Alix fontamara 27

we don't have some water in the delmas camp 40b

We need tent indelmas 18 because we don't find nothing in the area.

How can we find help and food in fontamara 43 rue menos

A father, whose wife passed away, and has two children who need medical
attention. One child has a broken arm, and he is afraid of infection
Data generated at the                       Data generated
  Dimensions              disaster location                        around the world
   Multidimensional data
Who generates the
data? (People)
                    Affected people, NGO volunteers               People not directly involved in the
                                                                  disaster

                                                                  -Opinions, concerns, sympathy,
                    Reports about                                 desire for help

What data is          - current situation,                        -Sharing of related news, blogs
generated?            - needs for resources,                      and other multimedia
                      - medical & other
(Content)
                    emergencies
                    - complains etc.
                    - Social media (Twitter, FB)                  Majorly through social media
How the data is                                                   (Twitter, Facebook, blogs, etc)
generated?          - SMS and Web reports to involved NGOs
(Network)           and government organization


Why data is         - Seeking for help                            Sharing personal view-points on
                    - Inform current situation, needs etc         the disaster related incidents
generated?
(Intention)
                    After the disaster, in recovery and rebuild   Mostly after the disaster
When data is        phase
generated
(Time)
Data set
●   Social Networking Messages
    ○   Twitter, Facebook

●   News articles
    ○   News websites, external links from tweets, FB status

●   NGO messages
    ○   Ushahidi messages/reports

●   Mobile messages
    ○   SMS
Informativeness Analysis
                    ●   Structure and syntactic analysis
                    ●   Linguistic analysis
Content Analysis    ●   Text analysis
                    ●   Metadata Analysis
                    ●   Author profile description
                    ●   Social connectivity
 People Analysis    ●   Activity level
                    ●   Author credibility/influence
                    ●   Content analysis
                    ●   Social share analysis
 News Analysis      ●   URL credibility
                    ●   Alexa analysis
                    ●   Content annotation using disaster domain model
                        considering:
Semantic Analysis        entities mentioned, needs, resources, location,
                        organizations, people, disaster type etc.
Content Analysis
● Structure and syntactic analysis
  ○ Message length
  ○ Number of words, special characters, slags, dictionary words

● Linguistic analysis
  ○ Number of nouns, verbs, adverbs, adjective
  ○ POS patterns

● Text analysis
  ○ N-gram analysis
  ○ TF_IDF statistics
  ○ Entities (dbpedia/ontology)

● Metadata analysis
  ○ Publish time
  ○ Location (explicit and implicit)
People Analysis
● Author profile description
  ○ Profession
  ○ Demographic information (age, gender, location)

● Social connectivity
  ○ Number of follow-followers

● Activity level
  ○ Number of tweets
  ○ Number of tweets "on topic"

● Author credibility/influence
  ○ Klout
  ○ SocialMatica
  ○ Peer index
News Analysis
●   News and other event related stories are generally linked in many
    of the event related messages (tweets, etc.) primarily
     ○ Message size limitation (140 characters for Twitter)
     ○ Bringin external authoritative context


●   Analyzing news and other event related stories plays a crucial role in
    event analysis

       Many news stories about the event
       ■ which news stories to focus on?
       ■ how to extract useful and actionable information
         nuggets from these news stories ?
News Analysis
                              - Structure and syntactic analysis
   Content Analysis           - Linguistic analysis
                              - Text analysis
                              - Metadata Analysis

                              - Number tweets, retweets
                              - Facebook share, like, comments,
Social share analysis         recommendations
                              - Google plus, LinkedIn shares

                              - Google page rank
    URL credibility
                              - Local credibility (?)

    Alexa analysis            - Alexa global and country rank
(Alexa is a web information   - Alexa url authority
         company)             - Alexa url & subdomain mozRank
                              - Alexa page & domain authority
Semantic Analysis
●   Content annotation using disaster domain model
    considering variety of entities mentioned (DBPedia)
     ○ needs, resources, location, organizations, people,
       disaster type etc.
Semantic Disaster Model***
Reuse/ (formalise and build) disaster domain model considering:
                Earthquake, floods, terror attack (disaster type will help us
Disaster type
                for better understanding of needs)

                Model of basic human needs needs in disasters like food,
    Needs
                water, medicines, shelter, etc

                Model of resources which can satisfy some need like need:
 Resources      thirsty -> resource: water, fruit juice, need: hungry ->
                resource: food etc.

  Location      Location of incidents, geo-location data

Organization    Involved government and non-government organizations

                Model of people base on gender, age group, role (mother,
   People &     father, son, etc.) (This can be help in
  social role   understanding/reasoning needs like if there is mention of
                mother and baby then need may be milk)
QCRI Internship Proposal

More Related Content

PDF
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
PPTX
Understanding Online Socials Harm: Examples of Harassment and Radicalization
PPTX
Processing Social Media Messages in Mass Emergency: A Survey
PDF
14 mejri new configurationict-indisasters-ws2014
PPTX
Computational Social Science as the Ultimate Web Intelligence
PDF
Public Health Crisis Analytics for Gender Violence
PPTX
Twitris in Action - a review of its many applications
PDF
Mantelero collective privacy in3_def
Humanitarian Diplomacy in the Digital Age: Analysis and use of digital inform...
Understanding Online Socials Harm: Examples of Harassment and Radicalization
Processing Social Media Messages in Mass Emergency: A Survey
14 mejri new configurationict-indisasters-ws2014
Computational Social Science as the Ultimate Web Intelligence
Public Health Crisis Analytics for Gender Violence
Twitris in Action - a review of its many applications
Mantelero collective privacy in3_def

Viewers also liked (20)

PPT
Disaster Management
PPT
PPTX
Losses Resulting from a Disaster
ODP
OpenStreetMap Response - 2015 Nepal Earthquake
PPTX
Composites , Fire Extinguisher with tire IDM10
PPTX
Earthquake Safety Video
PDF
Engaging Students in Disaster Prevention Education
PPTX
EARTHQUAKE Presentation [TIPS 2]
PPTX
Earthquake Preparedness Infographic
PPT
Bam Earthquake
PPTX
Disaster management ppt
PPT
What to do before,during & after disasters
PPTX
Natural disaster presentation
PDF
Family plan
PPTX
Earthquake Presentation
PPSX
PPT on 10th disaster management
PPTX
Disaster management
PPT
Disaster Management
PPTX
Disaster ppt
PPTX
Natural disaster powerpoint
Disaster Management
Losses Resulting from a Disaster
OpenStreetMap Response - 2015 Nepal Earthquake
Composites , Fire Extinguisher with tire IDM10
Earthquake Safety Video
Engaging Students in Disaster Prevention Education
EARTHQUAKE Presentation [TIPS 2]
Earthquake Preparedness Infographic
Bam Earthquake
Disaster management ppt
What to do before,during & after disasters
Natural disaster presentation
Family plan
Earthquake Presentation
PPT on 10th disaster management
Disaster management
Disaster Management
Disaster ppt
Natural disaster powerpoint
Ad

Similar to Disaster data informatics for situation awareness (20)

PDF
What to Expect When the Unexpected Happens: Social Media Communications Acros...
PPTX
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
PDF
Crisis Information Management: A Primer
PDF
Crowd Based Information Management
PDF
Crisis Information Management: A Primer
PPTX
Ocha social media trends challenges
PPTX
Open analytics social media framework
PPT
Smile presentation
PPTX
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
PPTX
role of social media.
KEY
Swift Update May 6
KEY
2nd Annual Geospatial Conference
DOC
Semantic Social Mashup approach for Designing Citizen Diplomacy
DOCX
Disaster Strikes. Social Media Responds. Helpful Resources
PDF
Big Data from Social Media and Crowdsourcing in Emergencies
KEY
Final Draft - Sloan Presentation
PDF
Information System to Assist Survivors of Disasters
PDF
The New Cartographers - Liu and Palen
DOCX
The case for integrating crisis response with social media
PDF
Situational Awareness Through Social Media
What to Expect When the Unexpected Happens: Social Media Communications Acros...
Crisis Mapping, Citizen Sensing and Social Media Analytics: Leveraging Citize...
Crisis Information Management: A Primer
Crowd Based Information Management
Crisis Information Management: A Primer
Ocha social media trends challenges
Open analytics social media framework
Smile presentation
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
role of social media.
Swift Update May 6
2nd Annual Geospatial Conference
Semantic Social Mashup approach for Designing Citizen Diplomacy
Disaster Strikes. Social Media Responds. Helpful Resources
Big Data from Social Media and Crowdsourcing in Emergencies
Final Draft - Sloan Presentation
Information System to Assist Survivors of Disasters
The New Cartographers - Liu and Palen
The case for integrating crisis response with social media
Situational Awareness Through Social Media
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Modernizing your data center with Dell and AMD
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Empathic Computing: Creating Shared Understanding
PDF
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
Modernizing your data center with Dell and AMD
MYSQL Presentation for SQL database connectivity
Chapter 3 Spatial Domain Image Processing.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Review of recent advances in non-invasive hemoglobin estimation
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
The Rise and Fall of 3GPP – Time for a Sabbatical?
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
“AI and Expert System Decision Support & Business Intelligence Systems”
Understanding_Digital_Forensics_Presentation.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Empathic Computing: Creating Shared Understanding
Spectral efficient network and resource selection model in 5G networks

Disaster data informatics for situation awareness

  • 1. Disaster Data Informatics for Situation Awareness Ashutosh Jadhav ashutosh@knoesis.org Ohio Center of Excellence in Knowledge Enabled Computing (Kno.e.sis) Wright State University, Dayton, OH
  • 3. Disaster Data Informatics for Situation Awareness Expedite decision making process in the disaster situation by identifying useful/actionable information from social media 1. Informativeness Analysis a. Identify information rich tweet messages (filtering noisy tweets) based on variety of analysis 2. Classifying information rich messages a. People at the disaster site, suffering people asking for help b. Global response about the disaster (opinions, comments, news etc.) 3. Expedite decision making process and situational awareness a. Considering (2.a) understand needs at disaster site b. Make connection resource-->needs
  • 5. Motivation: Information Overload ● ● 5,500 tweets per seconds during japanese earthquake and tsunami ***Within a minute of the quake, there were more than 40,000 earthquake-related Tweets. The micro-blogging site said it hit about 5,500 Tweets per second on the quake...... -The New York Times How to find useful and actionable information quickly from such huge stream of incoming event data?
  • 7. Data generated at the Data generated Dimensions disaster location around the world MultidimensionalNGO Affected people, data Who generates the data? (People) volunteers People not directly involved in the diaster Reports about -Opinions, concerns, - current situation, sympathy, desire for help What data is - needs for resources, generated? - medical & other -Sharing of related news, (Content) emergencies blogs and other - complains etc. multimedia - Social media (Twitter, FB) Majorly through social How the data is media (Twitter, Facebook, - SMS and Web reports to generated? blogs, etc) involved NGOs and (Network) government organization - Seeking for help Sharing personal view- Why data is generated? (Intention) - Inform current situation, needs points on the disaster etc related incidents When data is generated After the disaster, in recovery Mostly after the disaster (Time) and rebuild phase
  • 8. Research problem How can we identify useful/ informative (actionable) information that can be used to expedite decision making & situational awareness in the disaster situation?
  • 10. Informativeness Analysis - Definition ● Useful/actionable information in the disaster situation that can help for better and faster situation awareness
  • 11. Examples messages We need tent, cover, rice. Uneted Nation never Help us since the earthquake, we live in Carre-four, Lapot street, if women and children are victim of rape or other agressions in provisionnal shelter, what number can we call to have fast assistance. We are still under the sheets. We do not have: Tents, prelates, sanitary articles and household etc. Bastien the city Alix fontamara 27 we don't have some water in the delmas camp 40b We need tent indelmas 18 because we don't find nothing in the area. How can we find help and food in fontamara 43 rue menos A father, whose wife passed away, and has two children who need medical attention. One child has a broken arm, and he is afraid of infection
  • 12. Data generated at the Data generated Dimensions disaster location around the world Multidimensional data Who generates the data? (People) Affected people, NGO volunteers People not directly involved in the disaster -Opinions, concerns, sympathy, Reports about desire for help What data is - current situation, -Sharing of related news, blogs generated? - needs for resources, and other multimedia - medical & other (Content) emergencies - complains etc. - Social media (Twitter, FB) Majorly through social media How the data is (Twitter, Facebook, blogs, etc) generated? - SMS and Web reports to involved NGOs (Network) and government organization Why data is - Seeking for help Sharing personal view-points on - Inform current situation, needs etc the disaster related incidents generated? (Intention) After the disaster, in recovery and rebuild Mostly after the disaster When data is phase generated (Time)
  • 13. Data set ● Social Networking Messages ○ Twitter, Facebook ● News articles ○ News websites, external links from tweets, FB status ● NGO messages ○ Ushahidi messages/reports ● Mobile messages ○ SMS
  • 14. Informativeness Analysis ● Structure and syntactic analysis ● Linguistic analysis Content Analysis ● Text analysis ● Metadata Analysis ● Author profile description ● Social connectivity People Analysis ● Activity level ● Author credibility/influence ● Content analysis ● Social share analysis News Analysis ● URL credibility ● Alexa analysis ● Content annotation using disaster domain model considering: Semantic Analysis entities mentioned, needs, resources, location, organizations, people, disaster type etc.
  • 15. Content Analysis ● Structure and syntactic analysis ○ Message length ○ Number of words, special characters, slags, dictionary words ● Linguistic analysis ○ Number of nouns, verbs, adverbs, adjective ○ POS patterns ● Text analysis ○ N-gram analysis ○ TF_IDF statistics ○ Entities (dbpedia/ontology) ● Metadata analysis ○ Publish time ○ Location (explicit and implicit)
  • 16. People Analysis ● Author profile description ○ Profession ○ Demographic information (age, gender, location) ● Social connectivity ○ Number of follow-followers ● Activity level ○ Number of tweets ○ Number of tweets "on topic" ● Author credibility/influence ○ Klout ○ SocialMatica ○ Peer index
  • 17. News Analysis ● News and other event related stories are generally linked in many of the event related messages (tweets, etc.) primarily ○ Message size limitation (140 characters for Twitter) ○ Bringin external authoritative context ● Analyzing news and other event related stories plays a crucial role in event analysis Many news stories about the event ■ which news stories to focus on? ■ how to extract useful and actionable information nuggets from these news stories ?
  • 18. News Analysis - Structure and syntactic analysis Content Analysis - Linguistic analysis - Text analysis - Metadata Analysis - Number tweets, retweets - Facebook share, like, comments, Social share analysis recommendations - Google plus, LinkedIn shares - Google page rank URL credibility - Local credibility (?) Alexa analysis - Alexa global and country rank (Alexa is a web information - Alexa url authority company) - Alexa url & subdomain mozRank - Alexa page & domain authority
  • 19. Semantic Analysis ● Content annotation using disaster domain model considering variety of entities mentioned (DBPedia) ○ needs, resources, location, organizations, people, disaster type etc.
  • 20. Semantic Disaster Model*** Reuse/ (formalise and build) disaster domain model considering: Earthquake, floods, terror attack (disaster type will help us Disaster type for better understanding of needs) Model of basic human needs needs in disasters like food, Needs water, medicines, shelter, etc Model of resources which can satisfy some need like need: Resources thirsty -> resource: water, fruit juice, need: hungry -> resource: food etc. Location Location of incidents, geo-location data Organization Involved government and non-government organizations Model of people base on gender, age group, role (mother, People & father, son, etc.) (This can be help in social role understanding/reasoning needs like if there is mention of mother and baby then need may be milk)