SlideShare a Scribd company logo
Exploring Xenophobic Events through
GDELT Data Analysis
Modeling and Simulation Student Capstone Conference 2023
Track: Data Science
Himarsha R. Jayanetti, Erika Frydenlund, Michele C. Weigle
Presented By:
Himarsha R. Jayanetti
Department of Computer Science, ODU
Web Science & Digital Libraries Research Group
@HimarshaJ @WebSciDL @oducs
1
Global Migration: Seeking Opportunities and/or Escaping Conflicts
2
https://unis.unvienna.org/unis/en/topics/refugees-migration.html
International Organization like UNHCR and IOM: Safeguarding Migration and
Fighting Xenophobia
3
https://www.iom.int/
https://www.unhcr.org/us/
The Need for a Global Tool to Track Xenophobic Attacks on Migrants
4
https://acleddata.com/dashboard/#/dashboard
https://www.xenowatch.ac.za/
We Used the GDELT 2.0 Database to Obtain the Data
5
https://www.gdeltproject.org/
Database Updates every
15 minutes and translates
articles from around the
world from 65 different
languages into English.
A massive dataset of
online and TV and news
reporting.
We Identified Three Key Tables in the GDELT Database
● Event table contains data
about events happening
globally.
● Event Mentions table each
mention of the event in a
news article or other
source.
● Global Knowledge Graph
(GKG) form an extensive
interconnected network
(events, context, actors,
sentiment of the article,
themes)
6
Event
Event Mentions
Global Knowledge
Graph (GKG)
Relationship Among the Three Database Tables: Events, Event Mentions, and
GKG
7
MentionIdentifier field to
connect the Mentions
table with the GKG
table.
A One-to-Many relationship
between the GLOBALEVENTID
in the Event table and the
Event Mentions table.
We Used Google BigQuery to Collect Data
8
https://cloud.google.com/bigquery
GDELT users can access
the data using BigQuery,
which uses SQL-like
queries.
GDELT database is a large,
open-source database and
is supported by Google.
A simple query to get 10
GLOBALEVENTIDs of
event data from the
Event table
We Used Two Separate Data Collection Criteria
Criteria 1: Events where Actor2Code is REF.
Criteria 2: Events where Actor2Code is REF AND has Refugee
related GKG themes
Eight GKG themes relevant to refugees:
• DISCRIMINATION_IMMIGRATION_XENOPHOBIA
• DISCRIMINATION_IMMIGRATION_ANTIIMMIGRANTS
• DISCRIMINATION_IMMIGRATION_OPPOSED_TO_IMMIGRANTS
• DISCRIMINATION_IMMIGRATION_AGAINST_IMMIGRANTS
• DISCRIMINATION_IMMIGRATION_ATTACKS_ON_IMMIGRANTS
• DISCRIMINATION_IMMIGRATION_ATTACKS_AGAINST_IMMIGRANTS
• DISCRIMINATION_IMMIGRATION_XENOPHOBE
• DISCRIMINATION_IMMIGRATION_XENOPHOBES 9
Actor2Code is REF is
indicative of Actor 1
performing the act
on ‘refugee’.
These eight themes
were named as the
“GKGthemes_REF”.
We Conducted Two Case Studies
Case Study 1: Alan Kurdi Incident.
● September 2015
● Alan and his family (refugees from Syria)
● Travel to Europe from Turkey
● Lost life by drowning in the Mediterranean Sea
Case Study 2: Spike in Number of Articles in March 2021.
● A surge in the number of articles in March 2021.
● 1st hypothesis: due to spa shooting in Atlanta, GA.
● GDELT API to gain insight into the data before
querying for the data itself.
10
We downloaded the data
six months before and
after the incident (March
2015 to March 2016)
where Actor2Code is REF.
We downloaded the data
for March of 2021 where
Actor2Code is REF and the
theme was one in the
“GKGthemes_REF” set.
GDELT DOC 2.0 API To Query the GDELT Database
11
https://api.gdeltproject.org/api/v2/doc/doc?query=xenophobia
&mode=artlist&maxrecords=100&timespan=1week
● Full text search API
● Search back to Jan 2017
● Search across 65 languages
● Instant Visualizations
● Support JSON output
https:/blog.gdeltproject.org/gdelt-doc-2-0-api-debuts/
Queried on April 18, 2023
We Used the GDELT Doc API Python Client to Fetch Data From the GDELT API
12
https://github.com/alex9smith/gdelt-doc-api
1. Article search
● A list of news articles
that match the filters.
1. Timeline
● A timeline of the
volume of news
coverage matching
the filters (number of
articles and a total).
Percentage of Number of Articles with Refugee Related Themes From January
2017 to December 2022
13
Case Study 1
Alan Kurdi Incident
14
Alan Kurdi's Tragic Death Sparks Global Media Attention on Refugee Crisis
15
AvgTone of the News Articles Remained Consistently Negative Over Time
16
Case Study 2
Spike in Number of Articles around March 2021
17
Number of Articles Over Time for March, 2021 Data
18
Top 20 Most Prevalent Actor1CountryCode
19
Choropleth Map to Visualize the Location-Based Data
20
Future Work
● Develop a dashboard to monitor xenophobic violence against refugees and
migrants.
● Developing impactful visualizations that can aid us in addressing the
following:
○ Identify xenophobic “hotspots”
○ Identify when xenophobic outbreaks are escalating
○ Triggers of Xenophobic Violence
21
Key Takeaways
22
● Understanding xenophobic events
● GDELT database
● BigQuery and GDELT APIs to access GDELT data
● Case Study 1
○ The period surrounding the death of Alan Kurdi
○ Increase in media attention
○ Mostly negative sentiment
○ A shift in the range of emotions
● Case Study 2
○ March 2021 spike
○ Increase of African migrants on the Canary Islands
○ A choropleth map for the location-based analysis.

More Related Content

PDF
The Grassroots Covid-19 Resilience
PPTX
Classifying Crisis Information Relevancy with Semantics (ESWC 2018)
PDF
Multipleregression covidmobility and Covid-19 policy recommendation
PDF
Using Data for Science Journalism
PDF
Using Data for Science Journalism
PDF
#NISWAW Session 2
PDF
Natural Language Engineering in the Golden Age of Artificial Intelligence
PPTX
Transforming Social Big Data into Timely Decisions and Actions for Crisis Mi...
The Grassroots Covid-19 Resilience
Classifying Crisis Information Relevancy with Semantics (ESWC 2018)
Multipleregression covidmobility and Covid-19 policy recommendation
Using Data for Science Journalism
Using Data for Science Journalism
#NISWAW Session 2
Natural Language Engineering in the Golden Age of Artificial Intelligence
Transforming Social Big Data into Timely Decisions and Actions for Crisis Mi...

Similar to Exploring Xenophobic Events through GDELT Data Analysis (20)

PPTX
understanding the pandemic through mining covid news using natural language p...
PDF
UROP Poster
PDF
Estimating migrant stocks and flows using social media data
PDF
#migrantsfiles international
PDF
WG-misinfosec report out to CredCo.pdf
PDF
Progressive ethics in the digital age
PDF
Pandexit presentation
PDF
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
PPTX
[DSC Croatia 22] Experience in collaboration between academia and industry: N...
PPTX
From Research to Applications: What Can We Extract with Social Media Sensing?
PPT
2006 multinational intelligence (centcom ccc)
DOCX
MASINT and Global War on Terror
PDF
Kostiantyn Korsun - State Cybersecurity vs. Cybersecurity of the State. #FRD ...
PPTX
ICCM 2014 -- Ignite Talks -- Session 2
DOCX
Running head BUSINESS CASE FOR INVESTMENT OUTLINE1BUSINES.docx
PDF
Keep in touch for cyber peace_20150212
PPTX
Disaster database
PDF
PPTX
Disarm vanguards 2022-02-25 (3)
PDF
Data Activism: data as rhetoric, data as power
understanding the pandemic through mining covid news using natural language p...
UROP Poster
Estimating migrant stocks and flows using social media data
#migrantsfiles international
WG-misinfosec report out to CredCo.pdf
Progressive ethics in the digital age
Pandexit presentation
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
[DSC Croatia 22] Experience in collaboration between academia and industry: N...
From Research to Applications: What Can We Extract with Social Media Sensing?
2006 multinational intelligence (centcom ccc)
MASINT and Global War on Terror
Kostiantyn Korsun - State Cybersecurity vs. Cybersecurity of the State. #FRD ...
ICCM 2014 -- Ignite Talks -- Session 2
Running head BUSINESS CASE FOR INVESTMENT OUTLINE1BUSINES.docx
Keep in touch for cyber peace_20150212
Disaster database
Disarm vanguards 2022-02-25 (3)
Data Activism: data as rhetoric, data as power
Ad

More from Himarsha Jayanetti (6)

PPTX
Infrastructure for Tracking Information Flow from Social Media to U.S. TV New...
PPTX
Evaluating Social Media Reach via Mainstream Media Discourse - CIKM '24 - PhD...
PPTX
Evaluating Social Media Reach via Mainstream Media Discourse
PPTX
Supporting Account-based Queries for Archived Instagram Posts
PDF
Robots Still Outnumber Humans in Web Archives, But Less Than Before
PDF
Creating Structure in Web Archives With Collections: Different Concepts From ...
Infrastructure for Tracking Information Flow from Social Media to U.S. TV New...
Evaluating Social Media Reach via Mainstream Media Discourse - CIKM '24 - PhD...
Evaluating Social Media Reach via Mainstream Media Discourse
Supporting Account-based Queries for Archived Instagram Posts
Robots Still Outnumber Humans in Web Archives, But Less Than Before
Creating Structure in Web Archives With Collections: Different Concepts From ...
Ad

Recently uploaded (20)

PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Leprosy and NLEP programme community medicine
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Introduction to the R Programming Language
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Computer network topology notes for revision
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Business Analytics and business intelligence.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
annual-report-2024-2025 original latest.
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
Reliability_Chapter_ presentation 1221.5784
Clinical guidelines as a resource for EBP(1).pdf
IB Computer Science - Internal Assessment.pptx
Introduction to Knowledge Engineering Part 1
Introduction-to-Cloud-ComputingFinal.pptx
Leprosy and NLEP programme community medicine
[EN] Industrial Machine Downtime Prediction
Introduction to the R Programming Language
Qualitative Qantitative and Mixed Methods.pptx
Computer network topology notes for revision
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Quality review (1)_presentation of this 21
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Business Analytics and business intelligence.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
annual-report-2024-2025 original latest.

Exploring Xenophobic Events through GDELT Data Analysis

  • 1. Exploring Xenophobic Events through GDELT Data Analysis Modeling and Simulation Student Capstone Conference 2023 Track: Data Science Himarsha R. Jayanetti, Erika Frydenlund, Michele C. Weigle Presented By: Himarsha R. Jayanetti Department of Computer Science, ODU Web Science & Digital Libraries Research Group @HimarshaJ @WebSciDL @oducs 1
  • 2. Global Migration: Seeking Opportunities and/or Escaping Conflicts 2 https://unis.unvienna.org/unis/en/topics/refugees-migration.html
  • 3. International Organization like UNHCR and IOM: Safeguarding Migration and Fighting Xenophobia 3 https://www.iom.int/ https://www.unhcr.org/us/
  • 4. The Need for a Global Tool to Track Xenophobic Attacks on Migrants 4 https://acleddata.com/dashboard/#/dashboard https://www.xenowatch.ac.za/
  • 5. We Used the GDELT 2.0 Database to Obtain the Data 5 https://www.gdeltproject.org/ Database Updates every 15 minutes and translates articles from around the world from 65 different languages into English. A massive dataset of online and TV and news reporting.
  • 6. We Identified Three Key Tables in the GDELT Database ● Event table contains data about events happening globally. ● Event Mentions table each mention of the event in a news article or other source. ● Global Knowledge Graph (GKG) form an extensive interconnected network (events, context, actors, sentiment of the article, themes) 6 Event Event Mentions Global Knowledge Graph (GKG)
  • 7. Relationship Among the Three Database Tables: Events, Event Mentions, and GKG 7 MentionIdentifier field to connect the Mentions table with the GKG table. A One-to-Many relationship between the GLOBALEVENTID in the Event table and the Event Mentions table.
  • 8. We Used Google BigQuery to Collect Data 8 https://cloud.google.com/bigquery GDELT users can access the data using BigQuery, which uses SQL-like queries. GDELT database is a large, open-source database and is supported by Google. A simple query to get 10 GLOBALEVENTIDs of event data from the Event table
  • 9. We Used Two Separate Data Collection Criteria Criteria 1: Events where Actor2Code is REF. Criteria 2: Events where Actor2Code is REF AND has Refugee related GKG themes Eight GKG themes relevant to refugees: • DISCRIMINATION_IMMIGRATION_XENOPHOBIA • DISCRIMINATION_IMMIGRATION_ANTIIMMIGRANTS • DISCRIMINATION_IMMIGRATION_OPPOSED_TO_IMMIGRANTS • DISCRIMINATION_IMMIGRATION_AGAINST_IMMIGRANTS • DISCRIMINATION_IMMIGRATION_ATTACKS_ON_IMMIGRANTS • DISCRIMINATION_IMMIGRATION_ATTACKS_AGAINST_IMMIGRANTS • DISCRIMINATION_IMMIGRATION_XENOPHOBE • DISCRIMINATION_IMMIGRATION_XENOPHOBES 9 Actor2Code is REF is indicative of Actor 1 performing the act on ‘refugee’. These eight themes were named as the “GKGthemes_REF”.
  • 10. We Conducted Two Case Studies Case Study 1: Alan Kurdi Incident. ● September 2015 ● Alan and his family (refugees from Syria) ● Travel to Europe from Turkey ● Lost life by drowning in the Mediterranean Sea Case Study 2: Spike in Number of Articles in March 2021. ● A surge in the number of articles in March 2021. ● 1st hypothesis: due to spa shooting in Atlanta, GA. ● GDELT API to gain insight into the data before querying for the data itself. 10 We downloaded the data six months before and after the incident (March 2015 to March 2016) where Actor2Code is REF. We downloaded the data for March of 2021 where Actor2Code is REF and the theme was one in the “GKGthemes_REF” set.
  • 11. GDELT DOC 2.0 API To Query the GDELT Database 11 https://api.gdeltproject.org/api/v2/doc/doc?query=xenophobia &mode=artlist&maxrecords=100&timespan=1week ● Full text search API ● Search back to Jan 2017 ● Search across 65 languages ● Instant Visualizations ● Support JSON output https:/blog.gdeltproject.org/gdelt-doc-2-0-api-debuts/ Queried on April 18, 2023
  • 12. We Used the GDELT Doc API Python Client to Fetch Data From the GDELT API 12 https://github.com/alex9smith/gdelt-doc-api 1. Article search ● A list of news articles that match the filters. 1. Timeline ● A timeline of the volume of news coverage matching the filters (number of articles and a total).
  • 13. Percentage of Number of Articles with Refugee Related Themes From January 2017 to December 2022 13
  • 14. Case Study 1 Alan Kurdi Incident 14
  • 15. Alan Kurdi's Tragic Death Sparks Global Media Attention on Refugee Crisis 15
  • 16. AvgTone of the News Articles Remained Consistently Negative Over Time 16
  • 17. Case Study 2 Spike in Number of Articles around March 2021 17
  • 18. Number of Articles Over Time for March, 2021 Data 18
  • 19. Top 20 Most Prevalent Actor1CountryCode 19
  • 20. Choropleth Map to Visualize the Location-Based Data 20
  • 21. Future Work ● Develop a dashboard to monitor xenophobic violence against refugees and migrants. ● Developing impactful visualizations that can aid us in addressing the following: ○ Identify xenophobic “hotspots” ○ Identify when xenophobic outbreaks are escalating ○ Triggers of Xenophobic Violence 21
  • 22. Key Takeaways 22 ● Understanding xenophobic events ● GDELT database ● BigQuery and GDELT APIs to access GDELT data ● Case Study 1 ○ The period surrounding the death of Alan Kurdi ○ Increase in media attention ○ Mostly negative sentiment ○ A shift in the range of emotions ● Case Study 2 ○ March 2021 spike ○ Increase of African migrants on the Canary Islands ○ A choropleth map for the location-based analysis.