SlideShare a Scribd company logo
HELP International Project
Humanitarian Aid to Underdeveloped Countries
By: Pikasha Sharma
Problem Statement
 Help “HELP INTERNTIONAL” to categories countries based on socio economic factors such
as child mortality, income, GDP, life expectancy etc., to optimally and strategically allocate
the recent $ 10 million funding that the NGO raised.
 The aim is to find out the countries in most need of aid based on various socio economic
and health factors that will determine the overall development of the country.
Insights from Exploring Data
• From the graph we can see that GDP and Income are highly positively correlated, higher the
income of a country higher is the GDP.
• High income and GDP also leads to higher expenditure on health facilities by people.
• Hence, we can conclude that income and GDP are good measures to indicate the health of a
country.
Insights from Exploring Data
• Countries with high export and import indicates robust domestic and international demand, good
economic strength and sustainable trade surplus and deficit. Thus, they have a high GDP which can
also be seen from the graph.
• Inflation rate is very high in poor countries (countries with low GDP) indicative of an unstable
economy.
Insights from Exploring Data
• Countries having low income have very high fertility (The number of children that would be born to
each woman if the current age-fertility rates remain the same) this maybe due to lack of education
on birth control methods.
• Rich countries have low total fertility.
• Poor countries have high child mortality and low life expectancy.
Solution for Categorizing Countries
 Having figured out how various factors affects the overall economic status of a country we want
to group similar countries together.
 This problem can be handled by using clustering algorithm known as k-Means Clustering. K-
Means Clustering is an iterative algorithm which tries to partition the data set into K pre-defined
distinct and non-overlapping sub groups wherein each data point belongs to only one group.
 Each data point is assigned to a cluster (sub-group) such that the sum of the squared distance
between the data points and the cluster’s centroid (arithmetic mean of all the data points that
belong to that cluster) is at the minimum.
 We will use the features of our data set to find out the subgroups of countries which are similar
to each other. For example countries having
similar income, GDP, child mortality rate
will be clubbed together.
How does the k Algorithm works?
 First we decided on the number of clusters the countries would be divided into suppose that’s k.
 Pick k random points as cluster centers.
 Assign each data points to their closest cluster centers based on Euclidean Distance.
 Update the center of each cluster based on the included observations (Mean of all the
observations)
 Terminate if no observations changed cluster, otherwise go back to step 3.
After performing clustering countries were divided into 4
clusters:-
- cluster 0: very low income and very high mortality rate.
- cluster 1: very high income and low mortality rate.
- cluster 2: low income and high mortality rate.
- cluster 3: high income and low mortality rate.
Analysis of clusters formed
 Cluster 0 has the highest child mortality rate.
 The lowest GDP and It is safe to assume that these countries will also have the lowest income.
 Countries in these clusters also spend minimum on health.
 We can say that our countries of interest lie in cluster 0 , as they have the lowest income, lowest GDP
and the highest child mortality which is a measure of economic development.
Countries in Cluster 0
Countries in need of aid
Congo, Dem.
Rep.
Malawi Madagascar Afghanistan Chad Cote d'Ivoire Pakistan
Liberia Guinea Comoros Gambia Tanzania Ghana Yemen
Burundi Togo Eritrea Kiribati Senegal Zambia Nigeria
Niger Sierra Leone Burkina Faso Benin Lesotho Mauritania Congo, Rep.
Central African
Republic
Rwanda Haiti Timor-Leste Kenya Sudan Angola
Mozambique Guinea-Bissau Uganda Mali Cameroon Lao Namibia
• Most of the countries in cluster 0 belong to the Sub-Saharan Africa region.
• All of these countries have high child mortality, low income, low GDP, low life expectancy and
very high inflation.
Top 5 Countries in dire need of aid
• Child mortality for these
countries is above 85th
percentile.
• Income of these 5 countries
is below the 3rd percentile
mark.
• They also have very low life
expectancy (less than 18th
percentile i.e., 61 years).
Conclusion
 Congo, Liberia, Burundi, Niger, Central African Republic require aid most dearly.
 They all have extremely low income and GDP which is an important measure of economic
status of a country.
 They also have very high child mortality this will be because of lack of basic medical
facilities in these countries.
 These countries also have very high inflation rates with poor import and export.

More Related Content

PPSX
Programming in c
PDF
Discrete Mathematics – Graphs and Trees.pdf
PPTX
String matching algorithms(knuth morris-pratt)
PDF
CBSE Class 12 Computer Science(083) Sample Question Paper 2020-21
PPT
Lecture_3_k-mean-clustering.ppt
PPTX
Longest Common Subsequence
PPT
Simple Java Programs
PPTX
K means clustering algorithm
Programming in c
Discrete Mathematics – Graphs and Trees.pdf
String matching algorithms(knuth morris-pratt)
CBSE Class 12 Computer Science(083) Sample Question Paper 2020-21
Lecture_3_k-mean-clustering.ppt
Longest Common Subsequence
Simple Java Programs
K means clustering algorithm

Similar to Help international clustering project (20)

PDF
Clustering of world countries on socio economic factors
PPT
Ss 11 chap 12 living standards
PDF
Clustering and pca assignment -1.pdf
PPTX
economicdevelopmentlecture5-170318205911.pptx
PPTX
Todaro12e_PPT_CH02_AS.pptx
PPTX
LECTURE ONE DE-II PPT.pptx
PPTX
Problems of the third world countries
PPT
Lecture 6-Population Growth and Development.ppt
PPTX
On Sustainable Development Goals and Inclusion in Africa
PPT
8.3 standard of living
PPT
Economic problems of development
PDF
Economic development as equalization of opportunities
PPTX
Concept of Underdevelopment and development.pptx
PPTX
Human geography review
PPTX
Global Cooperation
PPTX
Geography test population
PDF
How demographic change affects development
PPTX
Class Presentation on Economic development, inequality and foreign aid.pptx
PPT
Development
PPTX
Chapter-2 Characterstics of UDs.pptx
Clustering of world countries on socio economic factors
Ss 11 chap 12 living standards
Clustering and pca assignment -1.pdf
economicdevelopmentlecture5-170318205911.pptx
Todaro12e_PPT_CH02_AS.pptx
LECTURE ONE DE-II PPT.pptx
Problems of the third world countries
Lecture 6-Population Growth and Development.ppt
On Sustainable Development Goals and Inclusion in Africa
8.3 standard of living
Economic problems of development
Economic development as equalization of opportunities
Concept of Underdevelopment and development.pptx
Human geography review
Global Cooperation
Geography test population
How demographic change affects development
Class Presentation on Economic development, inequality and foreign aid.pptx
Development
Chapter-2 Characterstics of UDs.pptx
Ad

Recently uploaded (20)

PPTX
Managing Community Partner Relationships
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
Global Data and Analytics Market Outlook Report
PDF
Introduction to the R Programming Language
PPT
DU, AIS, Big Data and Data Analytics.ppt
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPT
Predictive modeling basics in data cleaning process
DOCX
Factor Analysis Word Document Presentation
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
modul_python (1).pptx for professional and student
PDF
Transcultural that can help you someday.
PPTX
Business_Capability_Map_Collection__pptx
PDF
Introduction to Data Science and Data Analysis
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
Managing Community Partner Relationships
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Global Data and Analytics Market Outlook Report
Introduction to the R Programming Language
DU, AIS, Big Data and Data Analytics.ppt
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
SAP 2 completion done . PRESENTATION.pptx
Predictive modeling basics in data cleaning process
Factor Analysis Word Document Presentation
A Complete Guide to Streamlining Business Processes
STERILIZATION AND DISINFECTION-1.ppthhhbx
modul_python (1).pptx for professional and student
Transcultural that can help you someday.
Business_Capability_Map_Collection__pptx
Introduction to Data Science and Data Analysis
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
CYBER SECURITY the Next Warefare Tactics
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
Ad

Help international clustering project

  • 1. HELP International Project Humanitarian Aid to Underdeveloped Countries By: Pikasha Sharma
  • 2. Problem Statement  Help “HELP INTERNTIONAL” to categories countries based on socio economic factors such as child mortality, income, GDP, life expectancy etc., to optimally and strategically allocate the recent $ 10 million funding that the NGO raised.  The aim is to find out the countries in most need of aid based on various socio economic and health factors that will determine the overall development of the country.
  • 3. Insights from Exploring Data • From the graph we can see that GDP and Income are highly positively correlated, higher the income of a country higher is the GDP. • High income and GDP also leads to higher expenditure on health facilities by people. • Hence, we can conclude that income and GDP are good measures to indicate the health of a country.
  • 4. Insights from Exploring Data • Countries with high export and import indicates robust domestic and international demand, good economic strength and sustainable trade surplus and deficit. Thus, they have a high GDP which can also be seen from the graph. • Inflation rate is very high in poor countries (countries with low GDP) indicative of an unstable economy.
  • 5. Insights from Exploring Data • Countries having low income have very high fertility (The number of children that would be born to each woman if the current age-fertility rates remain the same) this maybe due to lack of education on birth control methods. • Rich countries have low total fertility. • Poor countries have high child mortality and low life expectancy.
  • 6. Solution for Categorizing Countries  Having figured out how various factors affects the overall economic status of a country we want to group similar countries together.  This problem can be handled by using clustering algorithm known as k-Means Clustering. K- Means Clustering is an iterative algorithm which tries to partition the data set into K pre-defined distinct and non-overlapping sub groups wherein each data point belongs to only one group.  Each data point is assigned to a cluster (sub-group) such that the sum of the squared distance between the data points and the cluster’s centroid (arithmetic mean of all the data points that belong to that cluster) is at the minimum.  We will use the features of our data set to find out the subgroups of countries which are similar to each other. For example countries having similar income, GDP, child mortality rate will be clubbed together.
  • 7. How does the k Algorithm works?  First we decided on the number of clusters the countries would be divided into suppose that’s k.  Pick k random points as cluster centers.  Assign each data points to their closest cluster centers based on Euclidean Distance.  Update the center of each cluster based on the included observations (Mean of all the observations)  Terminate if no observations changed cluster, otherwise go back to step 3. After performing clustering countries were divided into 4 clusters:- - cluster 0: very low income and very high mortality rate. - cluster 1: very high income and low mortality rate. - cluster 2: low income and high mortality rate. - cluster 3: high income and low mortality rate.
  • 8. Analysis of clusters formed  Cluster 0 has the highest child mortality rate.  The lowest GDP and It is safe to assume that these countries will also have the lowest income.  Countries in these clusters also spend minimum on health.  We can say that our countries of interest lie in cluster 0 , as they have the lowest income, lowest GDP and the highest child mortality which is a measure of economic development.
  • 9. Countries in Cluster 0 Countries in need of aid Congo, Dem. Rep. Malawi Madagascar Afghanistan Chad Cote d'Ivoire Pakistan Liberia Guinea Comoros Gambia Tanzania Ghana Yemen Burundi Togo Eritrea Kiribati Senegal Zambia Nigeria Niger Sierra Leone Burkina Faso Benin Lesotho Mauritania Congo, Rep. Central African Republic Rwanda Haiti Timor-Leste Kenya Sudan Angola Mozambique Guinea-Bissau Uganda Mali Cameroon Lao Namibia • Most of the countries in cluster 0 belong to the Sub-Saharan Africa region. • All of these countries have high child mortality, low income, low GDP, low life expectancy and very high inflation.
  • 10. Top 5 Countries in dire need of aid • Child mortality for these countries is above 85th percentile. • Income of these 5 countries is below the 3rd percentile mark. • They also have very low life expectancy (less than 18th percentile i.e., 61 years).
  • 11. Conclusion  Congo, Liberia, Burundi, Niger, Central African Republic require aid most dearly.  They all have extremely low income and GDP which is an important measure of economic status of a country.  They also have very high child mortality this will be because of lack of basic medical facilities in these countries.  These countries also have very high inflation rates with poor import and export.