SlideShare a Scribd company logo
CLUSTERING
ASSIGNMENT
Done By : Akash More
PROBLEM STATEMENT
HELP International is an international humanitarian NGO that is committed
to fighting poverty and providing the people of backward countries with
basic amenities and relief during the time of disasters and natural
calamities. It runs a lot of operational projects from time to time along
with advocacy drives to raise awareness as well as for funding purposes.
After the recent funding programmes, they have been able to raise
around $ 10 million. Now the CEO of the NGO needs to decide how to
use this money strategically and effectively. The significant issues that
come while making this decision are mostly related to choosing the
countries that are in the direst need of aid.
OBJECTIVE
The requisite is:
- To categorise the countries using some socio-economic and health
factors that determine the overall development of the country.
- To suggest the countries which the CEO needs to focus on the
most.
DATA PROCESSING
- It was found that there were no null values
- There were also no duplicate values for country
- There were a few outliers and they were treated later on during
PCA
- The data was standardized for Principal Component Analysis
PCA (SCREEPLOT)
PCA
CLUSTERING
- Both the methods K means and Hierarchical Clustering was used
on the 4 PCA components
- For K means , K= 3 was taken.
- While doing the Hopkins Statistics a value of 0.77 was attained.
[ If the values are:
0.01 - 0.3 : Low chase of clustering
around 0.5 : Random
0.7 - 0.99 : High chance of clustering]
SILHOUETTE ANALYSIS (PEAKING AT 3)
SUM OF SQUARED DISTANCES
CLUSTERING
CLUSTERING
CLUSTERING
CLUSTERING
CLUSTERING
HIERARCHICAL CLUSTERING
CONCLUSION
The countries that require help the most are listed below:
Afghanistan, Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Central African
Republic, Chad, Comoros, Congo, Dem. Rep., Congo, Rep., Cote d'Ivoire, Equatorial Guinea,
Eritrea, Gabon, Gambia, Ghana, Guinea, Guinea-Bissau, Haiti, Iraq, Kenya, Lao, Madagascar,
Malawi, Mali, Mauritania, Mozambique, Namibia, Niger, Pakistan, Rwanda, Senegal, Sierra
Leone, South Africa, Sudan, Tanzania, Timor-Leste, Togo, Uganda, Yemen and Zambia.
These countries have
- very low rate of net income per person, GDP per capita, average number of years a new
born child would live, total health spending and imports of goods and services.
- very high rate of measurement of the annual growth rate, number of children that would be
born and child mortality rate.
It is clear that these countries require very quick aid in terms of money, education and services.

More Related Content

PPTX
Help international clustering project
PDF
Clustering of world countries on socio economic factors
PPTX
Hierarchical and Non Hierarchical Clustering.pptx
PPTX
Locality service planning with geographical information system: Spatial analy...
PPTX
LECTURE ONE DE-II PPT.pptx
PDF
Using survey data to predict poverty in relation to financial service access ...
PDF
dissertation_master
PDF
Does Humanitarian Aid Crowd Out Development Aid? A Dynamic Panel Data Analysi...
Help international clustering project
Clustering of world countries on socio economic factors
Hierarchical and Non Hierarchical Clustering.pptx
Locality service planning with geographical information system: Spatial analy...
LECTURE ONE DE-II PPT.pptx
Using survey data to predict poverty in relation to financial service access ...
dissertation_master
Does Humanitarian Aid Crowd Out Development Aid? A Dynamic Panel Data Analysi...

Recently uploaded (20)

PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
modul_python (1).pptx for professional and student
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Transcultural that can help you someday.
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
Microsoft Core Cloud Services powerpoint
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
chrmotography.pptx food anaylysis techni
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
Introduction to the R Programming Language
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Managing Community Partner Relationships
STERILIZATION AND DISINFECTION-1.ppthhhbx
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
modul_python (1).pptx for professional and student
[EN] Industrial Machine Downtime Prediction
Transcultural that can help you someday.
Optimise Shopper Experiences with a Strong Data Estate.pdf
Microsoft Core Cloud Services powerpoint
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
chrmotography.pptx food anaylysis techni
CYBER SECURITY the Next Warefare Tactics
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
DU, AIS, Big Data and Data Analytics.ppt
Introduction to the R Programming Language
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Managing Community Partner Relationships
Ad
Ad

Clustering and pca assignment -1.pdf

  • 2. PROBLEM STATEMENT HELP International is an international humanitarian NGO that is committed to fighting poverty and providing the people of backward countries with basic amenities and relief during the time of disasters and natural calamities. It runs a lot of operational projects from time to time along with advocacy drives to raise awareness as well as for funding purposes. After the recent funding programmes, they have been able to raise around $ 10 million. Now the CEO of the NGO needs to decide how to use this money strategically and effectively. The significant issues that come while making this decision are mostly related to choosing the countries that are in the direst need of aid.
  • 3. OBJECTIVE The requisite is: - To categorise the countries using some socio-economic and health factors that determine the overall development of the country. - To suggest the countries which the CEO needs to focus on the most.
  • 4. DATA PROCESSING - It was found that there were no null values - There were also no duplicate values for country - There were a few outliers and they were treated later on during PCA - The data was standardized for Principal Component Analysis
  • 6. PCA
  • 7. CLUSTERING - Both the methods K means and Hierarchical Clustering was used on the 4 PCA components - For K means , K= 3 was taken. - While doing the Hopkins Statistics a value of 0.77 was attained. [ If the values are: 0.01 - 0.3 : Low chase of clustering around 0.5 : Random 0.7 - 0.99 : High chance of clustering]
  • 9. SUM OF SQUARED DISTANCES
  • 16. CONCLUSION The countries that require help the most are listed below: Afghanistan, Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Central African Republic, Chad, Comoros, Congo, Dem. Rep., Congo, Rep., Cote d'Ivoire, Equatorial Guinea, Eritrea, Gabon, Gambia, Ghana, Guinea, Guinea-Bissau, Haiti, Iraq, Kenya, Lao, Madagascar, Malawi, Mali, Mauritania, Mozambique, Namibia, Niger, Pakistan, Rwanda, Senegal, Sierra Leone, South Africa, Sudan, Tanzania, Timor-Leste, Togo, Uganda, Yemen and Zambia. These countries have - very low rate of net income per person, GDP per capita, average number of years a new born child would live, total health spending and imports of goods and services. - very high rate of measurement of the annual growth rate, number of children that would be born and child mortality rate. It is clear that these countries require very quick aid in terms of money, education and services.