Making Susanne
THE WORLD RECESSION SUSCEPTIBILITY ANALYSIS WEBTOOL
Aditya Gupta (2011009)
1
Overview
 Objective and Motive
 Process
 Data
 Data Collection
 Data Cleanup
 Dependent Variable
 Analysis
 Regression, Classification
 Clustering and Results
 Further Work
 Presentation
 Drawbacks
2
Objective
Key Questions
 To study what economic
composition makes economies
more susceptible to
Global Recession
 By how much? How significantly?
 Can we predict recession impact?
 Studying similarity of economies
 How to quantify
 Recession impact
 Susceptibility
Key Goals
 Empirically ascertain significance
and impact of certain economic
traits vis-à-vis expert opinions
 Develop a powerful model to
predict recession susceptibility
 Present a global and intuitive view
across parameters and their
significance, i.e., Susanne.
3
Why? 4
 Recessions Hurt
 2008 – 2010
 Unemployment and layoffs
 Austerity
 Bearish Markets
 Inflation
 Bank Runs
Data
A BIG PROBLEM IN ITSELF
5
Data Collection
 60+ economic variables linked to Recession
 Allegedly, as per sources like Economist, Forbes, World Bank, WWW
 Or as per our suspicion
 Preferable country-specific ratios (Normalized and Structural Information)
 13 years, starting from 2000 to 2013
 210 Countries
 2500 rows of 60+ columns, each row is a country-year identified
 Source:
 World Bank (OECD National Account File): http://data.worldbank.org/
 United Nations Comtrade Database, International Monetary Fund, Direction
of Trade Database, Balance of Payments Database, and more.
6
Simple Enough?
 No, its blistering gunk.
 Only 86 out of ~2500 rows have
complete data (no NA’s)
 Non-normalized values
 What now?
7
Data Cleanup
 Manually add values for nearly complete columns
 Compress and remove years 2008-2010
 Observed Class Variables – not Causal
 Remove countries with almost no data (Afghanistan and 30 others)
 Down to 2100 rows
 Drop columns if:
 Significant, and have very little data available
 Year values for super-specific variables like
“Merchandise Exports to Scandinavian Countries as % of exports”
 We surely don’t have this for most countries, especially those like Albania
 Insignificant
 Determined from Regressional Analysis (MLR) p-values
8
Still too many missing values
 What to do?
 Drop row (done)
 Weighted expansion of row (did not consider)
 Infer a value
 Average value for parameter for country
over 13 years
 Result: Fixed 5000 out of 70K cells
 Still only 86 full rows
 Why?
 No value for property exists for a country at
all…
 Solution: Global Average – the sacrilege!
 Awful. Pull towards the mean, misclassify.
 But no directional bias.
 Trade-off: Unlocks a world of data
9
For a specific country…
Year V1 Pred V2 Pred
2001 15 15 15 15
2002 17 17 17 17
2003 15 15 19 19
2004 16 16 21 21
2005 ? 16 ? 18
Avg 16 18
Dependent Variable
 Goal: Capture Recession Impact between 2008-10
 Technical Definition: Absolute Growth Rate
 Problem: 15% to 1%, still not recession
 Average Growth Rate drop between 2008 – 2010?
 Problem: 1% drop for UK (at 0.5%) vs. AFG (at 15%)
 Percentage Drop in Growth Rate
 Problem: 0.1% to 3%, 3000% change!
 Solution?
 Drop.SD: Drop in Growth Rate in number of Standard
Deviations over 2008 – 2010.
Variance in Growth rate over last 20 years.
 Variance: Lot of manual data collection
10
Distribution and Discretization of
Drop.SD
 Corresponded well with web information about
countries that “avoided recession” and those
“hit worst”
 21% of the countries labelled as unsusceptible
 Less than -0.25 Drop in GDP during 2007-2010
 Middle 36% labelled as relatively unaffected
 between -0.25 to 0.75 SD GDP Drop in Recession
 Highest 43% adversely affected
 >0.75 SD drop in GDP
 How good was this division?
 lm R-squared rose from 41.58% to 44.14% (no loss in
predictive power, i.e. reasonable classification)
 Good split. Most countries were affected horribly.
11
Other Options
• Equal Density Split
• Maximise Classification Accuracy
• Purely Contextual
Analysis
AFTER WE FINALLY HAVE 2099 COMPLETE ROWS OF DATA
12
Classification: Can we predict
Recession Susceptibility?
 Assumption (non trivial)
 Drop.SD correctly represents Recession Impact in 08-10
 Recession Impact in 08-10 correctly represents Recession Susceptibility
 Can’t do better but guess a few things.
 SVM has 92% accuracy. Seems like it.
 Caveat:
 Bootstrap Analysis: Training Data = Test Data
 Workaround (can’t generate new countries or years):
 5-Fold Cross Validation
13
Important Variables and their
Impact
 Using Multilinear Regression (MLR),
 for each Economic Variable, we get
 degree
 and direction
of impact on Drop.SD
14
Clustering
 Motivation:
“Are we brute-engineering a predictor, or is there an actual underlying
economic structural pattern of recession-susceptibility?”
 70% accuracy (consistent clusters)with k-means, using multiple k
values.
 You decide.
15
Aside: Some nifty R snippets
 Populating Missing Values smartly:
 d7.agg <- aggregate(d7.norm[,5:48], by
= list(Country.Name =
d7.norm$Country.Name, Time =
d7.norm$Time), FUN = function(x) {
if(anyNA(x)) { mean(na.omit(x)) } else
{ mean(x) } })
 Class labelling:
 Rec.Affect <- ifelse
(CompleteDataFinal$Drop.SD. >= 1, 1,
ifelse(CompleteDataFinal$Drop.SD. >= -
0.25, 0, -1)).
 SVM (predict Recession Susceptibility)
16
> model <- svm(as.factor(Rec.Affect) ~
., data = CompleteDataFinal[,
c(6:46)])
> predictions <- predict(model,
CompleteDataFinal[,c(6:46)])
> table(pred = predictions, true =
Rec.Affect)
true
pred -1 0 1
-1 383 8 7
0 33 675 41
1 15 61 876
> t <- table(pred = predictions, true
= Rec.Affect)
> (t[1,1] + t[2,2] + t[3,3])/sum(t)
[1] 0.9213911
 Allowing similarity checks between
Economies
 Overall
 Over Economic Categories
 Using a Semantic Web compliant
Cardinality Checks and Ontology
 Classify Economic Variables into
one or more of:
 Central Government
 Economic Structure
 Net Exports
 Banking
 Manufacturing
 GDP
 Discretize them into , and over:
 Value
 And Impact
 Using middle 80 percentile cut-offs
Further Work - Accessibility 17
Presentation
THE WEBTOOL, I.E, SUSANNE
18
A Knowledge Map (Ontology) 19
Ontology (2) 20
• About 3500 Nodes
• And 7000 Edges
Backend 21
Frontend 22
 Takeaway:
 Single Page Architecture
 Global State:
 (a) Country
 (a) Year
 (a) Property
 Multiple Interesting Views:
 Map
 Country
 Property
 And 2-way state
integration and state-
update
 thanks, angular
Search (Tutorial) 23
Search (2) 24
Search (3) 25
Search (4) 26
Search (5) 27
Search (6) 28
Search (7) – and the last one,
breathe.
29
Demo
http://susanne.bitballoon.com/
30
Thank You
Sincerely
31

More Related Content

PPTX
Linear Regression - an 80 year study of the Dow Jones industrial average
PDF
Chapter 4 - Model of Production
PDF
Chapter 6 - Romer Model
PPTX
法雅客攝影馬拉松記錄
DOCX
Bet365 review
PPTX
mmmmmmm
PPTX
Crazy camel
PPTX
Jenny jaguar
Linear Regression - an 80 year study of the Dow Jones industrial average
Chapter 4 - Model of Production
Chapter 6 - Romer Model
法雅客攝影馬拉松記錄
Bet365 review
mmmmmmm
Crazy camel
Jenny jaguar

Viewers also liked (16)

PPSX
Endocrine system 1
PDF
Infographie passage au cloud
PPTX
Woody woodchuck
PPTX
Arnie aardvark
PPTX
Old joe crow
PDF
Sonicwall
PDF
China air pollution season kicks off with a cough and a wheeze as coal plants...
PDF
第14屆法雅客24小時攝影紀錄
DOCX
Cognitive dissonance is a Good thing
PDF
e-bulletin-November2010
PDF
2014法雅客攝影馬拉松記錄
PDF
Holiday shopping - When and How?
PPTX
Rip Mix Feed- ETEC 540
PPTX
Daisy dragon
PPT
Team building
DOCX
Holland vs Argentina Predictions
Endocrine system 1
Infographie passage au cloud
Woody woodchuck
Arnie aardvark
Old joe crow
Sonicwall
China air pollution season kicks off with a cough and a wheeze as coal plants...
第14屆法雅客24小時攝影紀錄
Cognitive dissonance is a Good thing
e-bulletin-November2010
2014法雅客攝影馬拉松記錄
Holiday shopping - When and How?
Rip Mix Feed- ETEC 540
Daisy dragon
Team building
Holland vs Argentina Predictions
Ad

Similar to Making susanne (20)

PDF
Isabelle Roland - The Aggregate E ects of Credit Market Frictions: Evidence f...
PPTX
Costs of sovereign default: restructuring strategies, bank distress and the c...
PPTX
HLEG thematic workshop on measuring economic, social and environmental resili...
PPTX
Exchange rate pass-through: What has changed since the crisis?
PPT
Intro to quant_analysis_students
PDF
Andrea Linarello - Allocative Efficiency and Finance
PDF
PPTX
The UK Management and Expectations Survey: First Results
PPTX
Chapter 2 of the book Basic Statistics as described by teacher
PDF
Project Risk Assessment PowerPoint Presentation Slides
PDF
On the economic effect of Western sanctions on Russia
PDF
Monetary policy, factor allocation and growth
PDF
Financial Heterogeneity and Monetary Union
PDF
Project Risk Assessment Powerpoint Presentation Slides
PPTX
Type of data @ web mining discussion
PPTX
Type of data @ Web Mining Discussion
PDF
Operative Risk Analysis of Industrial Small and medium Enterprises
PPT
Demand forecasting methods 1 gp
PDF
Data Driven Dashboards For Economic and Policy Analysis
DOCX
EMPRICAL FINANCE RESEARCH ON NON FINANCIAL COMPANIES OF PAKISTAN
Isabelle Roland - The Aggregate E ects of Credit Market Frictions: Evidence f...
Costs of sovereign default: restructuring strategies, bank distress and the c...
HLEG thematic workshop on measuring economic, social and environmental resili...
Exchange rate pass-through: What has changed since the crisis?
Intro to quant_analysis_students
Andrea Linarello - Allocative Efficiency and Finance
The UK Management and Expectations Survey: First Results
Chapter 2 of the book Basic Statistics as described by teacher
Project Risk Assessment PowerPoint Presentation Slides
On the economic effect of Western sanctions on Russia
Monetary policy, factor allocation and growth
Financial Heterogeneity and Monetary Union
Project Risk Assessment Powerpoint Presentation Slides
Type of data @ web mining discussion
Type of data @ Web Mining Discussion
Operative Risk Analysis of Industrial Small and medium Enterprises
Demand forecasting methods 1 gp
Data Driven Dashboards For Economic and Policy Analysis
EMPRICAL FINANCE RESEARCH ON NON FINANCIAL COMPANIES OF PAKISTAN
Ad

More from Aditya Gupta (7)

PPTX
Career Advice for Engineering Students in India
PDF
Meetups - A realtime distributed monitoring application (Android)
PDF
Multi Agent Systems presentation
PPTX
Equity research project, 2014 (banking)
PDF
Food Corporation of India - HR Policy Translation
PDF
India Energy Security Scenarios Calculator - BTech Project
PDF
Responses
Career Advice for Engineering Students in India
Meetups - A realtime distributed monitoring application (Android)
Multi Agent Systems presentation
Equity research project, 2014 (banking)
Food Corporation of India - HR Policy Translation
India Energy Security Scenarios Calculator - BTech Project
Responses

Recently uploaded (20)

PDF
epic-retirement-criteria-for-funds (1).pdf
PPT
Project_finance_introduction in finance.ppt
PPTX
Machine Learning (ML) is a branch of Artificial Intelligence (AI)
PDF
2018_Simulating Hedge Fund Strategies Generalising Fund Performance Presentat...
PDF
Lundin Gold Corporate Presentation August 2025
PDF
USS pension Report and Accounts 2025.pdf
PPTX
balanced_and_unbalanced_growth_theory_ppt.pptx
PDF
Pension Trustee Training (1).pdf From Salih Shah
PPTX
Simple linear regression model an important topic in econometrics
PDF
GVCParticipation_Automation_Climate_India
PDF
NewBase 22 August 2025 Energy News issue - 1818 by Khaled Al Awadi_compresse...
PDF
Management Accounting Information for Decision-Making and Strategy Execution ...
PDF
Chapterrrrrrrrrrrrrrrrrrrrrrrrr 2_AP.pdf
PPTX
RISK MANAGEMENT IN MEDICAL LABORATORIES 2.pptx
PPTX
Lesson Environment and Economic Growth.pptx
PDF
Call cute girls 😀 Delhi, call now pls cute girls delhi call🔙
PPTX
Financial literacy among Collage students.pptx
PDF
Financial discipline for educational purpose
PDF
MPEDA Export License Apply Online for Seafood Export License in India.pdf
PDF
Lundin Gold - August 2025.pdf presentation
epic-retirement-criteria-for-funds (1).pdf
Project_finance_introduction in finance.ppt
Machine Learning (ML) is a branch of Artificial Intelligence (AI)
2018_Simulating Hedge Fund Strategies Generalising Fund Performance Presentat...
Lundin Gold Corporate Presentation August 2025
USS pension Report and Accounts 2025.pdf
balanced_and_unbalanced_growth_theory_ppt.pptx
Pension Trustee Training (1).pdf From Salih Shah
Simple linear regression model an important topic in econometrics
GVCParticipation_Automation_Climate_India
NewBase 22 August 2025 Energy News issue - 1818 by Khaled Al Awadi_compresse...
Management Accounting Information for Decision-Making and Strategy Execution ...
Chapterrrrrrrrrrrrrrrrrrrrrrrrr 2_AP.pdf
RISK MANAGEMENT IN MEDICAL LABORATORIES 2.pptx
Lesson Environment and Economic Growth.pptx
Call cute girls 😀 Delhi, call now pls cute girls delhi call🔙
Financial literacy among Collage students.pptx
Financial discipline for educational purpose
MPEDA Export License Apply Online for Seafood Export License in India.pdf
Lundin Gold - August 2025.pdf presentation

Making susanne

  • 1. Making Susanne THE WORLD RECESSION SUSCEPTIBILITY ANALYSIS WEBTOOL Aditya Gupta (2011009) 1
  • 2. Overview  Objective and Motive  Process  Data  Data Collection  Data Cleanup  Dependent Variable  Analysis  Regression, Classification  Clustering and Results  Further Work  Presentation  Drawbacks 2
  • 3. Objective Key Questions  To study what economic composition makes economies more susceptible to Global Recession  By how much? How significantly?  Can we predict recession impact?  Studying similarity of economies  How to quantify  Recession impact  Susceptibility Key Goals  Empirically ascertain significance and impact of certain economic traits vis-à-vis expert opinions  Develop a powerful model to predict recession susceptibility  Present a global and intuitive view across parameters and their significance, i.e., Susanne. 3
  • 4. Why? 4  Recessions Hurt  2008 – 2010  Unemployment and layoffs  Austerity  Bearish Markets  Inflation  Bank Runs
  • 5. Data A BIG PROBLEM IN ITSELF 5
  • 6. Data Collection  60+ economic variables linked to Recession  Allegedly, as per sources like Economist, Forbes, World Bank, WWW  Or as per our suspicion  Preferable country-specific ratios (Normalized and Structural Information)  13 years, starting from 2000 to 2013  210 Countries  2500 rows of 60+ columns, each row is a country-year identified  Source:  World Bank (OECD National Account File): http://data.worldbank.org/  United Nations Comtrade Database, International Monetary Fund, Direction of Trade Database, Balance of Payments Database, and more. 6
  • 7. Simple Enough?  No, its blistering gunk.  Only 86 out of ~2500 rows have complete data (no NA’s)  Non-normalized values  What now? 7
  • 8. Data Cleanup  Manually add values for nearly complete columns  Compress and remove years 2008-2010  Observed Class Variables – not Causal  Remove countries with almost no data (Afghanistan and 30 others)  Down to 2100 rows  Drop columns if:  Significant, and have very little data available  Year values for super-specific variables like “Merchandise Exports to Scandinavian Countries as % of exports”  We surely don’t have this for most countries, especially those like Albania  Insignificant  Determined from Regressional Analysis (MLR) p-values 8
  • 9. Still too many missing values  What to do?  Drop row (done)  Weighted expansion of row (did not consider)  Infer a value  Average value for parameter for country over 13 years  Result: Fixed 5000 out of 70K cells  Still only 86 full rows  Why?  No value for property exists for a country at all…  Solution: Global Average – the sacrilege!  Awful. Pull towards the mean, misclassify.  But no directional bias.  Trade-off: Unlocks a world of data 9 For a specific country… Year V1 Pred V2 Pred 2001 15 15 15 15 2002 17 17 17 17 2003 15 15 19 19 2004 16 16 21 21 2005 ? 16 ? 18 Avg 16 18
  • 10. Dependent Variable  Goal: Capture Recession Impact between 2008-10  Technical Definition: Absolute Growth Rate  Problem: 15% to 1%, still not recession  Average Growth Rate drop between 2008 – 2010?  Problem: 1% drop for UK (at 0.5%) vs. AFG (at 15%)  Percentage Drop in Growth Rate  Problem: 0.1% to 3%, 3000% change!  Solution?  Drop.SD: Drop in Growth Rate in number of Standard Deviations over 2008 – 2010. Variance in Growth rate over last 20 years.  Variance: Lot of manual data collection 10
  • 11. Distribution and Discretization of Drop.SD  Corresponded well with web information about countries that “avoided recession” and those “hit worst”  21% of the countries labelled as unsusceptible  Less than -0.25 Drop in GDP during 2007-2010  Middle 36% labelled as relatively unaffected  between -0.25 to 0.75 SD GDP Drop in Recession  Highest 43% adversely affected  >0.75 SD drop in GDP  How good was this division?  lm R-squared rose from 41.58% to 44.14% (no loss in predictive power, i.e. reasonable classification)  Good split. Most countries were affected horribly. 11 Other Options • Equal Density Split • Maximise Classification Accuracy • Purely Contextual
  • 12. Analysis AFTER WE FINALLY HAVE 2099 COMPLETE ROWS OF DATA 12
  • 13. Classification: Can we predict Recession Susceptibility?  Assumption (non trivial)  Drop.SD correctly represents Recession Impact in 08-10  Recession Impact in 08-10 correctly represents Recession Susceptibility  Can’t do better but guess a few things.  SVM has 92% accuracy. Seems like it.  Caveat:  Bootstrap Analysis: Training Data = Test Data  Workaround (can’t generate new countries or years):  5-Fold Cross Validation 13
  • 14. Important Variables and their Impact  Using Multilinear Regression (MLR),  for each Economic Variable, we get  degree  and direction of impact on Drop.SD 14
  • 15. Clustering  Motivation: “Are we brute-engineering a predictor, or is there an actual underlying economic structural pattern of recession-susceptibility?”  70% accuracy (consistent clusters)with k-means, using multiple k values.  You decide. 15
  • 16. Aside: Some nifty R snippets  Populating Missing Values smartly:  d7.agg <- aggregate(d7.norm[,5:48], by = list(Country.Name = d7.norm$Country.Name, Time = d7.norm$Time), FUN = function(x) { if(anyNA(x)) { mean(na.omit(x)) } else { mean(x) } })  Class labelling:  Rec.Affect <- ifelse (CompleteDataFinal$Drop.SD. >= 1, 1, ifelse(CompleteDataFinal$Drop.SD. >= - 0.25, 0, -1)).  SVM (predict Recession Susceptibility) 16 > model <- svm(as.factor(Rec.Affect) ~ ., data = CompleteDataFinal[, c(6:46)]) > predictions <- predict(model, CompleteDataFinal[,c(6:46)]) > table(pred = predictions, true = Rec.Affect) true pred -1 0 1 -1 383 8 7 0 33 675 41 1 15 61 876 > t <- table(pred = predictions, true = Rec.Affect) > (t[1,1] + t[2,2] + t[3,3])/sum(t) [1] 0.9213911
  • 17.  Allowing similarity checks between Economies  Overall  Over Economic Categories  Using a Semantic Web compliant Cardinality Checks and Ontology  Classify Economic Variables into one or more of:  Central Government  Economic Structure  Net Exports  Banking  Manufacturing  GDP  Discretize them into , and over:  Value  And Impact  Using middle 80 percentile cut-offs Further Work - Accessibility 17
  • 19. A Knowledge Map (Ontology) 19
  • 20. Ontology (2) 20 • About 3500 Nodes • And 7000 Edges
  • 22. Frontend 22  Takeaway:  Single Page Architecture  Global State:  (a) Country  (a) Year  (a) Property  Multiple Interesting Views:  Map  Country  Property  And 2-way state integration and state- update  thanks, angular
  • 29. Search (7) – and the last one, breathe. 29