SlideShare a Scribd company logo
+
Big Data Meets Earned
Value Management
We have lots data. How can we use it to
make predictive and prescriptive
forecasts of future performance to
increase Probability of Program
Success?
Glen B. Alleman
Thomas J. Coonce
+The Killer Question For Every Manager
Of A Complex, High Risk Program Is …
2
… How Can I See An
Unanticipated Estimate At
Completion (EAC) Coming
Before It’s Too Late?
“What’s in Your Estimate at Completion?”, Pat Barker and
Roberta Tomasini, Defense AT&L, March-April, 2014
+Here’s WHY We Need Better Ways To
Forecast Estimate At Complete …
3
… the root cause starts on day one,
with a less than credible PMB.
42%
29%
21%
0%
10%
20%
30%
40%
50%
60%
From
Phase B
Start
From
PDR
From
CDR
DevelopmentCostGrowth
29%
23%
19%
0%
10%
20%
30%
40%
50%
60%
From
Phase B
Start
From
PDR
From
CDR
PhaseB/C/DScheduleGrowth
+Three Types Of Data Are Available In
Big Data Repositories
n  Descriptive – looking in the past we can learn what
happened, but it’s too late to take corrective action.
n  Predictive – using past performance we can answer
the question what will happen if we do nothing but
do the same as we’ve done in the past.
n  Prescriptive – past performance data used to make
predictions and suggest decision options to take
advantage of the predictions
4
Prescriptive analytics not only anticipates what will
happen and when it will happen, but why it will
happen.
+Descriptive Analytics
n  Descriptive Analytics – condensing big data into
smaller, useful nuggets of information.
n  Most raw Earned Value data is not suitable for human
consumption since it is reported by WBS without the
connectivity to the product or programmatic topology
n  Descriptive data summarizes what happened in the
past, many times 45 days in the past.
n  Correlations between WBS elements not defined nor
correlations between risk, technical performance or
Systems Engineering attributes – MOE, MOP, KPP†
The EVM repositories provide the raw material for Descriptive
Analytics through the IPMR (DI-MGMT-81861) submittals
5
† The Defense Acquisition Guide defines how to apply Measures of Effectiveness, Measures of Performance,
Technical Performance Measures, and Key Performance Parameters to assess program performance
+DAU Gold Card’s EAC Formula Uses
Predictive Analytics, But …
n  Past variances are wiped
out with “Cumulative to
Date” data
n  No adjustment for risk
n  Not statistically corrected
for past performance
6
+Prescriptive Analytics
n  Is a type of Predictive Analytics
n  Used when we need to prescribe an action so
leadership can take the data and act.
n  Predictive analytics doesn’t predict one future
outcome – but Multiple outcomes based on the
decision makers actions.
n  Prescriptive analytics requires a predictive model
with two additional components:
n  Actionable data.
n  Feedback system that tracks the outcome produced by the
action taken..
7
+Prescriptive Analytics Is The
Foundation For Corrective Actions
n  Prescriptive Analytics is about making
decisions based on data.
n  Prescriptive analytics requires a predictive
model with two components:
n  Actionable data
n  Feedback from those actions
n  Prescriptive models predict the possible
consequences based on different choices of
action.
8
Milestones are rocks on the side of the road.The Roman
Milestone was a measure back to Rome.You only know that
distance after you pass the milestone.
+There Is Untapped Value In An Earned
Value Data Repository
n  Most data is of little value at the detail level since it is
uncorrelated in the reporting process
n  Making correlations between cause and effect is difficult for
humans, but statistical process algorithms can do this for us
n  With correlated data in hand, we can start generating
descriptive analytics
n  But drivers of variance are not visible in the repository
n  Variances from past can be calculated, but not used in future
forecasts
n  There is no built-in mechanism to see patterns in the
data
n  Standard tools produce linear, non-statistical, non-risk adjusted
forecasts
To extract this value we need to overcome some limitations in
today’s repositories
9
+All Programmatic Forecasting Is Probabilistic,
Driven By Underlying Statistical Processes
If we make forecasts about program performance that are not
statistically and risk adjusted – we’re gonna get wet.
10
+Schedule, Related Cost And
Technical Elements Are Probabilistic
The IMS doesn’t help us much either,since the correlative
drivers are themselves non-linear stochastic processes
11
A Stochastic
process is a
collection of
random variables
used to represent
the evolution of
some random
value or system,
over time.
+The Ability To Forecast Future Performance
Starts With A Tool That Provides…
n  Forecasting of future performance,
using time series of the past using
Autoregressive Integrated Moving
Average (ARIMA) algorithm
n  Confidence intervals of these
forecasts for past performance
n  Correlation between the time series
elements (CPI, SPI,WBS element)
n  Deeper correlations between these
Earned Value elements as risk
retirement, increase effectiveness and
performance and any other recorded
measure of the program.
12
http://cran.us.r-project.org/
The combination of some data and an aching desire for an answer does not ensure
that a reasonable answer can be extracted from a given body of data. - John Tukey
+A Quick Look At Where We’re Going
Starting with Forecasting CPI/SPI
n  We have a time
series of CPI, SPI, in
the repository
n  What’s possible
behaviors in the
future can we
discover from the
past behavior?
n  The R code on the
top answers that in
4 lines.
If we want to credibly forecast the future with the past,we’ll
need better tools.We’ve got the data,just need to use it
13
14
§  The Units of Measures for Earned Value Management are
Dollars
§  Cumulative indices wipe out all the variances
§  Forecasts of future performance are not statistically adjusted
§  There is no correlative information drivers of variances
§  None of these forecasts use the risk register to adjust their
value
+Since ARIMA Is A Well Traveled Path,
We Need More and Better Tools
n  The Earned Value Management Performance measures
need to be connected to:
n  Risk retirement and buy down status Technical Performance
Measure compliance
n  Measures of Effectiveness and Measures of Performance
n  Work Breakdown Structure correlations for each work
activity
n  Correlations between performance and work performed is
available in the repository
n  We’re missing the tool to reveal these correlations, drivers, and
corrective actions to keep the program GREEN
To provide better forecasts of EAC,we need more data.
CPI/SPI needs to be augmented with technical program data
15
+We Need More Power To See Into The
Future And Take Corrective Actions
16
We need
more power
Mr. Scott
I’m given
her all she’s
got ‘Captain
+Principal Component Analysis (PCA)
Gets More Power from our data
17
Principal component analysis (PCA) is a statistical procedure
that uses orthogonal transformation to convert a set of
observations of possibly correlated variables into a set of
values of linearly uncorrelated variables called principal
components.
We want to convert a larger set of program performance
variables – SPI, CPI, Risk Retirements,TPM, MOE, MOP, KPP,
Staffing, and others, into a small set of drivers of variance.
PCA can provide visibility to what are the connections
between EAC growth and the source of that growth in a
statistically sound manner, not currently available with IPMR
reporting using CPI/SPI
+What Can PCA Tell Us?
n  If data lies in high dimensional space (more than just
CPI/SPI), then large amount of data is required to learn
distributions or decision rules.
n  For each WBS element 9 dimensions (CPI, SPI,WBS,
TPM, MOE, MOP, KPP, Risk, Staffing Profiles).
n  Each dimension has 36 levels (36 months of data).
n  We could produce a 9 dimension scatter plot for the 36
months of data and it’d look like a big blob.
n  We need to know what are the drivers in this Blob of
data?
With “all” the data in a single place – which it is not – we need
a way to reduce the dimensionality to provide analysis
18
+From 2 Dimensions (SPI/CPI) to 8
Dimensions and Back Again
n  Two components, for example –
SPI and CPI
n  Discover the correlation between
these two data samples
n  Locate in the individual samples
the time the drivers started
impacting the program
n  Extend this to 8 dimensions
n  Similar to Joint Confidence Level,
but with actual data
19
PCi=a1X1 + a2X2 + a3X3 + … + a8K8
+Program Performance Dimensions
PCA data can be simple 2 dimensional – CPI/SPI or more
complex and represent other “attributes” driving EAC
20
Variable Information that mat drive Unanticipated EAC
CPI/SPI CPI for program, time phased by reporting period
TPM
Technical Performance Measures, with control bands as program
moves left to right.These can be any measure technical
compliance
§  Weight
§  Throughput
§  Information Assurance validation
§  Any of the JROC KPPs
Risk
Risk
§  Risk retirement buy down plan
§  Risk handling and planned reduction
Margin Cost and schedule margin burn down to plan
+Call to Action for increased visibility to
Unanticipated EAC Growth using BIG Data
21
Normalize data in the Central
Repository in preparation for analysis
Apply ARIMA to normalized data to
forecast CPI, SPI, and Calculated EAC
Adjust ARIMA parameters using past
performance compliance
Integrate external data with EV
repository data to build correlations
the EAC forecasts
Apply Principal Component Analysis
(PCA) to identify correlated drivers of
EAC growth

More Related Content

PDF
Focus on the nine I's (v9)
PDF
Introduction to monte-carlo analysis for software development - Troy Magennis...
PDF
Ev+agile=success
PDF
Agile at scale resources
PDF
PDF
Evm+agile (8.8).chapter 9
PDF
Forecasting cost and schedule performance
PDF
Agile in the government
Focus on the nine I's (v9)
Introduction to monte-carlo analysis for software development - Troy Magennis...
Ev+agile=success
Agile at scale resources
Evm+agile (8.8).chapter 9
Forecasting cost and schedule performance
Agile in the government

What's hot (20)

PDF
Five immutable principles
PDF
Successfully Integrating Agile and Earned Value
PDF
EVM+Agile the darkside
PDF
Project breathalyzer
PDF
Integrated Agile with EVM -- Executive overview
PDF
Monte Carlo Simulation for Agile Development
PPTX
Capabilities Based Planning
PDF
Evm+agile estimating
PDF
Using balanced scorecard to build a project focused org2
PDF
Integrated Agile Software Development with Earned Value Management
PPT
Avoid software project horror stories - check the reality value of the estima...
PDF
Nine Best Practices
PDF
Paradigm of agile project management
PDF
5 immutable principles and 5 processes in 60 seconds
PPTX
Risk adjusted engineering management
PDF
Deliverables Based Planning
PDF
Building the Performance Measurement Baseline
PDF
Earned Value + Agile = Success
DOCX
What Does Done Look Like?
PDF
SOLVING PROJECT ALLOCATION RESOURCE PROBLEMS WITH AEROSPACE ERP
Five immutable principles
Successfully Integrating Agile and Earned Value
EVM+Agile the darkside
Project breathalyzer
Integrated Agile with EVM -- Executive overview
Monte Carlo Simulation for Agile Development
Capabilities Based Planning
Evm+agile estimating
Using balanced scorecard to build a project focused org2
Integrated Agile Software Development with Earned Value Management
Avoid software project horror stories - check the reality value of the estima...
Nine Best Practices
Paradigm of agile project management
5 immutable principles and 5 processes in 60 seconds
Risk adjusted engineering management
Deliverables Based Planning
Building the Performance Measurement Baseline
Earned Value + Agile = Success
What Does Done Look Like?
SOLVING PROJECT ALLOCATION RESOURCE PROBLEMS WITH AEROSPACE ERP
Ad

Viewers also liked (8)

PDF
Earned Value Management Meets Big Data
PDF
Managing in the presence of uncertainty
PDF
Measurement News Webinar
PDF
Cpm 200 C technical performance measures ipm2016
PDF
Building the perfect schedule (v6)
PDF
Principles and Practices of Performance-Based Project Management®
PDF
Control systems
PDF
Notes on balanced scorecard
Earned Value Management Meets Big Data
Managing in the presence of uncertainty
Measurement News Webinar
Cpm 200 C technical performance measures ipm2016
Building the perfect schedule (v6)
Principles and Practices of Performance-Based Project Management®
Control systems
Notes on balanced scorecard
Ad

Similar to Big data meets evm (submitted) (20)

PPTX
Practical Machine Learning at Work
DOCX
Beyond CPI and SPI
PDF
Forecasting cost and schedule performance
PPTX
PROMISE 2011: What Prediction Model Should Be?
PDF
Earning Value from Earned Value Management
PPTX
Six sigma tools an overview
PDF
Big data meets evm (submitted).pptx
PDF
Methods of Forecasting for Capacity Management
PDF
Earned Value Management Essentials
PDF
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
PDF
Building a Credible Performance Measurement Baseline
PDF
Forecasting and Predictive Digital Marketing. Daniele Donzella, Gianluigi Spa...
PPTX
Linear Regression with R programming.pptx
PPT
What is Oracle Demantra
PDF
Earned Value Management Essentials
PDF
Earned Value Managed Services
PPT
Data Mining and Analytics
PPTX
C O N T R O L L P R E S E N T A T I O N
PDF
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
PPT
Chapter 9 Planning Tools Techniques Ppt09
 
Practical Machine Learning at Work
Beyond CPI and SPI
Forecasting cost and schedule performance
PROMISE 2011: What Prediction Model Should Be?
Earning Value from Earned Value Management
Six sigma tools an overview
Big data meets evm (submitted).pptx
Methods of Forecasting for Capacity Management
Earned Value Management Essentials
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
Building a Credible Performance Measurement Baseline
Forecasting and Predictive Digital Marketing. Daniele Donzella, Gianluigi Spa...
Linear Regression with R programming.pptx
What is Oracle Demantra
Earned Value Management Essentials
Earned Value Managed Services
Data Mining and Analytics
C O N T R O L L P R E S E N T A T I O N
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Chapter 9 Planning Tools Techniques Ppt09
 

More from Glen Alleman (20)

PDF
Managing risk with deliverables planning
PDF
A Gentle Introduction to the IMP/IMS
PDF
Increasing the Probability of Project Success
PDF
Process Flow and Narrative for Agile+PPM
PDF
Practices of risk management
PDF
Principles of Risk Management
PDF
Deliverables Based Planning, PMBOK® and 5 Immutable Principles of Project Suc...
PDF
From Principles to Strategies for Systems Engineering
PDF
NAVAIR Integrated Master Schedule Guide guide
PDF
Integrated master plan methodology (v2)
PDF
IMP / IMS Step by Step
PDF
DHS - Using functions points to estimate agile development programs (v2)
PDF
Making the impossible possible
PDF
Heliotropic Abundance
PDF
Capabilities based planning
PDF
Process Flow and Narrative for Agile
PPTX
Program Management Office Lean Software Development and Six Sigma
PDF
Policy and Procedure Rollout
PDF
Integrated Master Plan Development
PDF
Project Management Theory
Managing risk with deliverables planning
A Gentle Introduction to the IMP/IMS
Increasing the Probability of Project Success
Process Flow and Narrative for Agile+PPM
Practices of risk management
Principles of Risk Management
Deliverables Based Planning, PMBOK® and 5 Immutable Principles of Project Suc...
From Principles to Strategies for Systems Engineering
NAVAIR Integrated Master Schedule Guide guide
Integrated master plan methodology (v2)
IMP / IMS Step by Step
DHS - Using functions points to estimate agile development programs (v2)
Making the impossible possible
Heliotropic Abundance
Capabilities based planning
Process Flow and Narrative for Agile
Program Management Office Lean Software Development and Six Sigma
Policy and Procedure Rollout
Integrated Master Plan Development
Project Management Theory

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Cloud computing and distributed systems.
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
Approach and Philosophy of On baking technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Encapsulation_ Review paper, used for researhc scholars
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Review of recent advances in non-invasive hemoglobin estimation
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
sap open course for s4hana steps from ECC to s4
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Cloud computing and distributed systems.
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Digital-Transformation-Roadmap-for-Companies.pptx

Big data meets evm (submitted)

  • 1. + Big Data Meets Earned Value Management We have lots data. How can we use it to make predictive and prescriptive forecasts of future performance to increase Probability of Program Success? Glen B. Alleman Thomas J. Coonce
  • 2. +The Killer Question For Every Manager Of A Complex, High Risk Program Is … 2 … How Can I See An Unanticipated Estimate At Completion (EAC) Coming Before It’s Too Late? “What’s in Your Estimate at Completion?”, Pat Barker and Roberta Tomasini, Defense AT&L, March-April, 2014
  • 3. +Here’s WHY We Need Better Ways To Forecast Estimate At Complete … 3 … the root cause starts on day one, with a less than credible PMB. 42% 29% 21% 0% 10% 20% 30% 40% 50% 60% From Phase B Start From PDR From CDR DevelopmentCostGrowth 29% 23% 19% 0% 10% 20% 30% 40% 50% 60% From Phase B Start From PDR From CDR PhaseB/C/DScheduleGrowth
  • 4. +Three Types Of Data Are Available In Big Data Repositories n  Descriptive – looking in the past we can learn what happened, but it’s too late to take corrective action. n  Predictive – using past performance we can answer the question what will happen if we do nothing but do the same as we’ve done in the past. n  Prescriptive – past performance data used to make predictions and suggest decision options to take advantage of the predictions 4 Prescriptive analytics not only anticipates what will happen and when it will happen, but why it will happen.
  • 5. +Descriptive Analytics n  Descriptive Analytics – condensing big data into smaller, useful nuggets of information. n  Most raw Earned Value data is not suitable for human consumption since it is reported by WBS without the connectivity to the product or programmatic topology n  Descriptive data summarizes what happened in the past, many times 45 days in the past. n  Correlations between WBS elements not defined nor correlations between risk, technical performance or Systems Engineering attributes – MOE, MOP, KPP† The EVM repositories provide the raw material for Descriptive Analytics through the IPMR (DI-MGMT-81861) submittals 5 † The Defense Acquisition Guide defines how to apply Measures of Effectiveness, Measures of Performance, Technical Performance Measures, and Key Performance Parameters to assess program performance
  • 6. +DAU Gold Card’s EAC Formula Uses Predictive Analytics, But … n  Past variances are wiped out with “Cumulative to Date” data n  No adjustment for risk n  Not statistically corrected for past performance 6
  • 7. +Prescriptive Analytics n  Is a type of Predictive Analytics n  Used when we need to prescribe an action so leadership can take the data and act. n  Predictive analytics doesn’t predict one future outcome – but Multiple outcomes based on the decision makers actions. n  Prescriptive analytics requires a predictive model with two additional components: n  Actionable data. n  Feedback system that tracks the outcome produced by the action taken.. 7
  • 8. +Prescriptive Analytics Is The Foundation For Corrective Actions n  Prescriptive Analytics is about making decisions based on data. n  Prescriptive analytics requires a predictive model with two components: n  Actionable data n  Feedback from those actions n  Prescriptive models predict the possible consequences based on different choices of action. 8 Milestones are rocks on the side of the road.The Roman Milestone was a measure back to Rome.You only know that distance after you pass the milestone.
  • 9. +There Is Untapped Value In An Earned Value Data Repository n  Most data is of little value at the detail level since it is uncorrelated in the reporting process n  Making correlations between cause and effect is difficult for humans, but statistical process algorithms can do this for us n  With correlated data in hand, we can start generating descriptive analytics n  But drivers of variance are not visible in the repository n  Variances from past can be calculated, but not used in future forecasts n  There is no built-in mechanism to see patterns in the data n  Standard tools produce linear, non-statistical, non-risk adjusted forecasts To extract this value we need to overcome some limitations in today’s repositories 9
  • 10. +All Programmatic Forecasting Is Probabilistic, Driven By Underlying Statistical Processes If we make forecasts about program performance that are not statistically and risk adjusted – we’re gonna get wet. 10
  • 11. +Schedule, Related Cost And Technical Elements Are Probabilistic The IMS doesn’t help us much either,since the correlative drivers are themselves non-linear stochastic processes 11 A Stochastic process is a collection of random variables used to represent the evolution of some random value or system, over time.
  • 12. +The Ability To Forecast Future Performance Starts With A Tool That Provides… n  Forecasting of future performance, using time series of the past using Autoregressive Integrated Moving Average (ARIMA) algorithm n  Confidence intervals of these forecasts for past performance n  Correlation between the time series elements (CPI, SPI,WBS element) n  Deeper correlations between these Earned Value elements as risk retirement, increase effectiveness and performance and any other recorded measure of the program. 12 http://cran.us.r-project.org/ The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data. - John Tukey
  • 13. +A Quick Look At Where We’re Going Starting with Forecasting CPI/SPI n  We have a time series of CPI, SPI, in the repository n  What’s possible behaviors in the future can we discover from the past behavior? n  The R code on the top answers that in 4 lines. If we want to credibly forecast the future with the past,we’ll need better tools.We’ve got the data,just need to use it 13
  • 14. 14 §  The Units of Measures for Earned Value Management are Dollars §  Cumulative indices wipe out all the variances §  Forecasts of future performance are not statistically adjusted §  There is no correlative information drivers of variances §  None of these forecasts use the risk register to adjust their value
  • 15. +Since ARIMA Is A Well Traveled Path, We Need More and Better Tools n  The Earned Value Management Performance measures need to be connected to: n  Risk retirement and buy down status Technical Performance Measure compliance n  Measures of Effectiveness and Measures of Performance n  Work Breakdown Structure correlations for each work activity n  Correlations between performance and work performed is available in the repository n  We’re missing the tool to reveal these correlations, drivers, and corrective actions to keep the program GREEN To provide better forecasts of EAC,we need more data. CPI/SPI needs to be augmented with technical program data 15
  • 16. +We Need More Power To See Into The Future And Take Corrective Actions 16 We need more power Mr. Scott I’m given her all she’s got ‘Captain
  • 17. +Principal Component Analysis (PCA) Gets More Power from our data 17 Principal component analysis (PCA) is a statistical procedure that uses orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. We want to convert a larger set of program performance variables – SPI, CPI, Risk Retirements,TPM, MOE, MOP, KPP, Staffing, and others, into a small set of drivers of variance. PCA can provide visibility to what are the connections between EAC growth and the source of that growth in a statistically sound manner, not currently available with IPMR reporting using CPI/SPI
  • 18. +What Can PCA Tell Us? n  If data lies in high dimensional space (more than just CPI/SPI), then large amount of data is required to learn distributions or decision rules. n  For each WBS element 9 dimensions (CPI, SPI,WBS, TPM, MOE, MOP, KPP, Risk, Staffing Profiles). n  Each dimension has 36 levels (36 months of data). n  We could produce a 9 dimension scatter plot for the 36 months of data and it’d look like a big blob. n  We need to know what are the drivers in this Blob of data? With “all” the data in a single place – which it is not – we need a way to reduce the dimensionality to provide analysis 18
  • 19. +From 2 Dimensions (SPI/CPI) to 8 Dimensions and Back Again n  Two components, for example – SPI and CPI n  Discover the correlation between these two data samples n  Locate in the individual samples the time the drivers started impacting the program n  Extend this to 8 dimensions n  Similar to Joint Confidence Level, but with actual data 19 PCi=a1X1 + a2X2 + a3X3 + … + a8K8
  • 20. +Program Performance Dimensions PCA data can be simple 2 dimensional – CPI/SPI or more complex and represent other “attributes” driving EAC 20 Variable Information that mat drive Unanticipated EAC CPI/SPI CPI for program, time phased by reporting period TPM Technical Performance Measures, with control bands as program moves left to right.These can be any measure technical compliance §  Weight §  Throughput §  Information Assurance validation §  Any of the JROC KPPs Risk Risk §  Risk retirement buy down plan §  Risk handling and planned reduction Margin Cost and schedule margin burn down to plan
  • 21. +Call to Action for increased visibility to Unanticipated EAC Growth using BIG Data 21 Normalize data in the Central Repository in preparation for analysis Apply ARIMA to normalized data to forecast CPI, SPI, and Calculated EAC Adjust ARIMA parameters using past performance compliance Integrate external data with EV repository data to build correlations the EAC forecasts Apply Principal Component Analysis (PCA) to identify correlated drivers of EAC growth