SlideShare a Scribd company logo
1
INPUT DATA COLLECTION AND
ANALYSIS
Senir Justin
BITF20M550
Introduction
2
Input data includes gathering, studying, and utilizing input data
in the simulation method
The collection may be gathered from any source.
 A study of input data shows the theoretical distribution of data
the practitioner only gathers a sample of the actual data
distribution when collecting data..
Collecting Input Data
3
There are many ways to collect input data like the following:
 Historical records
 Manufacturer specifications
 Vendor claims
 Operator estimates
 Management estimates
 Automatic data capture
 Direct observation
Collecting Input Data
4
 Input data may be collected manually or with the assistance of
electronic devices
 It is the most difficult part of the simulation process
 While colleting input data, there are different classifications of data
Classification of Data
5
There are two methods for the classification of data:
 Deterministic or probabilistic
 Discrete or continuous
Deterministic /Probabilistic Data
6
Deterministic Data
 Deterministic data are those in which the event affecting the
data occurs consistently or predictably.
Probabilistic Input Data
 A probabilistic process does not occur with the same type of
regularity.
 This implies that since the value of this type of data never
changes, it only has to be gathered once.
Discrete/Continuous Data
7
Discrete Data
 It can take only certain values. Usually, this means a whole number.
 The number of students in class is an example
Continuous Data
 It can take any value in the observed range. This means that
fractional numbers are a definite possibility
 Height of children is an example
Input Data Distributions
8
Bernoulli Distribution:
 Models a random occurrence with one of two possible
outcomes
 Frequently referred to as a success or failure
Uniform Distribution:
 It can be used as a first cut for modeling the input data
 It may be either discrete or continuous
Exponential Distribution :
 Commonly utilized in conjunction with interarrival processes
 Random no. of entities will arrive within a specific time
Input Data Distributions
9
Triangle Distribution:
 Used in situations where the practitioner does not have complete
knowledge about the system
 It has only three parameters:
i) Minimum Possible Value
ii) Most Common Value
iii) Maximum Possible Value
Less Common Distributions
10
Geometric distributions:
The geometric distribution gives the probability of achieving success after
N number of failures
 It is discrete which means that distribution must be whole number
Weibull Distribution:
The Weibull distribution is often used to represent distributions that cannot
have values less than zero
 It has two parameters
Analyzing Input Data
11
Graphics Approach:
Graphic approach is the most fundamental approach to attempting to fit
input data
 It consists of visual qualitative comparison between actual and
theoretical data distribution
Chi-Square:
The chi-square test is based on the comparison of the actual number of
observations
versus the expected number of observations
Commonly accepted as preferred goodness fit technique
Kolmogorov–Smirnov:
The KS test should be utilized only when the number of data points is
Software Implementations for
Data Fitting
12
 Fitting a significant no. of observed data sets to theoretical
distributions is a time consuming task
 For this purpose, practitioners use data-fitting software
 The following two are frequently used to carry out this function:
1. Arena input analyzer
2. Expert fit
Arena input analyzer
13
 Input analyzer is part of ARENA simulation software package
available from Rockwell software
 It has the capability to:
1. Determine the quality of fit of probability distribution functions to
input data
2. Examine a total of 15 distributions for data fitting
3. Calculate Chi-square, KS and square error tests
4. Generate high-quality data plots
Expert Fit
14
 This software is available through Averil M. Law & Associates
 This software has the capability to:
1. Automatically determine best probability distribution for data
sets
2. Fits 40 distributions
3. Conduct Chi-sqaure, KS and Anderson-Darling goodness of fit
tests
4. Provide high-quality plots
5. Analyze a large no. of data sets in batch mode
15
THANK YOU

More Related Content

PPTX
Engineering Data Analysis-ProfCharlton
PPTX
Unit 6 input modeling
PPTX
model Building_updated_by_me_in_the_year.pptx
PPTX
CPSC 531: System Modeling and Simulation.pptx
PPTX
Simulation Model Building from the fourth
PPTX
SMS Module-4(theory) ppt.pptx
PPT
SMA (INPUT MODELLING).ppt
PDF
Chapter-Four.pdf
Engineering Data Analysis-ProfCharlton
Unit 6 input modeling
model Building_updated_by_me_in_the_year.pptx
CPSC 531: System Modeling and Simulation.pptx
Simulation Model Building from the fourth
SMS Module-4(theory) ppt.pptx
SMA (INPUT MODELLING).ppt
Chapter-Four.pdf

Similar to Input Data Collection and Analysis.pptx (20)

PPT
Chap_05_Data_Collection_and_Analysis.ppt
PDF
Descriptive Analytics: Data Reduction
PPTX
Nota Bab 1 JF608
PPTX
Know Your Data in data mining applications
PPTX
lecture 1 Slides.pptx
PPTX
Statistics and optimization (1)
PPT
ch09-Simulation.ppt
PPTX
Lect1.pptxdglsgldjtzjgd csjfsjtskysngfkgfhxvxfhhdhz
PDF
M4D-v0.4.pdf
PPTX
Types of Data in Machine Learning, Number aand Categorical
PPTX
BU_FCAI_SCC430_Modeling&Simulation_Ch06.pptx
PDF
03-Data-Analysis-Final.pdf
PDF
Big_DM_24_MS_Topic_02_Understanding Data.pdf
PPTX
Statistical Analysis and Hypothesis Tesing
PPTX
Stat-Lesson.pptx
PPT
Data Mining: Concepts and Techniques — Chapter 2 —
PPT
Data mining :Concepts and Techniques Chapter 2, data
PPTX
An Introduction to Simulation in the Social Sciences
PPTX
1. Descriptive statistics.pptx engineering
Chap_05_Data_Collection_and_Analysis.ppt
Descriptive Analytics: Data Reduction
Nota Bab 1 JF608
Know Your Data in data mining applications
lecture 1 Slides.pptx
Statistics and optimization (1)
ch09-Simulation.ppt
Lect1.pptxdglsgldjtzjgd csjfsjtskysngfkgfhxvxfhhdhz
M4D-v0.4.pdf
Types of Data in Machine Learning, Number aand Categorical
BU_FCAI_SCC430_Modeling&Simulation_Ch06.pptx
03-Data-Analysis-Final.pdf
Big_DM_24_MS_Topic_02_Understanding Data.pdf
Statistical Analysis and Hypothesis Tesing
Stat-Lesson.pptx
Data Mining: Concepts and Techniques — Chapter 2 —
Data mining :Concepts and Techniques Chapter 2, data
An Introduction to Simulation in the Social Sciences
1. Descriptive statistics.pptx engineering
Ad

Recently uploaded (20)

PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Computer network topology notes for revision
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
1_Introduction to advance data techniques.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Miokarditis (Inflamasi pada Otot Jantung)
Business Acumen Training GuidePresentation.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Clinical guidelines as a resource for EBP(1).pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Reliability_Chapter_ presentation 1221.5784
Computer network topology notes for revision
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
oil_refinery_comprehensive_20250804084928 (1).pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
STUDY DESIGN details- Lt Col Maksud (21).pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Ad

Input Data Collection and Analysis.pptx

  • 1. 1 INPUT DATA COLLECTION AND ANALYSIS Senir Justin BITF20M550
  • 2. Introduction 2 Input data includes gathering, studying, and utilizing input data in the simulation method The collection may be gathered from any source.  A study of input data shows the theoretical distribution of data the practitioner only gathers a sample of the actual data distribution when collecting data..
  • 3. Collecting Input Data 3 There are many ways to collect input data like the following:  Historical records  Manufacturer specifications  Vendor claims  Operator estimates  Management estimates  Automatic data capture  Direct observation
  • 4. Collecting Input Data 4  Input data may be collected manually or with the assistance of electronic devices  It is the most difficult part of the simulation process  While colleting input data, there are different classifications of data
  • 5. Classification of Data 5 There are two methods for the classification of data:  Deterministic or probabilistic  Discrete or continuous
  • 6. Deterministic /Probabilistic Data 6 Deterministic Data  Deterministic data are those in which the event affecting the data occurs consistently or predictably. Probabilistic Input Data  A probabilistic process does not occur with the same type of regularity.  This implies that since the value of this type of data never changes, it only has to be gathered once.
  • 7. Discrete/Continuous Data 7 Discrete Data  It can take only certain values. Usually, this means a whole number.  The number of students in class is an example Continuous Data  It can take any value in the observed range. This means that fractional numbers are a definite possibility  Height of children is an example
  • 8. Input Data Distributions 8 Bernoulli Distribution:  Models a random occurrence with one of two possible outcomes  Frequently referred to as a success or failure Uniform Distribution:  It can be used as a first cut for modeling the input data  It may be either discrete or continuous Exponential Distribution :  Commonly utilized in conjunction with interarrival processes  Random no. of entities will arrive within a specific time
  • 9. Input Data Distributions 9 Triangle Distribution:  Used in situations where the practitioner does not have complete knowledge about the system  It has only three parameters: i) Minimum Possible Value ii) Most Common Value iii) Maximum Possible Value
  • 10. Less Common Distributions 10 Geometric distributions: The geometric distribution gives the probability of achieving success after N number of failures  It is discrete which means that distribution must be whole number Weibull Distribution: The Weibull distribution is often used to represent distributions that cannot have values less than zero  It has two parameters
  • 11. Analyzing Input Data 11 Graphics Approach: Graphic approach is the most fundamental approach to attempting to fit input data  It consists of visual qualitative comparison between actual and theoretical data distribution Chi-Square: The chi-square test is based on the comparison of the actual number of observations versus the expected number of observations Commonly accepted as preferred goodness fit technique Kolmogorov–Smirnov: The KS test should be utilized only when the number of data points is
  • 12. Software Implementations for Data Fitting 12  Fitting a significant no. of observed data sets to theoretical distributions is a time consuming task  For this purpose, practitioners use data-fitting software  The following two are frequently used to carry out this function: 1. Arena input analyzer 2. Expert fit
  • 13. Arena input analyzer 13  Input analyzer is part of ARENA simulation software package available from Rockwell software  It has the capability to: 1. Determine the quality of fit of probability distribution functions to input data 2. Examine a total of 15 distributions for data fitting 3. Calculate Chi-square, KS and square error tests 4. Generate high-quality data plots
  • 14. Expert Fit 14  This software is available through Averil M. Law & Associates  This software has the capability to: 1. Automatically determine best probability distribution for data sets 2. Fits 40 distributions 3. Conduct Chi-sqaure, KS and Anderson-Darling goodness of fit tests 4. Provide high-quality plots 5. Analyze a large no. of data sets in batch mode