SlideShare a Scribd company logo
2
Most read
3
Most read
8
Most read
Statistical Modeling
Dr. Balamurugan M
Associate Professor
Acharya Institute of Graduate Studies
Bangalore
Statistical Model
A statistical model is a type of mathematical model that comprises of the assumptions
undertaken to describe the data generation process.
Let us focus on the two highlighted terms above:
Type of mathematical model
Statistical model is non-deterministic unlike other mathematical models where variables
have specific values. Variables in statistical models are stochastic i.e. they have probability
distributions.
Assumptions
But how do those assumptions help us understand the properties or characteristics of the
true data? Simply put, these assumptions make it easy to calculate the probability of an
event.
Need of Statistical Model
The statistical model plays a fundamental role in carrying out statistical inference which
helps in making propositions about the unknown properties and characteristics of the
population as below:
• Estimation
• Confidence Interval
• Hypothesis Testing
Estimation
 It is the central idea behind Machine Learning i.e. finding out the number which can
estimate the parameters of distribution.
 Note that the estimator is a random variable in itself, whereas an estimate is a single
number which gives us an idea of the distribution of the data generation process. For
example, the mean and sigma of Gaussian distribution.
Confidence Interval
 It gives an error bar around the single estimate number i.e. a range of values to signify the
confidence in the estimate arrived on the basis of a number of samples.
Hypothesis Testing
 It is a statement of finding statistical evidence. Let’s further understand the need to
perform statistical modeling with the help of an example below:
Aspect Statistical Modeling Mathematical Modeling
Focus Captures relationships and patterns in data. Represents real-world situations using equations.
Data Usage Utilizes empirical data to build models. Often uses theoretical or assumed data.
Assumptions Models may rely on assumptions about data distribution. Relies on assumptions about relationships between variables.
Goal Inference, hypothesis testing, understanding relationships. Solving complex problems through mathematical equations.
Applications Predictive analytics, decision-making, hypothesis testing. Physical sciences, engineering, economic models.
Model Complexity Can handle complex real-world patterns and noise. Can represent intricate systems and interactions.
Interpretability Often provides insights into data relationships. Focuses on understanding mathematical relationships.
Variables Incorporates real data variables and interactions. Utilizes mathematical variables and constants.
Validation Involves testing against empirical data. Validates against theoretical results or experiments.
Example Linear regression, ANOVA. Differential equations, optimization models.
Statistical Modeling Vs Mathematical Modeling
Uses of Statistical Modeling
Statistical modeling in data science is invaluable in various contexts:
 Exploratory Data Analysis: At the outset of a project, statistical models help identify
trends, outliers, and relationships within the dataset, setting the stage for further analysis.
 Hypothesis Testing: When you have a research question or hypothesis, statistical models
facilitate rigorous testing, confirming or refuting assumptions.
 Feature Selection: Statistical modeling aids in choosing relevant features for predictive
models, enhancing model accuracy and interpretability.
 Regression Analysis: When exploring relationships between variables, regression models
reveal how one variable influences another, enabling predictions and insights.
 Classification: Statistical models assist in classifying data into distinct categories, essential
for tasks like sentiment analysis or disease diagnosis.
 Anomaly Detection: Statistical models uncover unusual patterns, anomalies, or outliers
in data, crucial for fraud detection or quality control.
 Time Series Forecasting: For data with a temporal component, statistical models forecast
future values, aiding in inventory management and financial predictions.
 Segmentation Analysis: Models divide data into clusters based on similarities, enhancing
customer segmentation and personalized marketing.
 Predictive Modeling: In machine learning, statistical models predict outcomes based on
historical data, essential for business forecasts and decision support.

More Related Content

PPTX
GRAPHS BIOSTATICS BPHARM 8 SEM UNIT 1 & 3.pptx
PPS
Scatter Plot
PPTX
Regression analysis
PDF
Multiple linear regression
PPTX
Therapeutic drug monitoring
PPTX
Optimisation technique
PPTX
PPT
Measurement of outcomes in pharacoepidemiology
GRAPHS BIOSTATICS BPHARM 8 SEM UNIT 1 & 3.pptx
Scatter Plot
Regression analysis
Multiple linear regression
Therapeutic drug monitoring
Optimisation technique
Measurement of outcomes in pharacoepidemiology

What's hot (20)

PDF
Chapter 2 part3-Least-Squares Regression
PPTX
TYPES OF GRAPH & FLOW CHART
PDF
Simple & Multiple Regression Analysis
PPT
R for Statistical Computing
PDF
PKPD seminar
PPTX
Optimum design 2019 20
PPT
Regression analysis
PDF
Bayesian theory : @ RxVichuZ!! ;)
PPTX
Designing the methodology - B.Pharm
PDF
Pharmacovigilance Planning
PDF
Autocorrelation
PDF
Cox model
PPT
Optimization techniques
PPTX
one compartment model ppt
PPTX
Spss and software Application
PPTX
Introduction to statistical software R
PDF
Testing of hypothesis
PPTX
Basic terminologies used in pharmacovigilance.pptx
PPTX
Application of Excel and SPSS software for statistical analysis- Biostatistic...
PPTX
Experimental design techniques
Chapter 2 part3-Least-Squares Regression
TYPES OF GRAPH & FLOW CHART
Simple & Multiple Regression Analysis
R for Statistical Computing
PKPD seminar
Optimum design 2019 20
Regression analysis
Bayesian theory : @ RxVichuZ!! ;)
Designing the methodology - B.Pharm
Pharmacovigilance Planning
Autocorrelation
Cox model
Optimization techniques
one compartment model ppt
Spss and software Application
Introduction to statistical software R
Testing of hypothesis
Basic terminologies used in pharmacovigilance.pptx
Application of Excel and SPSS software for statistical analysis- Biostatistic...
Experimental design techniques
Ad

Similar to Statistical Modeling in Research_Dr.Balamurugan .pdf (20)

DOCX
Datascience
DOCX
datascience.docx
PPTX
Introduction-to-Data-Analysis_Final Content.pptx
DOCX
Descriptive and Inferential Statistics.docx
PPTX
abdi research ppt.pptx
PPTX
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
PPTX
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
PDF
Exploring Bias in Data Analysis : How to Detect and Prevent It
PPT
Pentaho Meeting 2008 - Statistics & BI
PPTX
Statistical_Tools_in_ML_Presentation.pptx
DOCX
Exam Short Preparation on Data Analytics
PPTX
Data science notes for ASDS calicut 2.pptx
PPT
A review of statistics
PPT
Edison S Statistics
PPT
Edisons Statistics
PPTX
Data Science research methodology & processes
PPTX
Presentation of BRM.pptx
PPTX
Pharmacokinetic pharmacodynamic modeling
PDF
Statistical modeling in pharmaceutical research and development
Datascience
datascience.docx
Introduction-to-Data-Analysis_Final Content.pptx
Descriptive and Inferential Statistics.docx
abdi research ppt.pptx
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Exploring Bias in Data Analysis : How to Detect and Prevent It
Pentaho Meeting 2008 - Statistics & BI
Statistical_Tools_in_ML_Presentation.pptx
Exam Short Preparation on Data Analytics
Data science notes for ASDS calicut 2.pptx
A review of statistics
Edison S Statistics
Edisons Statistics
Data Science research methodology & processes
Presentation of BRM.pptx
Pharmacokinetic pharmacodynamic modeling
Statistical modeling in pharmaceutical research and development
Ad

More from Dr. Balamurugan M (13)

PDF
Machine Learning Basics_Dr.Balamurugan.pdf
PDF
Database and Database Users_Dr.Balamurugan M.pdf
PPTX
Structure and Components of research report_Pavan.pptx
PPTX
Regression analysis in Research Methodology.pptx
PDF
Correlation in Statistical Analysis .pdf
PDF
Concept of Regression in Research Methodology.pdf
PDF
Preparing a Research Report in Research Process.pdf
PDF
Interpretation in Research Methodology.pdf
PPTX
Machine Learning Life Cycle_Dr.Balamurugan.pptx
PPTX
Defining a Research Problem_Dr.Balamurugan.pptx
PDF
Dr. Balamurugan_Research Process_Bala.pdf
PPTX
Dr.Balamurugan_Fundamentals_of_Computer.pptx
PPTX
Classifications of OS.pptx
Machine Learning Basics_Dr.Balamurugan.pdf
Database and Database Users_Dr.Balamurugan M.pdf
Structure and Components of research report_Pavan.pptx
Regression analysis in Research Methodology.pptx
Correlation in Statistical Analysis .pdf
Concept of Regression in Research Methodology.pdf
Preparing a Research Report in Research Process.pdf
Interpretation in Research Methodology.pdf
Machine Learning Life Cycle_Dr.Balamurugan.pptx
Defining a Research Problem_Dr.Balamurugan.pptx
Dr. Balamurugan_Research Process_Bala.pdf
Dr.Balamurugan_Fundamentals_of_Computer.pptx
Classifications of OS.pptx

Recently uploaded (20)

PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
advance database management system book.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
RMMM.pdf make it easy to upload and study
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PDF
Trump Administration's workforce development strategy
PDF
IGGE1 Understanding the Self1234567891011
PPTX
UNIT III MENTAL HEALTH NURSING ASSESSMENT
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PDF
Indian roads congress 037 - 2012 Flexible pavement
PPTX
Introduction to Building Materials
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Digestion and Absorption of Carbohydrates, Proteina and Fats
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
advance database management system book.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Chinmaya Tiranga quiz Grand Finale.pdf
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
RMMM.pdf make it easy to upload and study
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
Trump Administration's workforce development strategy
IGGE1 Understanding the Self1234567891011
UNIT III MENTAL HEALTH NURSING ASSESSMENT
Final Presentation General Medicine 03-08-2024.pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Hazard Identification & Risk Assessment .pdf
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
Indian roads congress 037 - 2012 Flexible pavement
Introduction to Building Materials
Chinmaya Tiranga Azadi Quiz (Class 7-8 )

Statistical Modeling in Research_Dr.Balamurugan .pdf

  • 1. Statistical Modeling Dr. Balamurugan M Associate Professor Acharya Institute of Graduate Studies Bangalore
  • 2. Statistical Model A statistical model is a type of mathematical model that comprises of the assumptions undertaken to describe the data generation process. Let us focus on the two highlighted terms above: Type of mathematical model Statistical model is non-deterministic unlike other mathematical models where variables have specific values. Variables in statistical models are stochastic i.e. they have probability distributions.
  • 3. Assumptions But how do those assumptions help us understand the properties or characteristics of the true data? Simply put, these assumptions make it easy to calculate the probability of an event. Need of Statistical Model The statistical model plays a fundamental role in carrying out statistical inference which helps in making propositions about the unknown properties and characteristics of the population as below: • Estimation • Confidence Interval • Hypothesis Testing
  • 4. Estimation  It is the central idea behind Machine Learning i.e. finding out the number which can estimate the parameters of distribution.  Note that the estimator is a random variable in itself, whereas an estimate is a single number which gives us an idea of the distribution of the data generation process. For example, the mean and sigma of Gaussian distribution. Confidence Interval  It gives an error bar around the single estimate number i.e. a range of values to signify the confidence in the estimate arrived on the basis of a number of samples.
  • 5. Hypothesis Testing  It is a statement of finding statistical evidence. Let’s further understand the need to perform statistical modeling with the help of an example below:
  • 6. Aspect Statistical Modeling Mathematical Modeling Focus Captures relationships and patterns in data. Represents real-world situations using equations. Data Usage Utilizes empirical data to build models. Often uses theoretical or assumed data. Assumptions Models may rely on assumptions about data distribution. Relies on assumptions about relationships between variables. Goal Inference, hypothesis testing, understanding relationships. Solving complex problems through mathematical equations. Applications Predictive analytics, decision-making, hypothesis testing. Physical sciences, engineering, economic models. Model Complexity Can handle complex real-world patterns and noise. Can represent intricate systems and interactions. Interpretability Often provides insights into data relationships. Focuses on understanding mathematical relationships. Variables Incorporates real data variables and interactions. Utilizes mathematical variables and constants. Validation Involves testing against empirical data. Validates against theoretical results or experiments. Example Linear regression, ANOVA. Differential equations, optimization models. Statistical Modeling Vs Mathematical Modeling
  • 7. Uses of Statistical Modeling Statistical modeling in data science is invaluable in various contexts:  Exploratory Data Analysis: At the outset of a project, statistical models help identify trends, outliers, and relationships within the dataset, setting the stage for further analysis.  Hypothesis Testing: When you have a research question or hypothesis, statistical models facilitate rigorous testing, confirming or refuting assumptions.  Feature Selection: Statistical modeling aids in choosing relevant features for predictive models, enhancing model accuracy and interpretability.  Regression Analysis: When exploring relationships between variables, regression models reveal how one variable influences another, enabling predictions and insights.
  • 8.  Classification: Statistical models assist in classifying data into distinct categories, essential for tasks like sentiment analysis or disease diagnosis.  Anomaly Detection: Statistical models uncover unusual patterns, anomalies, or outliers in data, crucial for fraud detection or quality control.  Time Series Forecasting: For data with a temporal component, statistical models forecast future values, aiding in inventory management and financial predictions.  Segmentation Analysis: Models divide data into clusters based on similarities, enhancing customer segmentation and personalized marketing.  Predictive Modeling: In machine learning, statistical models predict outcomes based on historical data, essential for business forecasts and decision support.