SlideShare a Scribd company logo
Probability & Statistics
for Engineers & Scientists
Authors: Walpole, Myers, Myers, YE
Instructor:
NADEEM KHAN
Lecturer, S & H Dept.,
FAST-NU, Main Campus, Karachi
Nadeem.arif@nu.edu.pk
Text Book

Reference
Book 
MT206 – Probability &
Statistics
(4 Credit Hours)
Week Topics
01 Intro. To Statistics, Measures of Central Tendency &
Dispersion
02 Bar Chart, Histogram, Stem-Leaf Plot, Box Plot, Dot Plot,
Frequency Curves, Ogive, Skewness & Kurtosis
03 Introduction to Probability: Sample Space, Tree
Diagram, Event, Set Theory, Venn Diagram
04 Counting techniques, Kinds of Events, Additive rules
05 Conditional Probability, Independence, Multiplicative
rules, Bayes’ Theorem.
06 1st
Mid-Term Examination
Course Outline:
MT206 – Probability &
Statistics
(4 Credit Hours)
Wee
k
Topics
07 Random Variables & Probability Distributions: PMF,
PDF, CDF, Joint & Marginal Probability Distributions,
Mathematical Expectation
08 Discrete Distributions: Binomial & Multinomial, Poisson,
Geometric, Hypergeometric, and Discrete uniform.
09 Continuous Distributions: Normal, Exponential, Uniform,
Chi-Square
10 Testing of Hypothesis: z-test, t-test
11 Goodness of Fit Test, Chi-Square
test of Independence
12 2nd
Mid-Term Examination
Course Outline
(Contd.)
MT206 – Probability & Statistics
(4 Credit Hours)
 Note: The above course outline & schedule is
tentative.
Wee
k
Topics
13 Correlation & Regression
14 Non-Linear Regression: Polynomial regression
15 ANOVA
16 Final Examination
Course Outline
(Contd.)
Marks Distribution
S. No. Particulars % Marks
01 Assignments 10
02 Quizzes 10
03 1st
Mid Term 15
04 2nd
Mid Term 15
05 Final Exam 50
Total 100
Important Instructions
 Be in the classroom on time.
 All students are required to maintain 80% of attendance. In case students
fail to maintain 80% of attendance, they become ineligible to take the final
exam.
 Turn off your cell phones or any other electronic devices before
entering the class.
 Maintain the decorum of the class room all the time.
 Avoid conversation with your classmates while the lecture is in progress.
 Submit your assignments on time otherwise marks will be deducted after
deadline.
Important Instructions
(Contd.)
 Assignment should include a title page
consisting of your complete Name, Roll
No, Subject Name and date etc.
 Assignment should be submitted in the
Holes clip punch folder (snap attached).
 Incomplete assignments lead to
reduction in marks.
 Avoid plagiarism.
 For Quizzes bring your own loose
pages.
 Violation of any instructions leads to
a reduction in marks.
Introduction and Data Collection
PROBABILITY AND
STATISTICS
Learning Objectives
In this topic you learn:
 How Statistics is used in business
 The sources of data used in business
 The types of data used in business
 The basics of Microsoft Excel
 The basics of Minitab
Why Learn Statistics?
So you are able to make better sense of the
ubiquitous use of numbers:
 Business memos
 Business research
 Technical reports
 Technical journals
 Newspaper articles
 Magazine articles
What is statistics?
 A branch of mathematics taking and
transforming numbers into useful information for
decision makers
 Methods for processing & analyzing numbers
 Methods for helping reduce the uncertainty
inherent in decision making
Why Study Statistics?
Decision Makers Use Statistics To:
 Present and describe business data and information properly
 Draw conclusions about large groups of individuals or items,
using information collected from subsets of the individuals or
items.
 Make reliable forecasts about a business activity
 Improve business processes
Types of Statistics
 Statistics
 The branch of mathematics that transforms data into
useful information for decision makers.
Descriptive Statistics
Collecting, summarizing, and
describing data
Inferential Statistics
Drawing conclusions and/or
making decisions concerning a
population based only on sample
data
Descriptive Statistics
 Collect data
 e.g., Survey
 Present data
 e.g., Tables and graphs
 Characterize data

e.g., Sample mean = i
X
n

Inferential Statistics
 Estimation
 e.g., Estimate the population
mean weight using the sample
mean weight
 Hypothesis testing
 e.g., Test the claim that the
population mean weight is 120
pounds
Drawing conclusions about a large group of
individuals based on a subset of the large group.
Basic Vocabulary of Statistics
VARIABLE
A variable is a characteristic of an item or individual.
DATA
Data are the different values associated with a variable.
OPERATIONAL DEFINITIONS
Data values are meaningless unless their variables have operational
definitions, universally accepted meanings that are clear to all associated
with an analysis.
Basic Vocabulary of Statistics
POPULATION
A population consists of all the items or individuals about which
you want to draw a conclusion.
SAMPLE
A sample is the portion of a population selected for analysis.
PARAMETER
A parameter is a numerical measure that describes a
characteristic of a population.
STATISTIC
A statistic is a numerical measure that describes a characteristic of
a sample.
Population vs. Sample
Population Sample
Measures used to describe the
population are called parameters
Measures computed from
sample data are called statistics
Why Collect Data?
 A marketing research analyst needs to assess the
effectiveness of a new television advertisement.
 A pharmaceutical manufacturer needs to determine
whether a new drug is more effective than those currently
in use.
 An operations manager wants to monitor a manufacturing
process to find out whether the quality of the product
being manufactured is conforming to company standards.
 An auditor wants to review the financial transactions of a
company in order to determine whether the company is in
compliance with generally accepted accounting
principles.
Sources of Data
 Primary Sources: The data collector is the one using the data
for analysis
 Data from a political survey
 Data collected from an experiment
 Observed data
 Secondary Sources: The person performing data analysis is
not the data collector
 Analyzing census data
 Examining data from print journals or data published on the internet.
Sources of data fall into four
categories
 Data distributed by an organization or an
individual
 A designed experiment
 A survey
 An observational study
Types of Variables
 Categorical (qualitative) variables have values that
can only be placed into categories, such as “yes” and
“no.”
 Numerical (quantitative) variables have values that
represent quantities.
Types of Data
Data
Discrete
Continuou
s
Examples:
 Marital Status
 Political Party
 Eye Color
(Defined categories)
Examples:
 Number of Children
 Defects per hour
(Counted items)
Examples:
 Weight
 Voltage
(Measured characteristics)
Levels of Measurement
 A nominal scale classifies data into distinct categories in
which no ranking is implied.
Categorical Variables Categories
Personal Computer
Ownership
Type of Stocks Owned
Internet Provider
Yes / No
Microsoft Network / AOL/ Other
Growth Value Other
Levels of Measurement
 An ordinal scale classifies data into distinct categories
in which ranking is implied
Categorical Variable Ordered Categories
Student class designation Freshman, Sophomore, Junior,
Senior
Product satisfaction Satisfied, Neutral, Unsatisfied
Faculty rank Professor, Associate Professor,
Assistant Professor, Instructor
Standard & Poor’s bond ratings AAA, AA, A, BBB, BB, B, CCC, CC,
C, DDD, DD, D
Student Grades A, B, C, D, F
Levels of Measurement
 An interval scale is an ordered scale in which the difference
between measurements is a meaningful quantity but the
measurements do not have a true zero point.
 A ratio scale is an ordered scale in which the difference
between the measurements is a meaningful quantity and the
measurements have a true zero point.
Interval and Ratio Scales
Chapter Summary
 Reviewed why a manager needs to know statistics
 Introduced key definitions:
 Population vs. Sample
 Primary vs. Secondary data types
 Categorical vs. Numerical data
 Examined descriptive vs. inferential statistics
 Reviewed data types and measurement levels
In this chapter, we have

More Related Content

PPT
Data Analysis
PPT
Analyzing survey data
PDF
Data analysis using spss
PPTX
Business statistics
PPTX
Research Method chapter 6.pptx
DOCX
Assignment 2 RA Annotated BibliographyIn your final paper for .docx
PPT
Business Research Methods. problem definition literature review and qualitati...
Data Analysis
Analyzing survey data
Data analysis using spss
Business statistics
Research Method chapter 6.pptx
Assignment 2 RA Annotated BibliographyIn your final paper for .docx
Business Research Methods. problem definition literature review and qualitati...

Similar to LECTURE 1 STATISTICS for data analytics and machine learning (20)

PPT
business research process, design and proposal
PPTX
Unit 1 pptx.pptx
PPTX
Understanding the Different Scales of Measurement with Examples
PPTX
Data Analysis - Approach & Techniques
PPTX
introductiontobusinessstatistics-anithanew.pptx
PDF
A step-by-step guide for conducting statistical data analysis
PDF
Defining Constituents, Data Vizzes and Telling a Data Story
PPTX
1Elementary Statistics1Elementary Statistics
PPT
Stat11t Chapter1
PPT
Stat11t chapter1
PDF
TESTBANK For Business Statistics 3e Canadian Black
PPTX
SUSLA IE Process: Writing Assessment Results
PPTX
Topic-6-Finding-the-Answers-to-the-Research-Questions-Interpretation-and-Pres...
PPTX
Practical Research 2.pptx
DOCX
Need a nonplagiarised paper and a form completed by 1006015 before.docx
PPTX
Research and Statistics Report- Estonio, Ryan.pptx
PPTX
Denise Rousseau's Generic EBMgt Class 4
PPT
Lecture-1 Introduction to statistics.ppt
PPTX
Finding the answers to the research questions.pptx
PPTX
Data Processing & Explain each term in details.pptx
business research process, design and proposal
Unit 1 pptx.pptx
Understanding the Different Scales of Measurement with Examples
Data Analysis - Approach & Techniques
introductiontobusinessstatistics-anithanew.pptx
A step-by-step guide for conducting statistical data analysis
Defining Constituents, Data Vizzes and Telling a Data Story
1Elementary Statistics1Elementary Statistics
Stat11t Chapter1
Stat11t chapter1
TESTBANK For Business Statistics 3e Canadian Black
SUSLA IE Process: Writing Assessment Results
Topic-6-Finding-the-Answers-to-the-Research-Questions-Interpretation-and-Pres...
Practical Research 2.pptx
Need a nonplagiarised paper and a form completed by 1006015 before.docx
Research and Statistics Report- Estonio, Ryan.pptx
Denise Rousseau's Generic EBMgt Class 4
Lecture-1 Introduction to statistics.ppt
Finding the answers to the research questions.pptx
Data Processing & Explain each term in details.pptx
Ad

Recently uploaded (20)

PPTX
Leprosy and NLEP programme community medicine
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
annual-report-2024-2025 original latest.
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Introduction to Data Science and Data Analysis
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Predictive modeling basics in data cleaning process
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Database Infoormation System (DBIS).pptx
Leprosy and NLEP programme community medicine
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
annual-report-2024-2025 original latest.
.pdf is not working space design for the following data for the following dat...
STUDY DESIGN details- Lt Col Maksud (21).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Optimise Shopper Experiences with a Strong Data Estate.pdf
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction to Data Science and Data Analysis
Galatica Smart Energy Infrastructure Startup Pitch Deck
Predictive modeling basics in data cleaning process
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Introduction-to-Cloud-ComputingFinal.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Database Infoormation System (DBIS).pptx
Ad

LECTURE 1 STATISTICS for data analytics and machine learning

  • 1. Probability & Statistics for Engineers & Scientists Authors: Walpole, Myers, Myers, YE Instructor: NADEEM KHAN Lecturer, S & H Dept., FAST-NU, Main Campus, Karachi Nadeem.arif@nu.edu.pk
  • 4. MT206 – Probability & Statistics (4 Credit Hours) Week Topics 01 Intro. To Statistics, Measures of Central Tendency & Dispersion 02 Bar Chart, Histogram, Stem-Leaf Plot, Box Plot, Dot Plot, Frequency Curves, Ogive, Skewness & Kurtosis 03 Introduction to Probability: Sample Space, Tree Diagram, Event, Set Theory, Venn Diagram 04 Counting techniques, Kinds of Events, Additive rules 05 Conditional Probability, Independence, Multiplicative rules, Bayes’ Theorem. 06 1st Mid-Term Examination Course Outline:
  • 5. MT206 – Probability & Statistics (4 Credit Hours) Wee k Topics 07 Random Variables & Probability Distributions: PMF, PDF, CDF, Joint & Marginal Probability Distributions, Mathematical Expectation 08 Discrete Distributions: Binomial & Multinomial, Poisson, Geometric, Hypergeometric, and Discrete uniform. 09 Continuous Distributions: Normal, Exponential, Uniform, Chi-Square 10 Testing of Hypothesis: z-test, t-test 11 Goodness of Fit Test, Chi-Square test of Independence 12 2nd Mid-Term Examination Course Outline (Contd.)
  • 6. MT206 – Probability & Statistics (4 Credit Hours)  Note: The above course outline & schedule is tentative. Wee k Topics 13 Correlation & Regression 14 Non-Linear Regression: Polynomial regression 15 ANOVA 16 Final Examination Course Outline (Contd.)
  • 7. Marks Distribution S. No. Particulars % Marks 01 Assignments 10 02 Quizzes 10 03 1st Mid Term 15 04 2nd Mid Term 15 05 Final Exam 50 Total 100
  • 8. Important Instructions  Be in the classroom on time.  All students are required to maintain 80% of attendance. In case students fail to maintain 80% of attendance, they become ineligible to take the final exam.  Turn off your cell phones or any other electronic devices before entering the class.  Maintain the decorum of the class room all the time.  Avoid conversation with your classmates while the lecture is in progress.  Submit your assignments on time otherwise marks will be deducted after deadline.
  • 9. Important Instructions (Contd.)  Assignment should include a title page consisting of your complete Name, Roll No, Subject Name and date etc.  Assignment should be submitted in the Holes clip punch folder (snap attached).  Incomplete assignments lead to reduction in marks.  Avoid plagiarism.  For Quizzes bring your own loose pages.  Violation of any instructions leads to a reduction in marks.
  • 10. Introduction and Data Collection PROBABILITY AND STATISTICS
  • 11. Learning Objectives In this topic you learn:  How Statistics is used in business  The sources of data used in business  The types of data used in business  The basics of Microsoft Excel  The basics of Minitab
  • 12. Why Learn Statistics? So you are able to make better sense of the ubiquitous use of numbers:  Business memos  Business research  Technical reports  Technical journals  Newspaper articles  Magazine articles
  • 13. What is statistics?  A branch of mathematics taking and transforming numbers into useful information for decision makers  Methods for processing & analyzing numbers  Methods for helping reduce the uncertainty inherent in decision making
  • 14. Why Study Statistics? Decision Makers Use Statistics To:  Present and describe business data and information properly  Draw conclusions about large groups of individuals or items, using information collected from subsets of the individuals or items.  Make reliable forecasts about a business activity  Improve business processes
  • 15. Types of Statistics  Statistics  The branch of mathematics that transforms data into useful information for decision makers. Descriptive Statistics Collecting, summarizing, and describing data Inferential Statistics Drawing conclusions and/or making decisions concerning a population based only on sample data
  • 16. Descriptive Statistics  Collect data  e.g., Survey  Present data  e.g., Tables and graphs  Characterize data  e.g., Sample mean = i X n 
  • 17. Inferential Statistics  Estimation  e.g., Estimate the population mean weight using the sample mean weight  Hypothesis testing  e.g., Test the claim that the population mean weight is 120 pounds Drawing conclusions about a large group of individuals based on a subset of the large group.
  • 18. Basic Vocabulary of Statistics VARIABLE A variable is a characteristic of an item or individual. DATA Data are the different values associated with a variable. OPERATIONAL DEFINITIONS Data values are meaningless unless their variables have operational definitions, universally accepted meanings that are clear to all associated with an analysis.
  • 19. Basic Vocabulary of Statistics POPULATION A population consists of all the items or individuals about which you want to draw a conclusion. SAMPLE A sample is the portion of a population selected for analysis. PARAMETER A parameter is a numerical measure that describes a characteristic of a population. STATISTIC A statistic is a numerical measure that describes a characteristic of a sample.
  • 20. Population vs. Sample Population Sample Measures used to describe the population are called parameters Measures computed from sample data are called statistics
  • 21. Why Collect Data?  A marketing research analyst needs to assess the effectiveness of a new television advertisement.  A pharmaceutical manufacturer needs to determine whether a new drug is more effective than those currently in use.  An operations manager wants to monitor a manufacturing process to find out whether the quality of the product being manufactured is conforming to company standards.  An auditor wants to review the financial transactions of a company in order to determine whether the company is in compliance with generally accepted accounting principles.
  • 22. Sources of Data  Primary Sources: The data collector is the one using the data for analysis  Data from a political survey  Data collected from an experiment  Observed data  Secondary Sources: The person performing data analysis is not the data collector  Analyzing census data  Examining data from print journals or data published on the internet.
  • 23. Sources of data fall into four categories  Data distributed by an organization or an individual  A designed experiment  A survey  An observational study
  • 24. Types of Variables  Categorical (qualitative) variables have values that can only be placed into categories, such as “yes” and “no.”  Numerical (quantitative) variables have values that represent quantities.
  • 25. Types of Data Data Discrete Continuou s Examples:  Marital Status  Political Party  Eye Color (Defined categories) Examples:  Number of Children  Defects per hour (Counted items) Examples:  Weight  Voltage (Measured characteristics)
  • 26. Levels of Measurement  A nominal scale classifies data into distinct categories in which no ranking is implied. Categorical Variables Categories Personal Computer Ownership Type of Stocks Owned Internet Provider Yes / No Microsoft Network / AOL/ Other Growth Value Other
  • 27. Levels of Measurement  An ordinal scale classifies data into distinct categories in which ranking is implied Categorical Variable Ordered Categories Student class designation Freshman, Sophomore, Junior, Senior Product satisfaction Satisfied, Neutral, Unsatisfied Faculty rank Professor, Associate Professor, Assistant Professor, Instructor Standard & Poor’s bond ratings AAA, AA, A, BBB, BB, B, CCC, CC, C, DDD, DD, D Student Grades A, B, C, D, F
  • 28. Levels of Measurement  An interval scale is an ordered scale in which the difference between measurements is a meaningful quantity but the measurements do not have a true zero point.  A ratio scale is an ordered scale in which the difference between the measurements is a meaningful quantity and the measurements have a true zero point.
  • 30. Chapter Summary  Reviewed why a manager needs to know statistics  Introduced key definitions:  Population vs. Sample  Primary vs. Secondary data types  Categorical vs. Numerical data  Examined descriptive vs. inferential statistics  Reviewed data types and measurement levels In this chapter, we have