Basics Of Data Analysis
Presented By: Ankur Jain
Swati
Biraj Choudhary
Abhijeet
Prateek Rajpal
Data Analysis
• Turning raw data into useful information.
• Purpose is to provide answers to questions being
asked at a program site or research questions.
• Even the greatest amount and best quality data
mean nothing if not properly analyzed—or if not
analyzed at all.
• Analysis is looking at the data in light of the
questions you need to answer:
– How would you analyze data to determine: “Is my
program/research meeting its objectives?”
Answering Programmatic
Questions
• Question: Is my program meeting its objectives?
• Analysis: Compare program targets and actual
program performance to learn how far you are
from target.
• Interpretation: Why you have or have not
achieved the target and what this means for your
program.
• May require more information.
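As a rough illustration of the "compare targets and actual performance" step, the sketch below computes how far each indicator is from its target. The indicator names and numbers are invented for this example and do not come from the slides.

```python
# Hypothetical indicators: name -> (target, actual). Values are made up.
indicators = {
    "clients_reached": (500, 430),
    "trainings_held": (12, 12),
    "surveys_completed": (200, 230),
}


def gap_from_target(target, actual):
    """Return (absolute gap, percent of target achieved)."""
    return actual - target, round(100 * actual / target, 1)


# One (gap, percent) pair per indicator.
report = {name: gap_from_target(t, a) for name, (t, a) in indicators.items()}

for name, (gap, pct) in report.items():
    status = "met" if gap >= 0 else "not met"
    print(f"{name}: {pct}% of target ({gap:+d}), {status}")
```

The numbers only show the distance from target; the interpretation step (why the gap exists) still needs context that the data alone may not provide.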
Data Preparation Process
Prepare preliminary plan of data analysis
↓
Check questionnaires
↓
Edit
↓
Code
↓
Transcribe
↓
Clean data
↓
Select a data analysis strategy
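Two of the steps in this process, checking questionnaires and cleaning data, could be sketched as follows. The field names, the sample records, and the 0 to 120 age range are assumptions made for illustration only.

```python
# Hypothetical questionnaire records as they might arrive from data entry.
REQUIRED_FIELDS = ("id", "age", "sex")

raw_records = [
    {"id": 1, "age": "34", "sex": "F"},
    {"id": 2, "age": "", "sex": "M"},     # incomplete: age missing
    {"id": 3, "age": "210", "sex": "M"},  # implausible age
    {"id": 4, "age": "51", "sex": "f"},
]


def is_complete(record):
    """Check step: every required field has a non-blank answer."""
    return all(str(record.get(f, "")).strip() for f in REQUIRED_FIELDS)


def clean(record):
    """Clean step: convert types and reject out-of-range values."""
    age = int(record["age"])
    if not 0 <= age <= 120:  # assumed plausible range
        return None
    return {"id": record["id"], "age": age, "sex": record["sex"].strip().upper()}


checked = [r for r in raw_records if is_complete(r)]
cleaned = [c for c in map(clean, checked) if c is not None]
print(cleaned)
```

In practice, rejected records would usually be logged and reviewed rather than silently dropped.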
Types of Statistical Analyses Used in
Marketing Research
• Data summarization: the process of describing a
data matrix by computing a small number of
measures that characterize the data set.
• Four functions of data summarization:
– Summarizes the data
– Applies understandable conceptualizations
– Communicates underlying patterns
– Generalizes sample findings to the population
Coding
• Coding – the process of translating information gathered
from questionnaires or other sources into something
that can be analyzed.
• Involves assigning a value to the information given;
often the value is also given a label.
• Coding can make data more consistent:
– Example: Question = Sex
– Answers = Male, Female, M, or F
– Coding will avoid inconsistencies
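The Sex example can be sketched in a few lines: every spelling of an answer is mapped to a single code. The codes 0 = male and 1 = female, and the handling of unrecognized answers, are assumptions for this sketch.

```python
# Assumed mapping from free-text answers to one code per category.
SEX_CODES = {"male": 0, "m": 0, "female": 1, "f": 1}


def code_sex(answer):
    """Return the numeric code for a raw answer, or None if unrecognized."""
    return SEX_CODES.get(answer.strip().lower())


raw_answers = ["Male", "F", "female", " M ", "unknown"]
coded = [code_sex(a) for a in raw_answers]
print(coded)
```

Returning None for unrecognized answers keeps them visible as missing values instead of forcing them into a category.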
Coding System
• Common coding systems (code and label) for variables:
– 0=No 1=Yes
(1 = value assigned, Yes= label of value)
– OR: 1=No 2=Yes
• When you assign a value you must also make it clear what
that value means.
– In first example above, 1=Yes but in second example 1=No
– As long as it is clear how the data are coded, either is fine
• You can make it clear by creating a data dictionary to
accompany the dataset.
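A data dictionary can be as simple as a lookup table kept next to the dataset. The sketch below is one possible layout; the variable names `employed` and `insured` are hypothetical and were chosen to show both coding systems mentioned above.

```python
# Hypothetical data dictionary: variable -> description and value labels.
DATA_DICTIONARY = {
    "employed": {
        "description": "Currently employed",
        "values": {0: "No", 1: "Yes"},
    },
    "insured": {
        "description": "Has health insurance",
        "values": {1: "No", 2: "Yes"},
    },
}


def label_for(variable, code):
    """Look up what a coded value means for a given variable."""
    return DATA_DICTIONARY[variable]["values"][code]


# The same code (1) means different things under the two systems.
print(label_for("employed", 1))
print(label_for("insured", 1))
```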
Coding: Dummy Variable
• A “dummy” variable is any variable that is coded to
have 2 levels (yes/no, male/female, etc.)
• Dummy variables may be used to represent more
complicated variables
– Example: number of cigarettes smoked per week;
answers span 75 different responses, ranging from
0 cigarettes to 3 packs per week.
– Can be recoded as a dummy variable:
1=smokes (at all) 0=non-smoker
• This type of coding is useful in later stages of
analysis.
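The recoding described above can be sketched as a one-line rule. The sample counts below are invented for illustration.

```python
# Hypothetical responses: cigarettes smoked per week.
cigarettes_per_week = [0, 5, 0, 40, 60, 0, 1]


def to_smoker_dummy(count):
    """Recode a count as 1 = smokes (at all), 0 = non-smoker."""
    return 1 if count > 0 else 0


smoker = [to_smoker_dummy(c) for c in cigarettes_per_week]
print(smoker)
```

The dummy variable discards how much each respondent smokes, so the original counts are worth keeping alongside it.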
Attaching Labels to Values
• Many analysis software packages allow you to attach a label
to the variable values.
– Example: Label 0s as male and 1s as female
• Makes reading data output easier:

Without label:
Variable SEX    Frequency    Percent
0               21           60%
1               14           40%

With label:
Variable SEX    Frequency    Percent
Male            21           60%
Female          14           40%
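Outside a statistics package, the same labelled frequency table can be produced by hand. This sketch rebuilds the counts from the example (21 coded 0, 14 coded 1); the variable names are assumptions.

```python
from collections import Counter

# Value labels attached to the codes, as in the example above.
SEX_LABELS = {0: "Male", 1: "Female"}

# Coded data matching the example: 21 zeros and 14 ones.
sex = [0] * 21 + [1] * 14

counts = Counter(sex)
total = len(sex)

# One row per code: (label, frequency, percent of total).
table = [
    (SEX_LABELS[code], n, round(100 * n / total))
    for code, n in sorted(counts.items())
]

for label, n, pct in table:
    print(f"{label:<8}{n:>4}{pct:>5}%")
```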
Coding – Original Variables
• Coding process is similar with other categorical
variables.
• Example: Variable EDUCATION, possible coding:
0 = Did not graduate from high school
1 = High school graduate
2 = Some college or post-high school education
3 = College graduate
• Could be coded in reverse order (0=college graduate,
3=did not graduate high school).
• For this ordinal categorical variable we want to be
consistent with numbering because the value of the
code assigned has significance.
• Example of bad coding:
0 = Some college or post-high school education
1 = High school graduate
2 = College graduate
3 = Did not graduate from high school
• Data has an inherent order but coding does not
follow that order—NOT appropriate coding for an
ordinal categorical variable.
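One way to catch the bad coding above is to check that the codes rise or fall steadily when the categories are listed in their natural order. The helper name `is_order_preserving` is hypothetical; the two codings are taken from the EDUCATION examples.

```python
# Categories in their inherent order, lowest to highest.
EDUCATION_ORDER = [
    "Did not graduate from high school",
    "High school graduate",
    "Some college or post-high school education",
    "College graduate",
]

GOOD_CODING = {label: code for code, label in enumerate(EDUCATION_ORDER)}

BAD_CODING = {
    "Some college or post-high school education": 0,
    "High school graduate": 1,
    "College graduate": 2,
    "Did not graduate from high school": 3,
}


def is_order_preserving(coding, ordered_labels):
    """True if codes strictly increase or strictly decrease along the order."""
    codes = [coding[label] for label in ordered_labels]
    pairs = list(zip(codes, codes[1:]))
    return all(a < b for a, b in pairs) or all(a > b for a, b in pairs)


print(is_order_preserving(GOOD_CODING, EDUCATION_ORDER))
print(is_order_preserving(BAD_CODING, EDUCATION_ORDER))
```

The check accepts the reverse coding (0 = college graduate) as well, since it still follows the inherent order.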
Basic Terminology and
Concepts
• Statistical terms
– Ratio
– Mean
– Median
– Mode
– Frequency Distribution
– Standard Deviation
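The terms listed above can each be computed with Python's standard library. The sample values are invented, the ratio reuses the 21 males and 14 females from the labelling example, and the standard deviation shown is the sample version (dividing by n - 1), which is one of two common conventions.

```python
import statistics
from collections import Counter

# Hypothetical sample of six observations.
values = [2, 4, 4, 5, 7, 8]

mean = statistics.mean(values)      # arithmetic average
median = statistics.median(values)  # middle value of the sorted data
mode = statistics.mode(values)      # most frequent value
stdev = statistics.stdev(values)    # sample standard deviation (n - 1)

# Frequency distribution: how often each value occurs.
frequency = Counter(values)

# Ratio: one quantity divided by another, here males to females.
ratio_male_to_female = 21 / 14

print(mean, median, mode, round(stdev, 3))
print(sorted(frequency.items()))
print(ratio_male_to_female)
```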
Conclusion
• The purpose of analysis is to provide answers to
programmatic questions.
• Data analysis describes the sample/target population.
• Analysis of data is a process of inspecting, cleaning,
transforming, and modeling data with the goal of
highlighting useful information, suggesting
conclusions, and supporting decision making.
Thank You

