SlideShare a Scribd company logo
MODULE- IV
DATA PROCESSING
DATA PROCESSING
 Data in its raw form is not useful to any research. Data processing is the method of collecting
raw data and translating it into usable information. It is usually performed in a step-by-step
process.
 Data processing is concerned with editing, coding, classifying, tabulating and charting and
diagramming research data.
 The essence of data processing in research is data reduction.
 Data reduction involves removing out the irrelevant from the relevant data and establishing
order from chaos and giving shape to a mass of data.
DATA EDITING
 Editing of data is a process of examining the collected raw data (specially in surveys) to detect
errors and omissions and to correct these when possible.
 Editing involves a careful scrutiny of the completed questionnaires and/or schedules.
 Editing is done to assure that the data are accurate, consistent with other facts gathered, uniformly
entered, as completed as possible and have been well arranged to facilitate coding and tabulation.
 With regard to points or stages at which editing should be done:
 Field editing
 Central editing.
FIELD EDITING
 Field editing consists in the review of the reporting forms by the investigator for completing (translating
or rewriting) what the latter has written in abbreviated and/or in illegible format the time of recording the
respondents’ responses.
 This type of editing is necessary in view of the fact that individual writing styles often can be difficult for
others to decipher.
 This sort of editing should be done as soon as possible after the interview, preferably on the very day
or on the next day.
 While doing field editing, the investigator must restrain himself and must not correct errors of omission
by simply guessing what the informant would have said if the question had been asked.
CENTRAL EDITING
 Central editing should take place when all forms or schedules have been completed and returned to the office.
 This type of editing implies that all forms should get a thorough editing by a single editor in a small study and by
a team of editors in case of a large inquiry.
 Editor(s) may correct the obvious errors such as an entry in the wrong place, entry recorded in months when it
should have been recorded in weeks, and the like.
 In case of inappropriate on missing replies, the editor can sometimes determine the proper answer by reviewing
the other information in the schedule.
 At times, the respondent can be contacted for clarification.
 The editor must strike out the answer if the same is inappropriate and he has no basis for determining the correct
answer or the response.
 In such a case an editing entry of ‘no answer’ is called for.
 All the wrong replies, which are quite obvious, must be dropped from the final results, especially in the context of
mail surveys.
CODING
 Coding refers to the process of assigning numerals or other symbols to answers so that responses can be put into a limited number of
categories or classes.
 Such classes should be appropriate to the research problem under consideration.
 They must mutual exclusive which means that a specific answer can be placed in one and only one cell in a given category set.
 Another rule to be observed is that of unidimensionality by which is meant that every class is defined in terms of only one concept.
 Coding is necessary for efficient analysis and through it the several replies may be reduced to a small number of classes which contain
the critical information required for analysis.
 Coding decisions should usually be taken at the designing stage of the questionnaire. This makes it possible to precode the
questionnaire choices and which in turn is helpful for computer tabulation as one can straight forward key punch from the original
questionnaires.
 But in case of hand coding some standard method may be used. One such standard method is to code in the margin with a coloured
pencil. The other method can be to transcribe the data from the questionnaire to a coding sheet.
 Whatever method is adopted, one should see that coding errors are altogether eliminated or reduced to the minimum level.
Example of Coding
Example of Coding
CLASSIFICATION
 Most research studies result in a large volume of raw data which must be reduced into homogeneous
groups if we are to get meaningful relationships.
 This fact necessitates classification of data which happens to be the process of arranging data in groups
or classes on the basis of common characteristics.
 Data having a common characteristic are placed in one class and in this way the entire data get divided
into a number of groups or classes.
 Classification can be one of the following two types, depending upon the nature of the phenomenon
involved:
 Classification according to attributes:
 Classification according to class-intervals
CLASSIFICATION ACCORDING TO ATTRIBUTES
 As stated above, data are classified on the basis of common characteristics which can either be descriptive
(such as literacy, sex, honesty, etc.) or numerical (such as weight, height, income, etc.).
 Descriptive characteristics refer to qualitative phenomenon which cannot be measured quantitatively; only
their presence or absence in an individual item can be noticed.
 Data obtained this way on the basis of certain attributes are known as statistics of attributes and their
classification is said to be classification according to attributes.
 Such classification can be simple classification or manifold classification.
 In simple classification we consider only one attribute and divide the universe into two classes—one class consisting
of items possessing the given attribute and the other class consisting of items which do not possess the given
attribute.
 But in manifold classification we consider two or more attributes simultaneously, and divide that data into a number of
classes (total number of classes of final order is given by 2n , where n = number of attributes considered).
 Whenever data are classified according to attributes, the researcher must see that the attributes are defined
in such a manner that there is least possibility of any doubt/ambiguity concerning the said attributes.
CLASSIFICATION ACCORDING TO CLASS-
INTERVALS
 Unlike descriptive characteristics, the numerical characteristics refer to quantitative phenomenon which can
be measured through some statistical units.
 Data relating to income, production, age, weight, etc. come under this category.
 Such data are known as statistics of variables and are classified on the basis of class intervals. For instance,
persons whose incomes, say, are within Rs 201 to Rs 400 can form one group, those whose incomes are
within Rs 401 to Rs 600 can form another group and so on.
 In this way the entire data may be divided into a number of groups or classes or what are usually called, ‘class-
intervals.’ Each group of class-interval, thus, has an upper limit as well as a lower limit which are known as
class limits.
 The difference between the two class limits is known as class magnitude.
 We may have classes with equal class magnitudes or with unequal class magnitudes. The number of items
which fall in a given class is known as the frequency of the given class.
 All the classes or groups, with their respective frequencies taken together and put in the form of a table, are
described as group frequency distribution or simply frequency distribution.
TABULATION
 When a mass of data has been assembled, it becomes necessary for the researcher to arrange the
same in some kind of concise and logical order.
 This procedure is referred to as tabulation.
 Thus, tabulation is the process of summarising raw data and displaying the same in compact form (i.e.,
in the form of statistical tables) for further analysis.
 In a broader sense, tabulation is an orderly arrangement of data in columns and rows.
 Tabulation is essential because of the following reasons.
 It conserves space and reduces explanatory and descriptive statement to a minimum.
 It facilitates the process of comparison.
 It facilitates the summation of items and the detection of errors and omissions.
 It provides a basis for various statistical computations.

More Related Content

PDF
7 Processing And Analysis Of Data
PPT
a data editing, coding and tabulation.ppt
DOC
Data Analysis, Hypothesis ,Report writing Unit 5 RM.doc
PPTX
Data analysis.pptx
PPTX
Editing, coding and tabulation of data
PPTX
Research methodology unit6
PPTX
Research Methodology-Data Processing
PPTX
Research methodology-Research Report
7 Processing And Analysis Of Data
a data editing, coding and tabulation.ppt
Data Analysis, Hypothesis ,Report writing Unit 5 RM.doc
Data analysis.pptx
Editing, coding and tabulation of data
Research methodology unit6
Research Methodology-Data Processing
Research methodology-Research Report

Similar to MOdule IV- Data Processing.pptx (20)

PPTX
7.pptx
PPTX
dataanalysisandinterpretation-231025045220-81d52e02.pptx
PDF
Data processing in research methodology
PPTX
Data analysis copy
PPT
GIS Pre PhD Research Methodology (1).ppt
PPTX
PROCESSING-ORGANIZING-AND-PRESENTATION-OF-DATA.pptx
PPTX
BRM ppt 1.pptx
PPTX
Analysis of data.pptx
PPTX
Research methodology
PPTX
Research methodology
PPTX
RESEARCH METHODOLOGY- PROCESSING OF DATA
PPTX
Lecture 1- data preparation.pptx
PPTX
data analysis and report wring in research (Section d)
PPTX
Ansalysis of daata w- roough slides.pptx
PPT
Chap 8
PPTX
Practical applications and analysis in Research Methodology
PDF
RM CHAPTER SEVEN AND EIGHT.pKJUYTTRRRRRRRRdf
PPTX
RESEARCH STUDY - CHAPTER 5: Finding Results Through Data Collection
PPTX
unit 4 deta analysis bbaY Dr kanchan.pptx
PPTX
unit 4 deta analysis bbaY Dr kanchan.pptx
7.pptx
dataanalysisandinterpretation-231025045220-81d52e02.pptx
Data processing in research methodology
Data analysis copy
GIS Pre PhD Research Methodology (1).ppt
PROCESSING-ORGANIZING-AND-PRESENTATION-OF-DATA.pptx
BRM ppt 1.pptx
Analysis of data.pptx
Research methodology
Research methodology
RESEARCH METHODOLOGY- PROCESSING OF DATA
Lecture 1- data preparation.pptx
data analysis and report wring in research (Section d)
Ansalysis of daata w- roough slides.pptx
Chap 8
Practical applications and analysis in Research Methodology
RM CHAPTER SEVEN AND EIGHT.pKJUYTTRRRRRRRRdf
RESEARCH STUDY - CHAPTER 5: Finding Results Through Data Collection
unit 4 deta analysis bbaY Dr kanchan.pptx
unit 4 deta analysis bbaY Dr kanchan.pptx
Ad

More from ssuserff5cd7 (7)

PPT
Sociology Ch. 5 S. 1 (1).ppt
PPTX
MODULE 4 Managerial Economics.pptx
PPT
chapter3_su08.ppt
PPTX
POM Module 5.pptx
PPTX
Managerial Economics- Module 5.pptx
PDF
india-japan-161101184146.pdf
PPTX
POM-2.pptx
Sociology Ch. 5 S. 1 (1).ppt
MODULE 4 Managerial Economics.pptx
chapter3_su08.ppt
POM Module 5.pptx
Managerial Economics- Module 5.pptx
india-japan-161101184146.pdf
POM-2.pptx
Ad

Recently uploaded (20)

PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
Leprosy and NLEP programme community medicine
PDF
Transcultural that can help you someday.
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Introduction to the R Programming Language
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Business Analytics and business intelligence.pdf
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
DOCX
Factor Analysis Word Document Presentation
PDF
Microsoft Core Cloud Services powerpoint
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
How to run a consulting project- client discovery
PPTX
IMPACT OF LANDSLIDE.....................
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Leprosy and NLEP programme community medicine
Transcultural that can help you someday.
Qualitative Qantitative and Mixed Methods.pptx
annual-report-2024-2025 original latest.
Pilar Kemerdekaan dan Identi Bangsa.pptx
Introduction to the R Programming Language
[EN] Industrial Machine Downtime Prediction
Business Analytics and business intelligence.pdf
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
A Complete Guide to Streamlining Business Processes
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
retention in jsjsksksksnbsndjddjdnFPD.pptx
Factor Analysis Word Document Presentation
Microsoft Core Cloud Services powerpoint
Acceptance and paychological effects of mandatory extra coach I classes.pptx
How to run a consulting project- client discovery
IMPACT OF LANDSLIDE.....................

MOdule IV- Data Processing.pptx

  • 2. DATA PROCESSING  Data in its raw form is not useful to any research. Data processing is the method of collecting raw data and translating it into usable information. It is usually performed in a step-by-step process.  Data processing is concerned with editing, coding, classifying, tabulating and charting and diagramming research data.  The essence of data processing in research is data reduction.  Data reduction involves removing out the irrelevant from the relevant data and establishing order from chaos and giving shape to a mass of data.
  • 3. DATA EDITING  Editing of data is a process of examining the collected raw data (specially in surveys) to detect errors and omissions and to correct these when possible.  Editing involves a careful scrutiny of the completed questionnaires and/or schedules.  Editing is done to assure that the data are accurate, consistent with other facts gathered, uniformly entered, as completed as possible and have been well arranged to facilitate coding and tabulation.  With regard to points or stages at which editing should be done:  Field editing  Central editing.
  • 4. FIELD EDITING  Field editing consists in the review of the reporting forms by the investigator for completing (translating or rewriting) what the latter has written in abbreviated and/or in illegible format the time of recording the respondents’ responses.  This type of editing is necessary in view of the fact that individual writing styles often can be difficult for others to decipher.  This sort of editing should be done as soon as possible after the interview, preferably on the very day or on the next day.  While doing field editing, the investigator must restrain himself and must not correct errors of omission by simply guessing what the informant would have said if the question had been asked.
  • 5. CENTRAL EDITING  Central editing should take place when all forms or schedules have been completed and returned to the office.  This type of editing implies that all forms should get a thorough editing by a single editor in a small study and by a team of editors in case of a large inquiry.  Editor(s) may correct the obvious errors such as an entry in the wrong place, entry recorded in months when it should have been recorded in weeks, and the like.  In case of inappropriate on missing replies, the editor can sometimes determine the proper answer by reviewing the other information in the schedule.  At times, the respondent can be contacted for clarification.  The editor must strike out the answer if the same is inappropriate and he has no basis for determining the correct answer or the response.  In such a case an editing entry of ‘no answer’ is called for.  All the wrong replies, which are quite obvious, must be dropped from the final results, especially in the context of mail surveys.
  • 6. CODING  Coding refers to the process of assigning numerals or other symbols to answers so that responses can be put into a limited number of categories or classes.  Such classes should be appropriate to the research problem under consideration.  They must mutual exclusive which means that a specific answer can be placed in one and only one cell in a given category set.  Another rule to be observed is that of unidimensionality by which is meant that every class is defined in terms of only one concept.  Coding is necessary for efficient analysis and through it the several replies may be reduced to a small number of classes which contain the critical information required for analysis.  Coding decisions should usually be taken at the designing stage of the questionnaire. This makes it possible to precode the questionnaire choices and which in turn is helpful for computer tabulation as one can straight forward key punch from the original questionnaires.  But in case of hand coding some standard method may be used. One such standard method is to code in the margin with a coloured pencil. The other method can be to transcribe the data from the questionnaire to a coding sheet.  Whatever method is adopted, one should see that coding errors are altogether eliminated or reduced to the minimum level.
  • 9. CLASSIFICATION  Most research studies result in a large volume of raw data which must be reduced into homogeneous groups if we are to get meaningful relationships.  This fact necessitates classification of data which happens to be the process of arranging data in groups or classes on the basis of common characteristics.  Data having a common characteristic are placed in one class and in this way the entire data get divided into a number of groups or classes.  Classification can be one of the following two types, depending upon the nature of the phenomenon involved:  Classification according to attributes:  Classification according to class-intervals
  • 10. CLASSIFICATION ACCORDING TO ATTRIBUTES  As stated above, data are classified on the basis of common characteristics which can either be descriptive (such as literacy, sex, honesty, etc.) or numerical (such as weight, height, income, etc.).  Descriptive characteristics refer to qualitative phenomenon which cannot be measured quantitatively; only their presence or absence in an individual item can be noticed.  Data obtained this way on the basis of certain attributes are known as statistics of attributes and their classification is said to be classification according to attributes.  Such classification can be simple classification or manifold classification.  In simple classification we consider only one attribute and divide the universe into two classes—one class consisting of items possessing the given attribute and the other class consisting of items which do not possess the given attribute.  But in manifold classification we consider two or more attributes simultaneously, and divide that data into a number of classes (total number of classes of final order is given by 2n , where n = number of attributes considered).  Whenever data are classified according to attributes, the researcher must see that the attributes are defined in such a manner that there is least possibility of any doubt/ambiguity concerning the said attributes.
  • 11. CLASSIFICATION ACCORDING TO CLASS- INTERVALS  Unlike descriptive characteristics, the numerical characteristics refer to quantitative phenomenon which can be measured through some statistical units.  Data relating to income, production, age, weight, etc. come under this category.  Such data are known as statistics of variables and are classified on the basis of class intervals. For instance, persons whose incomes, say, are within Rs 201 to Rs 400 can form one group, those whose incomes are within Rs 401 to Rs 600 can form another group and so on.  In this way the entire data may be divided into a number of groups or classes or what are usually called, ‘class- intervals.’ Each group of class-interval, thus, has an upper limit as well as a lower limit which are known as class limits.  The difference between the two class limits is known as class magnitude.  We may have classes with equal class magnitudes or with unequal class magnitudes. The number of items which fall in a given class is known as the frequency of the given class.  All the classes or groups, with their respective frequencies taken together and put in the form of a table, are described as group frequency distribution or simply frequency distribution.
  • 12. TABULATION  When a mass of data has been assembled, it becomes necessary for the researcher to arrange the same in some kind of concise and logical order.  This procedure is referred to as tabulation.  Thus, tabulation is the process of summarising raw data and displaying the same in compact form (i.e., in the form of statistical tables) for further analysis.  In a broader sense, tabulation is an orderly arrangement of data in columns and rows.  Tabulation is essential because of the following reasons.  It conserves space and reduces explanatory and descriptive statement to a minimum.  It facilitates the process of comparison.  It facilitates the summation of items and the detection of errors and omissions.  It provides a basis for various statistical computations.