SlideShare a Scribd company logo
DATA BASE AND DATA ENTRY




         Presented By:-
        Mukesh Jaiswal
         Somya Verma
        ICRI, Dehradun.
Clinical Data Base
• A database is a method of organizing and
  analyzing information.
• A database is a collection of information that
  is organized so that it can easily be
  accessed, managed, and updated. In one
  view, databases can be classified according to
  types of content: bibliographic, full-
  text, numeric, and images.
Cont…
• In computing, databases are sometimes classified
  according to their organizational approach. The
  most prevalent approach is the relational
  database, a tabular database in which data is
  defined so that it can be reorganized and
  accessed in a number of different ways. A
  distributed database is one that can be dispersed
  or replicated among different points in a network.
  An object-oriented programming database is one
  that is congruent with the data defined in object
  classes and subclasses
Cont…
• The main objectives of data base design is to
  capture and store clinical data accurately.
• The essential features of good design are ease
  of data capture, efficient creation of analysis
  datasets and accommodation of source data
  transfer formets.
Why use a database?
• Organize and analyze information in different
  ways
 –   Sorting
 –   Grouping
 –   Querying
 –   Reporting
 –   Exporting for statistical analysis
• Computerized database
 –   Speed
 –   Quality control
 –   Precision
 –   Automate repetitive tasks
Databases versus Excel
• Excel has some limited capabilities to sort data but its primary
  function is to create financial spreadsheets
  – Can create “what if” scenarios to determine financial consequences
  – Can be used for small and limited research data sets and simple lists
  – Not multi-user such that only one person can work on the file at a time
• Databases are designed to collect, sort, and manipulate data
  – Data sets can process large amounts of data and is usually limited by
    hardware constraints
  – Structure is in the same format for each member record of a table
  – Data quality control features ensure that valid data is entered
  – A relational database allows for linking of an unlimited number of
    tables
  – Databases are multi-user because the data can reside on a server and
    multiple people can have access at the same time
  – Many databases offer web interfaces thereby eliminating the need for
    each user to have a copy of the the program on their computer
Cont…
• Many databases offer audit functions required by
  certain regulatory agencies
   • Tracks date record created and modified
   • Tracks original and changed values
   • Requires user to give reason for the change
• Databases are more suitable for importing data
  from multiple sources
   • More robust in connecting to different data sources
   • Imports of different data types into different tables can be
     linked via common identifiers such as subject ID
   • Merging multiple data sources into Excel so that the rows line
     up properly in a flat file format can be a challenge
How is a database organized?
• One or more tables
• Tables store records
   – Patient identifiers
   – Demographics and history
   – Test results
   – Etc…..
• A record is a collection of fields
   – Patient identifiers
      • Name, DOB, address, …..are stored in separate fields
Records and Fields
               Fields
Records
Differences between a clinical and
research database
• Clinical database
  – Form or report oriented so data is displayed for
    clinical decision making
  – Emphasis on displaying or reporting of individual
    data rather than accumulating multiple records
• Research database
  – Table oriented so that data is accumulated for
    eventual export to a statistical package for data
    analysis and reporting
  – Less emphasis on individual records
Types of Database
• Flat-File:- The flat-file style of database are
  ideal for small amounts of data that needs to
  be human readable or edited by hand.
  Essentially all they are made up of is a set of
  strings in one or more files that can be parsed
  to get the information they store; great for
  storing simple lists and data values, but can
  get complicated when you try to replicate
  more complex data structures.
Cont…
• Relational:- The relational databases such as
  MySQL, Microsoft SQL Server and Oracle, have a much
  more logical structure in the way that it stores data.
  Tables can be used to represent real world
  objects, with each field acting like an attribute.
• One major advantage of the relational model is that, if
  a database is designed efficiently, there should be no
  duplication of any data; helping to maintain database
  integrity. This can also represent a huge saving in file
  size, which is important when dealing with large
  volumes of data.
Cont…
• Relational databases also have functions "built
  in" that help them to retrieve, sort and edit
  the data in many different ways. These
  functions save script designers from having to
  worry about filtering out the results that they
  get, and so can go quite some way to speeding
  up the development and production of web
  applications.
Advantages of a Relational Database
 • Elimination of Multiple Value Data – a relational database allows
   creation of relationships for subordinate data. For example, a table
   for laboratory testing and another table for clinical findings would
   each have multiple subjects but the subject demographic
   information is maintained in a separate table).

 • Avoiding Update Anomalies – since data is stored in only one
   place, it is easy to update (no other copies to remember to update).

 • Avoiding Data Entry Anomalies – like updates, since data is only
   stored in one place, it needs to be inserted in one place.

 • Avoiding Data Deletion Anomalies – once again, since data is in
   one place only, it is deleted only once.
Advantages of a database
• Collection of data in a centralized location
• Controls redundant data
• Data stored so as to appear to users in one
  location
  – Data can be stored in multiple tables and come
    from multiple sources
  – A relational database brings it all together
Database Design Considerations
• What to collect
 – What questions are to be answered?
 – Think of the data tables in your future publications
   • Focus on the key data elements rather than collect as much as
     possible
• What statistical package will be used
 – Format of the data file to which the data will be exported
   • Allowable characters
   • Format for certain analyses
          – For example, gender can be recorded in the database as M or F but
            statistical package may require 0 and 1
   • Length of data field labels
   • Long or wide format
Long versus Wide Format
     Long: each year is represented as its own observation in a record




  Wide: each family is a record and each year is a field with that record
Quality Control of Data Before
Study
• Collect only needed variables
• Select appropriate computer hardware and
  software
• Plan analyses with dummy tabulations
• Develop study forms
  –   Precode responses
  –   Format boxes for data entry
  –   Label each page with date, time, ID
  –   Consider scan technology
What needs to be in the research
database?
 Research variables directly related to the
  hypotheses being tested-YES
 Clinical measures used for screening-MAYBE
 ◦ Blood work, ECG, medical history
 Administrative data-NO
 ◦ Contact information
 ◦ Scheduling
What Do You Do With the Data?
• Ongoing monitoring

• Safety/adverse event reporting

• IRB reports/sponsor reports

• FDA reports

• Early analysis/late analysis
Data Entry
• Refers to the process of transferring data from
  the paper CRF to the data base.
• This is also refers to as transcribing the data.
• Data entry result in creation of electronic data
  , which corresponds to the CRF data.
• Once the data is entered into the database, it
  is reviewed and validated by the data editor.
• Data entry consists of both double entry and
  single entry.
Double Entry
• This involves entry of the same CRF page by two
  independent data entry personnel.
• The first data entry personnel keys in the data
  into the database. Later, a second independent
  data entry personnel keys in the same data.
• In the case of difference or discrepancy between
  first and second entry, a ‘pop up’ box throws
  up, alerting the second data entry personnel
  either key in what they see or to accept what the
  first data entry personnel has entered.
Cont…
• Another option is to have a third personnel
  review the differences/discrepancies and
  resolve them.
• Thus double data entry serves as a quality
  check in the data that is entered into the
  database.
Cont…
• The system allowed design of data entry forms that
  satisfied the needs of our
  clinicians, biostatisticians, and administrative staff. The
  system drastically reduced the time required to enter
  patient exam, demographic, and laboratory
  measurement data onto the study database, and
  provided tools for verifying that the data were scanned
  accurately. The system improved both the quality of
  patient care and the integrity of clinical patient
  data, allowing clinicians to quickly and easily retrieve
  patient records, and permitted our biostatisticians to
  generate periodic recruitment monitoring, patient
  safety, protocol adherence, and data quality assurance
  reports in a timely fashion.
Single Entry
• This involves entry by single data entry
  personnel.
• This process is used when there are sufficient
  and extensive checks built into the database
  that would detect certain error that might be
  missed out by the data entry personnel.
• Single data entry is extensively used in EDC
  and RDC systems.
Cont…
• Thus single data entry eliminates having data
  entry personnel within the data management
  unit.
• Once the data is keyed directly at site, it is
  already to be reviewed, edited and validated
  by the data editor.
Cont…
The data entry could be of two types:-

• Data entry is done locally at the site database and
  transmitted periodically to the central database via
  internet or using a dialup line. Sometimes the data is
  sent using other electronic media such as a CD, floppy
  or as a mail attachment.

• Data entry is done online directly into the central
  database via internet. Usually these systems are web-
  based and the data is available in real time for review.
Rules for Data Entry
• Each variable has a field in the dataset
• Categorical and nominal values require a number or
  string code
• Continuous values are entered directly
• Missing values must be different values from a real
  response
   – Common formats are “99” or bullets “·”
   – Don’t know is a response—do not leave blank
   – “0” is not the same as missing
• Coding instructions should be on form
• Avoid open-ended questions
Avoid open-ended questions
• Enter the subject’s
  gender:___________________


• Enter the subject's level of
  education:__________
Close Ended Question

     What is the subject’s sex? Check one




                        Male

                        Female
Use pre-coded responses where
possible
THANK YOU!

More Related Content

PPT
Retail Sales Training
PPTX
Implant supported overdenture
PPTX
Sales associate Chapter 1 ( Basic of Retail )
PDF
Database Lock _ Unlock Procedure_Katalyst HLS
PPTX
PETIT FOURS.pptx
PDF
Artificial Intelligence (AI) in Education.pdf
PPT
Distal extension removable partial denture prosthesis /certified fixed orthod...
PPTX
PREPARE PASTRY PRODUCTS
Retail Sales Training
Implant supported overdenture
Sales associate Chapter 1 ( Basic of Retail )
Database Lock _ Unlock Procedure_Katalyst HLS
PETIT FOURS.pptx
Artificial Intelligence (AI) in Education.pdf
Distal extension removable partial denture prosthesis /certified fixed orthod...
PREPARE PASTRY PRODUCTS

What's hot (20)

PPT
Data integrity
PDF
Introduction to basic data analytics tools
PPT
Data mining slides
 
PPT
Data indexing presentation
PPTX
Knowledge discovery process
PPTX
Data cleansing
PPTX
The Growing Importance of Data Cleaning
PDF
Data Models
PPT
Research Data Management
PPTX
Data Preparation.pptx
PPTX
Database Design
PDF
The importance of data
PPTX
PPTX
Data Analysis & Visualization using MS. Excel
PDF
Introduction to data analytics
PPTX
Data Visualization & Analytics.pptx
PPT
Data Analyst Role
PPTX
Database indexing techniques
PPTX
Data warehouse architecture
PPTX
1. Data Analytics-introduction
Data integrity
Introduction to basic data analytics tools
Data mining slides
 
Data indexing presentation
Knowledge discovery process
Data cleansing
The Growing Importance of Data Cleaning
Data Models
Research Data Management
Data Preparation.pptx
Database Design
The importance of data
Data Analysis & Visualization using MS. Excel
Introduction to data analytics
Data Visualization & Analytics.pptx
Data Analyst Role
Database indexing techniques
Data warehouse architecture
1. Data Analytics-introduction
Ad

Viewers also liked (20)

PPTX
Data entry projects
PPTX
Excel Lesson 2: Data Entry
PPT
Presentation on input devices
PPT
Spreadsheet Features
PDF
ODIN Final Event - The Care and Feeding of Scientific Data
PDF
Data Care, Feeding, and Maintenance
PPT
Amos
PPTX
Tax Audit
PPTX
Concepts of Data Bases
PPTX
Statistical software packages
PPT
Scott MacKenzie at BayCHI: Evaluating Eye Tracking Systems for Computer Data ...
PPTX
Database and types of database
PPTX
Data collection and input overview
PPT
Edp Powerpoint
PPTX
Progressive: Pay-as-you-go insurance
PPTX
Multiple Regression Analysis
PPTX
Database and types of databases
PDF
Structural Equation Modelling (SEM) Part 1
PPTX
Tax audit presentation AY 2013-14.
PPTX
Best practices data collection
Data entry projects
Excel Lesson 2: Data Entry
Presentation on input devices
Spreadsheet Features
ODIN Final Event - The Care and Feeding of Scientific Data
Data Care, Feeding, and Maintenance
Amos
Tax Audit
Concepts of Data Bases
Statistical software packages
Scott MacKenzie at BayCHI: Evaluating Eye Tracking Systems for Computer Data ...
Database and types of database
Data collection and input overview
Edp Powerpoint
Progressive: Pay-as-you-go insurance
Multiple Regression Analysis
Database and types of databases
Structural Equation Modelling (SEM) Part 1
Tax audit presentation AY 2013-14.
Best practices data collection
Ad

Similar to Data base and data entry presentation by mj n somya (20)

PPTX
Lec20.pptx introduction to data bases and information systems
PPTX
System Analysis And Design
PPTX
Big data analyti data analytical life cycle
PPTX
Data concepts
PPT
Management information system database management
PPT
Unit 3 part i Data mining
PPT
DW (1).ppt
PDF
UNIT machine learning unit 1,algorithm pdf
PPT
Various Applications of Data Warehouse.ppt
PPTX
Intro.pptx
PDF
Introduction to Artificial Intelligence_ Lec 4
PPT
Fundamentals of information systems chapter 3.ppt
PPTX
Fundamentals of information systems chapter 3.pptx
PDF
Lect 1a - Introduction to Pharmacy Informatics 1a.pdf
PPT
Database an introduction
PPTX
data base management system basic class pdf
PPTX
Unit1 DBMS Introduction
PPTX
DBMS basics and normalizations unit.pptx
PPTX
dbms introduction.pptx
PPTX
DATABASE MANAGEMENT SYSTEMS CS 3492.pptx
Lec20.pptx introduction to data bases and information systems
System Analysis And Design
Big data analyti data analytical life cycle
Data concepts
Management information system database management
Unit 3 part i Data mining
DW (1).ppt
UNIT machine learning unit 1,algorithm pdf
Various Applications of Data Warehouse.ppt
Intro.pptx
Introduction to Artificial Intelligence_ Lec 4
Fundamentals of information systems chapter 3.ppt
Fundamentals of information systems chapter 3.pptx
Lect 1a - Introduction to Pharmacy Informatics 1a.pdf
Database an introduction
data base management system basic class pdf
Unit1 DBMS Introduction
DBMS basics and normalizations unit.pptx
dbms introduction.pptx
DATABASE MANAGEMENT SYSTEMS CS 3492.pptx

More from Mukesh Jaiswal (6)

PPTX
ADR by Mukesh Jaiswal & Somya Verma
PPT
Fraud & misconduct by Mukesh Jaiswal
PPT
Bioavailability & bioequivalance
PPTX
Site & investigator selection
PDF
HTS by mukesh
PPTX
Kinetics of ADME by Mukesh Jaiswal
ADR by Mukesh Jaiswal & Somya Verma
Fraud & misconduct by Mukesh Jaiswal
Bioavailability & bioequivalance
Site & investigator selection
HTS by mukesh
Kinetics of ADME by Mukesh Jaiswal

Recently uploaded (20)

PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PDF
Computing-Curriculum for Schools in Ghana
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
IGGE1 Understanding the Self1234567891011
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
Complications of Minimal Access Surgery at WLH
PDF
advance database management system book.pdf
PDF
Trump Administration's workforce development strategy
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Weekly quiz Compilation Jan -July 25.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Computing-Curriculum for Schools in Ghana
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
IGGE1 Understanding the Self1234567891011
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Orientation - ARALprogram of Deped to the Parents.pptx
A powerpoint presentation on the Revised K-10 Science Shaping Paper
Practical Manual AGRO-233 Principles and Practices of Natural Farming
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
Complications of Minimal Access Surgery at WLH
advance database management system book.pdf
Trump Administration's workforce development strategy
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf

Data base and data entry presentation by mj n somya

  • 1. DATA BASE AND DATA ENTRY Presented By:- Mukesh Jaiswal Somya Verma ICRI, Dehradun.
  • 2. Clinical Data Base • A database is a method of organizing and analyzing information. • A database is a collection of information that is organized so that it can easily be accessed, managed, and updated. In one view, databases can be classified according to types of content: bibliographic, full- text, numeric, and images.
  • 3. Cont… • In computing, databases are sometimes classified according to their organizational approach. The most prevalent approach is the relational database, a tabular database in which data is defined so that it can be reorganized and accessed in a number of different ways. A distributed database is one that can be dispersed or replicated among different points in a network. An object-oriented programming database is one that is congruent with the data defined in object classes and subclasses
  • 4. Cont… • The main objectives of data base design is to capture and store clinical data accurately. • The essential features of good design are ease of data capture, efficient creation of analysis datasets and accommodation of source data transfer formets.
  • 5. Why use a database? • Organize and analyze information in different ways – Sorting – Grouping – Querying – Reporting – Exporting for statistical analysis • Computerized database – Speed – Quality control – Precision – Automate repetitive tasks
  • 6. Databases versus Excel • Excel has some limited capabilities to sort data but its primary function is to create financial spreadsheets – Can create “what if” scenarios to determine financial consequences – Can be used for small and limited research data sets and simple lists – Not multi-user such that only one person can work on the file at a time • Databases are designed to collect, sort, and manipulate data – Data sets can process large amounts of data and is usually limited by hardware constraints – Structure is in the same format for each member record of a table – Data quality control features ensure that valid data is entered – A relational database allows for linking of an unlimited number of tables – Databases are multi-user because the data can reside on a server and multiple people can have access at the same time – Many databases offer web interfaces thereby eliminating the need for each user to have a copy of the the program on their computer
  • 7. Cont… • Many databases offer audit functions required by certain regulatory agencies • Tracks date record created and modified • Tracks original and changed values • Requires user to give reason for the change • Databases are more suitable for importing data from multiple sources • More robust in connecting to different data sources • Imports of different data types into different tables can be linked via common identifiers such as subject ID • Merging multiple data sources into Excel so that the rows line up properly in a flat file format can be a challenge
  • 8. How is a database organized? • One or more tables • Tables store records – Patient identifiers – Demographics and history – Test results – Etc….. • A record is a collection of fields – Patient identifiers • Name, DOB, address, …..are stored in separate fields
  • 9. Records and Fields Fields Records
  • 10. Differences between a clinical and research database • Clinical database – Form or report oriented so data is displayed for clinical decision making – Emphasis on displaying or reporting of individual data rather than accumulating multiple records • Research database – Table oriented so that data is accumulated for eventual export to a statistical package for data analysis and reporting – Less emphasis on individual records
  • 11. Types of Database • Flat-File:- The flat-file style of database are ideal for small amounts of data that needs to be human readable or edited by hand. Essentially all they are made up of is a set of strings in one or more files that can be parsed to get the information they store; great for storing simple lists and data values, but can get complicated when you try to replicate more complex data structures.
  • 12. Cont… • Relational:- The relational databases such as MySQL, Microsoft SQL Server and Oracle, have a much more logical structure in the way that it stores data. Tables can be used to represent real world objects, with each field acting like an attribute. • One major advantage of the relational model is that, if a database is designed efficiently, there should be no duplication of any data; helping to maintain database integrity. This can also represent a huge saving in file size, which is important when dealing with large volumes of data.
  • 13. Cont… • Relational databases also have functions "built in" that help them to retrieve, sort and edit the data in many different ways. These functions save script designers from having to worry about filtering out the results that they get, and so can go quite some way to speeding up the development and production of web applications.
  • 14. Advantages of a Relational Database • Elimination of Multiple Value Data – a relational database allows creation of relationships for subordinate data. For example, a table for laboratory testing and another table for clinical findings would each have multiple subjects but the subject demographic information is maintained in a separate table). • Avoiding Update Anomalies – since data is stored in only one place, it is easy to update (no other copies to remember to update). • Avoiding Data Entry Anomalies – like updates, since data is only stored in one place, it needs to be inserted in one place. • Avoiding Data Deletion Anomalies – once again, since data is in one place only, it is deleted only once.
  • 15. Advantages of a database • Collection of data in a centralized location • Controls redundant data • Data stored so as to appear to users in one location – Data can be stored in multiple tables and come from multiple sources – A relational database brings it all together
  • 16. Database Design Considerations • What to collect – What questions are to be answered? – Think of the data tables in your future publications • Focus on the key data elements rather than collect as much as possible • What statistical package will be used – Format of the data file to which the data will be exported • Allowable characters • Format for certain analyses – For example, gender can be recorded in the database as M or F but statistical package may require 0 and 1 • Length of data field labels • Long or wide format
  • 17. Long versus Wide Format Long: each year is represented as its own observation in a record Wide: each family is a record and each year is a field with that record
  • 18. Quality Control of Data Before Study • Collect only needed variables • Select appropriate computer hardware and software • Plan analyses with dummy tabulations • Develop study forms – Precode responses – Format boxes for data entry – Label each page with date, time, ID – Consider scan technology
  • 19. What needs to be in the research database?  Research variables directly related to the hypotheses being tested-YES  Clinical measures used for screening-MAYBE ◦ Blood work, ECG, medical history  Administrative data-NO ◦ Contact information ◦ Scheduling
  • 20. What Do You Do With the Data? • Ongoing monitoring • Safety/adverse event reporting • IRB reports/sponsor reports • FDA reports • Early analysis/late analysis
  • 21. Data Entry • Refers to the process of transferring data from the paper CRF to the data base. • This is also refers to as transcribing the data. • Data entry result in creation of electronic data , which corresponds to the CRF data. • Once the data is entered into the database, it is reviewed and validated by the data editor. • Data entry consists of both double entry and single entry.
  • 22. Double Entry • This involves entry of the same CRF page by two independent data entry personnel. • The first data entry personnel keys in the data into the database. Later, a second independent data entry personnel keys in the same data. • In the case of difference or discrepancy between first and second entry, a ‘pop up’ box throws up, alerting the second data entry personnel either key in what they see or to accept what the first data entry personnel has entered.
  • 23. Cont… • Another option is to have a third personnel review the differences/discrepancies and resolve them. • Thus double data entry serves as a quality check in the data that is entered into the database.
  • 24. Cont… • The system allowed design of data entry forms that satisfied the needs of our clinicians, biostatisticians, and administrative staff. The system drastically reduced the time required to enter patient exam, demographic, and laboratory measurement data onto the study database, and provided tools for verifying that the data were scanned accurately. The system improved both the quality of patient care and the integrity of clinical patient data, allowing clinicians to quickly and easily retrieve patient records, and permitted our biostatisticians to generate periodic recruitment monitoring, patient safety, protocol adherence, and data quality assurance reports in a timely fashion.
  • 25. Single Entry • This involves entry by single data entry personnel. • This process is used when there are sufficient and extensive checks built into the database that would detect certain error that might be missed out by the data entry personnel. • Single data entry is extensively used in EDC and RDC systems.
  • 26. Cont… • Thus single data entry eliminates having data entry personnel within the data management unit. • Once the data is keyed directly at site, it is already to be reviewed, edited and validated by the data editor.
  • 27. Cont… The data entry could be of two types:- • Data entry is done locally at the site database and transmitted periodically to the central database via internet or using a dialup line. Sometimes the data is sent using other electronic media such as a CD, floppy or as a mail attachment. • Data entry is done online directly into the central database via internet. Usually these systems are web- based and the data is available in real time for review.
  • 28. Rules for Data Entry • Each variable has a field in the dataset • Categorical and nominal values require a number or string code • Continuous values are entered directly • Missing values must be different values from a real response – Common formats are “99” or bullets “·” – Don’t know is a response—do not leave blank – “0” is not the same as missing • Coding instructions should be on form • Avoid open-ended questions
  • 29. Avoid open-ended questions • Enter the subject’s gender:___________________ • Enter the subject's level of education:__________
  • 30. Close Ended Question What is the subject’s sex? Check one Male Female
  • 31. Use pre-coded responses where possible