3
Most read
6
Most read
10
Most read
Dr. Harisingh Gour Viswavidyalaya
A Central University
DEPARTMENT OF ZOOLOGY
TOPIC – DATABASES IN BIOINFORMATICS
MID II ASSIGNMENT
ZOO – SEC – 128
SUBMITED TO – MR. ANUPAM KUMAR
SUBMITED BY –
PRAVANJAN DASH
ROLL NO. – Y23265020, Msc 1st YEAR, 1st SEMESTER
INTRODUCTION OF DATABASE
BIOLOGICAL DATABASES are
 Collection of files containing records of biological data in
machine readable form Can be accessed, added, retrieved,
manipulated and modified.
 Store, manage, connect and distribute data.
 Data are arranged by sets of rules which are programmed
into software that manages the data called Database
Management System or DBMS.
 A biological database is a collection of data that is
structured, searchable, updated periodically and cross
referenced.
 The data is stores, maintained, annotated, curated and
stored for public/research use.
 Data collected and organized in a specific but useful way
Classification based on type of data stored
 Primary Databases: Contain original data in the form of
primary sequence data or structural data as submitted by the
scientific community.
 Secondary Databases: Contain information that has been
processed and derived from the raw data available in primary
database.eg: PROSITE, PRINTS, BLOCKS etc..
 Composite Databases: Collect and present data after
comparing and filtering them from different primary databases
and exhibit only the non redundant sequences.
PRIMARY DATA VERSUS SECONDARY DATA
PRIMARY DATA
• Primary data is a type of data researchers
directly collect from main sources.
• Includes real-time data.
• Collected to address a current research
problem.
• Accessing primary data includes a relatively
long process.
• Data collection tools include observations,
surveys, questionnaires, physical testing,
online questionnaires, personal or telephone
interviews, case studies, and focused group
discussions.
SECONDARY DATA
• Secondary data refers to already existing data
produced by the previous researchers.
• Related to the past.
• Primarily collected to address previously
existed research problems and can be used
to address the current research problem as
well.
• Referring to secondary data is quick and easy.
• Data collection tools include journal articles,
websites, books, government publications,
records, etc.
PRIMARY DATABASES
 Primary databases contain original biological data. They are
archives of raw sequence or structural data submitted by the scientific
community.
 Once given a database a accession number, the data in primary
database are never changed.
 There are three (Genbank, EMBL, DDBJ) major public sequence
databases that store raw nucleic acid sequence data produced and
submitted by researchers worldwide.
 SOME PRIMARY DATABASES
Nucleic acid databases: Gen Bank, EMBL, DDBJ
Protein sequence databases: PIR, Swiss-Prot, UNIPROT
Protein structure database: PDB
Metabolic databases: KEGG
SECONDARY DATABASE
• Secondary database contain additional information
derived from the analysis f data available in primary
sources. econdary databases are analysed in a variety
Of ways and contain different formation in different
formats.
• SOME SECONDARY DATABASES ARE
 TrEMBL
 Pfam
 PROSITE
 Profiles
 SCOP
 CATH
NUCLEOTIDE SEQUENCE DATABASE
• Composed of a group of nucleotide sequence entries.
• Data repositories that accept nucleic acid sequence data
and make it freely available to the public.
• All the three are members of the International Nucleotide
Sequence Database Consortium (INSDC) and interchange
data.
• GenBank, EMBL, DDBJ are principal nucleotide
databases.
PROTEIN SEQUENCE DATABASES
 An array of amino acid sequence entries arranged
according to the identification number.
 Well known protein sequence databases available
on www are
 Swiss-Prot
 PIR
 UNIPROT
PROTEIN STRUCTURE DATABASE
 Many proteins which exhibit a common evolutionary
origin, show structural similarities.
 Dissimilar proteins exhibit changes in primary, secondary,
teritiary and quarternary structures.
 Similar or dissimilar protein structure can be predicted
with structure database.
 These databases store a collection of three dimensional
structures of proteins.
 EXAMPLE IS pluggable database (PDB) .
THANK YOU

More Related Content

PPT
Plant tissue culture media components
PPTX
L21. techniques for selection, screening and characterization of transformants
PPTX
Phloem diffrentiation
PPT
swiss-prot<bioinformatics>
PPT
Androgenesis by Aswathy Viswanath
PPTX
Plant genome project(aribidopsis)
PPTX
totipotency
Plant tissue culture media components
L21. techniques for selection, screening and characterization of transformants
Phloem diffrentiation
swiss-prot<bioinformatics>
Androgenesis by Aswathy Viswanath
Plant genome project(aribidopsis)
totipotency

What's hot (20)

PDF
Restriction endonucleases
PPTX
Production of secondary metabolites
PPTX
Construction of genomic and c dna library
PPTX
Nucleic acid database
PPTX
Somatic embryogenesis and artificial seed production
PPTX
Transposable elements
PPTX
Transgenic plant with improved nutritional quality
PPTX
Site specific recombination
PPTX
Agrobacterium tumefaciens mediated gene transfer
PPTX
Plant tissue culture application
PPTX
TYPES OF OVULES.pptx
PPTX
Cleaning and sterilization during tissue culture
PPTX
Nucleic Acid Sequence Databases
PPT
History of plant biotechnology
PPT
Plant Tissue Culture
PPTX
FAMOUS MOLECULAR BIOLOGY EXPERIMENTS
PDF
Tools and database of NCBI
PPTX
Protein Databases
Restriction endonucleases
Production of secondary metabolites
Construction of genomic and c dna library
Nucleic acid database
Somatic embryogenesis and artificial seed production
Transposable elements
Transgenic plant with improved nutritional quality
Site specific recombination
Agrobacterium tumefaciens mediated gene transfer
Plant tissue culture application
TYPES OF OVULES.pptx
Cleaning and sterilization during tissue culture
Nucleic Acid Sequence Databases
History of plant biotechnology
Plant Tissue Culture
FAMOUS MOLECULAR BIOLOGY EXPERIMENTS
Tools and database of NCBI
Protein Databases
Ad

Similar to BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf (20)

PDF
BIOLOGICAL DATABASE AND ITS TYPES,IMPORTANCE OF BIOLOGICAL DATABASE
PPTX
biological databases.pptx
PPTX
What are Databases?
PPT
Databases.ppt
PPTX
Share_Introduction to Bioinformatics-WPS_Office.pptx
PPTX
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
PPTX
Biological data bioinformatics
PPTX
Biological databases.pptx
PPTX
Introduction OF BIOLOGICAL DATABASE
PPTX
Introduction to databases.pptx
PPTX
Biological databases
PPTX
Biological database
PPTX
Database in bioinformatics
PDF
Bioinformatics biological databases
PPT
Biological databases
PPTX
DATABASES...............................pptx
PDF
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
PDF
Bioinformatics__Lecture_1.ppt
PPTX
Primary Bioinformatics Database.pptx
PPTX
Biological Data bases for biological sciences bioinformatics
BIOLOGICAL DATABASE AND ITS TYPES,IMPORTANCE OF BIOLOGICAL DATABASE
biological databases.pptx
What are Databases?
Databases.ppt
Share_Introduction to Bioinformatics-WPS_Office.pptx
BIOINFORMATICS BIOLOGICAL DATABASES DATA BASES.pptx
Biological data bioinformatics
Biological databases.pptx
Introduction OF BIOLOGICAL DATABASE
Introduction to databases.pptx
Biological databases
Biological database
Database in bioinformatics
Bioinformatics biological databases
Biological databases
DATABASES...............................pptx
Bioinformatics: History of Bioinformatics, Components of Bioinformatics, Geno...
Bioinformatics__Lecture_1.ppt
Primary Bioinformatics Database.pptx
Biological Data bases for biological sciences bioinformatics
Ad

Recently uploaded (20)

PDF
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
PPTX
limit test definition and all limit tests
PDF
Chapter 3 - Human Development Poweroint presentation
PPTX
Presentation1 INTRODUCTION TO ENZYMES.pptx
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PDF
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
PPTX
Preformulation.pptx Preformulation studies-Including all parameter
PPTX
PMR- PPT.pptx for students and doctors tt
PDF
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)
PPT
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
PPTX
LIPID & AMINO ACID METABOLISM UNIT-III, B PHARM II SEMESTER
PPTX
perinatal infections 2-171220190027.pptx
PDF
5.Physics 8-WBS_Light.pdfFHDGJDJHFGHJHFTY
PPTX
Cells and Organs of the Immune System (Unit-2) - Majesh Sir.pptx
PPTX
Introduction to Immunology (Unit-1).pptx
PPTX
A powerpoint on colorectal cancer with brief background
PPT
Animal tissues, epithelial, muscle, connective, nervous tissue
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PPTX
gene cloning powerpoint for general biology 2
PPT
Cell Structure Description and Functions
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
limit test definition and all limit tests
Chapter 3 - Human Development Poweroint presentation
Presentation1 INTRODUCTION TO ENZYMES.pptx
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
From Molecular Interactions to Solubility in Deep Eutectic Solvents: Explorin...
Preformulation.pptx Preformulation studies-Including all parameter
PMR- PPT.pptx for students and doctors tt
The Future of Telehealth: Engineering New Platforms for Care (www.kiu.ac.ug)
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
LIPID & AMINO ACID METABOLISM UNIT-III, B PHARM II SEMESTER
perinatal infections 2-171220190027.pptx
5.Physics 8-WBS_Light.pdfFHDGJDJHFGHJHFTY
Cells and Organs of the Immune System (Unit-2) - Majesh Sir.pptx
Introduction to Immunology (Unit-1).pptx
A powerpoint on colorectal cancer with brief background
Animal tissues, epithelial, muscle, connective, nervous tissue
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
gene cloning powerpoint for general biology 2
Cell Structure Description and Functions

BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf

  • 1. Dr. Harisingh Gour Viswavidyalaya A Central University DEPARTMENT OF ZOOLOGY TOPIC – DATABASES IN BIOINFORMATICS MID II ASSIGNMENT ZOO – SEC – 128 SUBMITED TO – MR. ANUPAM KUMAR SUBMITED BY – PRAVANJAN DASH ROLL NO. – Y23265020, Msc 1st YEAR, 1st SEMESTER
  • 2. INTRODUCTION OF DATABASE BIOLOGICAL DATABASES are  Collection of files containing records of biological data in machine readable form Can be accessed, added, retrieved, manipulated and modified.  Store, manage, connect and distribute data.  Data are arranged by sets of rules which are programmed into software that manages the data called Database Management System or DBMS.  A biological database is a collection of data that is structured, searchable, updated periodically and cross referenced.  The data is stores, maintained, annotated, curated and stored for public/research use.  Data collected and organized in a specific but useful way
  • 3. Classification based on type of data stored  Primary Databases: Contain original data in the form of primary sequence data or structural data as submitted by the scientific community.  Secondary Databases: Contain information that has been processed and derived from the raw data available in primary database.eg: PROSITE, PRINTS, BLOCKS etc..  Composite Databases: Collect and present data after comparing and filtering them from different primary databases and exhibit only the non redundant sequences.
  • 4. PRIMARY DATA VERSUS SECONDARY DATA PRIMARY DATA • Primary data is a type of data researchers directly collect from main sources. • Includes real-time data. • Collected to address a current research problem. • Accessing primary data includes a relatively long process. • Data collection tools include observations, surveys, questionnaires, physical testing, online questionnaires, personal or telephone interviews, case studies, and focused group discussions. SECONDARY DATA • Secondary data refers to already existing data produced by the previous researchers. • Related to the past. • Primarily collected to address previously existed research problems and can be used to address the current research problem as well. • Referring to secondary data is quick and easy. • Data collection tools include journal articles, websites, books, government publications, records, etc.
  • 5. PRIMARY DATABASES  Primary databases contain original biological data. They are archives of raw sequence or structural data submitted by the scientific community.  Once given a database a accession number, the data in primary database are never changed.  There are three (Genbank, EMBL, DDBJ) major public sequence databases that store raw nucleic acid sequence data produced and submitted by researchers worldwide.  SOME PRIMARY DATABASES Nucleic acid databases: Gen Bank, EMBL, DDBJ Protein sequence databases: PIR, Swiss-Prot, UNIPROT Protein structure database: PDB Metabolic databases: KEGG
  • 6. SECONDARY DATABASE • Secondary database contain additional information derived from the analysis f data available in primary sources. econdary databases are analysed in a variety Of ways and contain different formation in different formats. • SOME SECONDARY DATABASES ARE  TrEMBL  Pfam  PROSITE  Profiles  SCOP  CATH
  • 7. NUCLEOTIDE SEQUENCE DATABASE • Composed of a group of nucleotide sequence entries. • Data repositories that accept nucleic acid sequence data and make it freely available to the public. • All the three are members of the International Nucleotide Sequence Database Consortium (INSDC) and interchange data. • GenBank, EMBL, DDBJ are principal nucleotide databases.
  • 8. PROTEIN SEQUENCE DATABASES  An array of amino acid sequence entries arranged according to the identification number.  Well known protein sequence databases available on www are  Swiss-Prot  PIR  UNIPROT
  • 9. PROTEIN STRUCTURE DATABASE  Many proteins which exhibit a common evolutionary origin, show structural similarities.  Dissimilar proteins exhibit changes in primary, secondary, teritiary and quarternary structures.  Similar or dissimilar protein structure can be predicted with structure database.  These databases store a collection of three dimensional structures of proteins.  EXAMPLE IS pluggable database (PDB) .