SlideShare a Scribd company logo
7
Most read
9
Most read
12
Most read
Microarray Data Analysis
• Microarrays can be used in many types of
experiments including genotyping, epigenetics,
translation profiling and gene expression
profiling.
• Gene expression profiling is by far the most
common use of microarray technology. Both
one and two colour microarrays can be used for
this type of experiment. The process of
analysing gene expression data is similar for
both types of microarrays and involves
• feature extraction;
• quality control;
• normalisation;
• differential expression analysis;
• biological interpretation of the results;
• submission of data to a public database.
Overview of
the
microarray
data analysis
pipeline
Feature
extraction
Feature extraction is the process of converting
the scanned image of the microarray into
quantifiable (computable) values and annotating
it with the gene IDs, sample names and other
useful information
Feature extraction involves the conversion of the scanned microarray image
to quantifiable values that are saved in binary (e.g. CEL) or text format.
Feature
extraction
• This process is often performed using the
software provided by the microarray
manufacturer. The output of this process is raw
(i.e. unprocessed) data files that can be in binary
or text format
• After the feature extraction process, the data can
be analysed. Array manufacturers often provide
software to open and analyse their raw data files.
These programs may not always be available, may
become obsolete after a few years, or may not be
flexible enough for your needs. There are several
free software tools that are suitable for the
downstream processing of microarray files.
Examples are the Galaxy
platform, GenePattern, GeneSpring (licence
required) and the statistics software R.
Quality
control
• Quality control of microarray data begins with
the visual inspection of the scanned microarray
images to make sure that there are no obvious
splotches, scratches or blank areas
• After feature extraction, the data analysis
software packages can be used to make
diagnostic plots (for example of background
signal, average intensity values and percentage
of genes above background) to help identify
problematic arrays, reporters or samples
Examples of quality control plots made when analysing
differential expression data in Expression Atlas. Left to right:
Array intensity distributions, PCA plot, density estimates.
Normalisation
• Normalisation of microarray data is used to
control for technical variation between assays,
while preserving the biological variation. There
are many ways to normalise the data, and the
methods used depend on:
• the type of array;
• the design of the experiment;
• assumptions made about the data (e.g. 'the
majority of genes represented on the microarray
are not expected to be differentially expressed in
the test group relative to controls');
• and the package being used to analyse the data.
Differential
expression
analysis
• The goal of differential expression analysis is to
identify genes whose expression differs under
different conditions. An important
consideration for differential expression analysis
is correction for multiple testing. This is a
statistical phenomenon that occurs when
thousands of comparisons (e.g. the comparison
of expression of multiple genes in multiple
conditions) are performed for a small number
of samples (most microarray experiments have
less than five biological replicates per
condition). This leads to an increased chance of
false positive results
Biological
interpretation
of gene
expression
data
• Many of the methods for visualising and
interpreting microarray data can also be used
for RNA-seq experiments.
Submission of
data to a
public
repository
• Once you have finished generating microarray
data (e.g. raw data files and
processed/normalised data files are ready), it is
important that you submit the data files
together with metadata to a public database
such as ArrayExpress. This helps to ensure the
reproducibility of your experiment and is now
a requirement of many journals and funding
bodies.

More Related Content

PPTX
SAGE (Serial analysis of Gene Expression)
PPT
Microarray Data Analysis
PPTX
Tertiary structure prediction- MODELLER, RASMOL
PPTX
Protein Threading
PPTX
PPTX
Next generation sequencing
PPTX
System biology and its tools
PPTX
Cancer Proteomics
SAGE (Serial analysis of Gene Expression)
Microarray Data Analysis
Tertiary structure prediction- MODELLER, RASMOL
Protein Threading
Next generation sequencing
System biology and its tools
Cancer Proteomics

What's hot (20)

PDF
Dot matrix
PPTX
MASSIVELY PARELLEL SIGNATURE SEQUENCING
PPTX
Introduction to systems biology
PPTX
Gene Expression Omnibus (GEO)
PPTX
Protein protein interactions
PPTX
PPT
PPTX
gene prediction programs
PPTX
Tools for modeling protein 3 d structure
PDF
PPTX
sequence of file formats in bioinformatics
PPTX
Protein Structure Determination
PPTX
SAGE- Serial Analysis of Gene Expression
PPT
Sequence file formats
PPTX
3D cell cultures
PPT
Clustal
PPTX
Dna binding proteins
PDF
Protein Structure Prediction
Dot matrix
MASSIVELY PARELLEL SIGNATURE SEQUENCING
Introduction to systems biology
Gene Expression Omnibus (GEO)
Protein protein interactions
gene prediction programs
Tools for modeling protein 3 d structure
sequence of file formats in bioinformatics
Protein Structure Determination
SAGE- Serial Analysis of Gene Expression
Sequence file formats
3D cell cultures
Clustal
Dna binding proteins
Protein Structure Prediction
Ad

Similar to Microarray data Analysis.pptx (20)

PPT
20100509 bioinformatics kapushesky_lecture03-04_0
PDF
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
PDF
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
PPTX
Bioinformatics-R program의 실례
PPT
Myths in science & statistics s ha h.ppt
PPTX
PPTPPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT..pptx
PPTX
GIAB Integrating multiple technologies to form benchmark SVs 180517
PPTX
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
PPTX
171017 giab for giab grc workshop
PPTX
High throughput Data Analysis
PPTX
171017 giab for giab grc workshop
PPTX
Whole Exome Sequencing .pptx
PPTX
Metabolomic Data Analysis Workshop and Tutorials (2014)
PDF
Feature selection and microarray data
PPT
Multivariate Analysis and Visualization of Proteomic Data
PDF
Data analysis
PDF
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
PDF
Standardization of the HIPC Data Templates: The Story So Far
PDF
Standardization of the HIPC Data Templates
PPTX
Microarray data analysis _ by Ritesh Kumar
20100509 bioinformatics kapushesky_lecture03-04_0
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
Bioinformatics-R program의 실례
Myths in science & statistics s ha h.ppt
PPTPPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT.PPT..pptx
GIAB Integrating multiple technologies to form benchmark SVs 180517
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
171017 giab for giab grc workshop
High throughput Data Analysis
171017 giab for giab grc workshop
Whole Exome Sequencing .pptx
Metabolomic Data Analysis Workshop and Tutorials (2014)
Feature selection and microarray data
Multivariate Analysis and Visualization of Proteomic Data
Data analysis
MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Seque...
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates
Microarray data analysis _ by Ritesh Kumar
Ad

More from sanarao25 (19)

PPT
Lecture Environmental Biotechnology, Bioremediation 24-12-2019.ppt
PPTX
Parasitic diseases.pptx
PPTX
Prenylation.pptx
PPTX
Prenylation.pptx
PPTX
Data Retreival, Protein sequence database, Gene Pridiction 3.pptx
PPTX
COPI and COPII Secretery pathways.pptx
PPT
citrus SH presentation.ppt
PPT
Molecular markers.ppt
PDF
An efficient cucumber (Cucumis sativus L.pdf
PPT
L 7 Plant protoplasts .ppt
PPT
helgesonsomatic.ppt
PPT
17465299.ppt
PPTX
GAMETOGENESIS.pptx
PPTX
23_clark_missing_epistasis.pptx
PPTX
Cell and Cell division.pptx
PPTX
Cell and Cell division.pptx
PPTX
Presentation1.pptx
PPTX
mitosis and meiosis 02152018.pptx
PPTX
mitosis and meiosis 02152018.pptx
Lecture Environmental Biotechnology, Bioremediation 24-12-2019.ppt
Parasitic diseases.pptx
Prenylation.pptx
Prenylation.pptx
Data Retreival, Protein sequence database, Gene Pridiction 3.pptx
COPI and COPII Secretery pathways.pptx
citrus SH presentation.ppt
Molecular markers.ppt
An efficient cucumber (Cucumis sativus L.pdf
L 7 Plant protoplasts .ppt
helgesonsomatic.ppt
17465299.ppt
GAMETOGENESIS.pptx
23_clark_missing_epistasis.pptx
Cell and Cell division.pptx
Cell and Cell division.pptx
Presentation1.pptx
mitosis and meiosis 02152018.pptx
mitosis and meiosis 02152018.pptx

Recently uploaded (20)

PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
modul_python (1).pptx for professional and student
PPT
Image processing and pattern recognition 2.ppt
PPT
Predictive modeling basics in data cleaning process
PPTX
New ISO 27001_2022 standard and the changes
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Business_Capability_Map_Collection__pptx
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPT
DU, AIS, Big Data and Data Analytics.ppt
PPTX
SET 1 Compulsory MNH machine learning intro
PPTX
Leprosy and NLEP programme community medicine
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
A Complete Guide to Streamlining Business Processes
PDF
Transcultural that can help you someday.
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PDF
Navigating the Thai Supplements Landscape.pdf
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
modul_python (1).pptx for professional and student
Image processing and pattern recognition 2.ppt
Predictive modeling basics in data cleaning process
New ISO 27001_2022 standard and the changes
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Business_Capability_Map_Collection__pptx
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
DU, AIS, Big Data and Data Analytics.ppt
SET 1 Compulsory MNH machine learning intro
Leprosy and NLEP programme community medicine
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
SAP 2 completion done . PRESENTATION.pptx
CYBER SECURITY the Next Warefare Tactics
A Complete Guide to Streamlining Business Processes
Transcultural that can help you someday.
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
Navigating the Thai Supplements Landscape.pdf

Microarray data Analysis.pptx

  • 2. • Microarrays can be used in many types of experiments including genotyping, epigenetics, translation profiling and gene expression profiling. • Gene expression profiling is by far the most common use of microarray technology. Both one and two colour microarrays can be used for this type of experiment. The process of analysing gene expression data is similar for both types of microarrays and involves
  • 3. • feature extraction; • quality control; • normalisation; • differential expression analysis; • biological interpretation of the results; • submission of data to a public database.
  • 5. Feature extraction Feature extraction is the process of converting the scanned image of the microarray into quantifiable (computable) values and annotating it with the gene IDs, sample names and other useful information Feature extraction involves the conversion of the scanned microarray image to quantifiable values that are saved in binary (e.g. CEL) or text format.
  • 6. Feature extraction • This process is often performed using the software provided by the microarray manufacturer. The output of this process is raw (i.e. unprocessed) data files that can be in binary or text format • After the feature extraction process, the data can be analysed. Array manufacturers often provide software to open and analyse their raw data files. These programs may not always be available, may become obsolete after a few years, or may not be flexible enough for your needs. There are several free software tools that are suitable for the downstream processing of microarray files. Examples are the Galaxy platform, GenePattern, GeneSpring (licence required) and the statistics software R.
  • 7. Quality control • Quality control of microarray data begins with the visual inspection of the scanned microarray images to make sure that there are no obvious splotches, scratches or blank areas • After feature extraction, the data analysis software packages can be used to make diagnostic plots (for example of background signal, average intensity values and percentage of genes above background) to help identify problematic arrays, reporters or samples
  • 8. Examples of quality control plots made when analysing differential expression data in Expression Atlas. Left to right: Array intensity distributions, PCA plot, density estimates.
  • 9. Normalisation • Normalisation of microarray data is used to control for technical variation between assays, while preserving the biological variation. There are many ways to normalise the data, and the methods used depend on: • the type of array; • the design of the experiment; • assumptions made about the data (e.g. 'the majority of genes represented on the microarray are not expected to be differentially expressed in the test group relative to controls'); • and the package being used to analyse the data.
  • 10. Differential expression analysis • The goal of differential expression analysis is to identify genes whose expression differs under different conditions. An important consideration for differential expression analysis is correction for multiple testing. This is a statistical phenomenon that occurs when thousands of comparisons (e.g. the comparison of expression of multiple genes in multiple conditions) are performed for a small number of samples (most microarray experiments have less than five biological replicates per condition). This leads to an increased chance of false positive results
  • 11. Biological interpretation of gene expression data • Many of the methods for visualising and interpreting microarray data can also be used for RNA-seq experiments.
  • 12. Submission of data to a public repository • Once you have finished generating microarray data (e.g. raw data files and processed/normalised data files are ready), it is important that you submit the data files together with metadata to a public database such as ArrayExpress. This helps to ensure the reproducibility of your experiment and is now a requirement of many journals and funding bodies.