SlideShare a Scribd company logo
Functional and Pathway
                      Analysis
                                          Stewart MacArthur

                                             Bioinformatics Core


                                          March 18th, 2010




Stewart MacArthur (Bioinformatics Core)   Functional and Pathway Analysis   March 18th, 2010   1 / 19
Introduction   The Problem




The Problem
     • High-throughput genomics methods:
         • microarrays
         • next generation sequencing
     • Generate large lists of “interesting” genes




     • How to we summarize?
     • What are the themes of the lists?


Stewart MacArthur (Bioinformatics Core)   Functional and Pathway Analysis   March 18th, 2010   2 / 19
Introduction   The Solution




The Solution

    • Functional Analysis
        • Determine common
          functions
        • Find groups of functionally
          related genes
    • Pathways Analysis
        • Determine common
          pathways
        • Determine potential
          up/down stream regulators




Stewart MacArthur (Bioinformatics Core)   Functional and Pathway Analysis   March 18th, 2010   3 / 19
Enrichment Analysis Methods




The Methods

 Enrichment Analysis
 Are there more of the genes in my list in functional category X than we could
 expect by chance?
     • SEA - Singular Enrichment Analysis
     • MEA - Modular Enrichment Analysis
     • GSEA - Gene Set Enrichment Analysis




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   4 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    Hypergeometric




Brief Aside: Hypergeometric Test
 The hypergeometric test calculates the probability that the number of
 genes in our gene list that are in functional category/pathway X
 occured by chance




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis   March 18th, 2010   5 / 19
Enrichment Analysis Methods    SEA - Singular Enrichment Analysis




SEA - Singular Enrichment Analysis
 Inputs:
     • List of “interesting” genes, e.g. DE genes
     • List of functional annotations e.g. GO annotations




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                   March 18th, 2010   6 / 19
Enrichment Analysis Methods    SEA - Singular Enrichment Analysis




SEA - Singular Enrichment Analysis
 Inputs:
     • List of “interesting” genes, e.g. DE genes
     • List of functional annotations e.g. GO annotations
 Method:
 For each annotation
     • Are more of the genes in our list present than would be expected by
       chance
     • Calculate p-value
 Next annotation

     • Correction for multiple testing




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                   March 18th, 2010   6 / 19
Enrichment Analysis Methods    SEA - Singular Enrichment Analysis




SEA - Singular Enrichment Analysis
 Inputs:
     • List of “interesting” genes, e.g. DE genes
     • List of functional annotations e.g. GO annotations
 Method:
 For each annotation
     • Are more of the genes in our list present than would be expected by
       chance
     • Calculate p-value
 Next annotation

     • Correction for multiple testing

 Output:
     • Ranked list of annotations

Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                   March 18th, 2010   6 / 19
Enrichment Analysis Methods    MEA - Modular Enrichment Analysis




MEA - Modular Enrichment Analysis

     • Extension of SEA
     • Incorporates network discovery algorithms
     • Considers term-to-term relationships
         • Terms not treated as separate tests
         • Uses co-occurrences of terms
     • More closely related to biology
     • Based on assumption that related functional groups have similar
         member genes




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                  March 18th, 2010   7 / 19
Enrichment Analysis Methods    GSEA -Gene Set Enrichment Analysis




GSEA
 No cutoff, uses all genes ranked
 e.g. microarray experiment ranked by fold change or differential expression
 For each functional annotation
      • Are genes randomly distributed in ranked list?
 or
      • Are genes distributed towards the top/bottom of the list?
      • Calculate enrichment score (ES)
      • Calculate significance of ES
 Next annotation
      • Correct for multiple testing



Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                 March 18th, 2010   8 / 19
Enrichment Analysis Methods    GSEA -Gene Set Enrichment Analysis




GSEA




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                 March 18th, 2010   9 / 19
Enrichment Analysis Methods    GSEA -Gene Set Enrichment Analysis




GSEA




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                 March 18th, 2010   9 / 19
Enrichment Analysis Methods    GSEA -Gene Set Enrichment Analysis




GSEA




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                 March 18th, 2010   9 / 19
Enrichment Analysis Methods    GSEA -Gene Set Enrichment Analysis




GSEA




Stewart MacArthur (Bioinformatics Core)       Functional and Pathway Analysis                 March 18th, 2010   9 / 19
Annotation Resources




Annotation Resources

 Where do the gene sets come from?
     • GO - Gene Ontology
     • KEGG - Kyoto Encyclopedia of Genes and Genomes
     • MSigDB - Molecular Signatures Database
     • Pathway Commons
     • ...
     • ...
 Choice of annotation often dictated by choice of tool




Stewart MacArthur (Bioinformatics Core)          Functional and Pathway Analysis   March 18th, 2010   10 / 19
Web based tools




Tools

     • Approximately 68 enrichment tools




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   11 / 19
Web based tools




Tools
     • Here they are:




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   11 / 19
Web based tools




Tools
     • Mainly Web based




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   11 / 19
Web based tools




Tools

     • Mainly Hypergeometric based




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   11 / 19
Web based tools




Recommended Tools


     • SEA - ClueGO, GOStat,
     • MEA - DAVID, GOToolBox
     • GSEA - GeneTrail, FatiScan (Babelomics)

 See Bioinformatics Core Wiki Page for more tools
 http://criwiki.cancerresearchuk.org/




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   12 / 19
Web based tools    David




DAVID http://david.abcc.ncifcrf.gov
 The Database for Annotation, Visualization and Integrated Discovery




    • Over 1,600 DAVID citations
    • 37 nature-branded citations to
        date
    • Daily Usage: 1200 gene
        lists/sublists
    • Daily Usage: 400 unique
        researchers.


Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   13 / 19
Web based tools    David




DAVID http://david.abcc.ncifcrf.gov

 The Database for Annotation, Visualization and Integrated Discovery
     • Identify enriched biological themes
     • Discover enriched functional-related gene groups
     • Cluster redundant annotation terms
     • Visualize genes on BioCarta & KEGG pathway maps
     • Search for other functionally related genes not in the list
     • Convert gene identifiers from one type to another.
     • And more




Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   14 / 19
Web based tools    GeneTrail




GeneTrail



 Annotations include
     • KEGG
     • TRANSPATH
     • TRANSFAC
     • GO
 Methods:
     • Over-Representation Analysis (ORA)
     • Gene Set Enrichment Analysis (GSEA)

Stewart MacArthur (Bioinformatics Core)     Functional and Pathway Analysis   March 18th, 2010   15 / 19
Commercial Tools




Ingenuity Pathways Analysis (IPA)




Stewart MacArthur (Bioinformatics Core)      Functional and Pathway Analysis   March 18th, 2010   16 / 19
Commercial Tools




GeneGo MetaCore




Stewart MacArthur (Bioinformatics Core)      Functional and Pathway Analysis   March 18th, 2010   17 / 19
Commercial Tools




Suraj - GeneGO Demo




Stewart MacArthur (Bioinformatics Core)      Functional and Pathway Analysis   March 18th, 2010   18 / 19
Cytoscape




Cytoscape




Stewart MacArthur (Bioinformatics Core)   Functional and Pathway Analysis   March 18th, 2010   19 / 19

More Related Content

PDF
PPTX
Proteomics
PPTX
Applications of microarray
PPT
Protein database
PPTX
Microarray technique
PPT
Systems biology & Approaches of genomics and proteomics
PPTX
protein data bank
PPT
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
Proteomics
Applications of microarray
Protein database
Microarray technique
Systems biology & Approaches of genomics and proteomics
protein data bank
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES

What's hot (20)

PDF
Gene prediction strategies
PPTX
Proteomics ppt
PPTX
Cytoscape Network Visualization and Analysis
PDF
Basics of Genome Assembly
PPTX
Sequence alignment
PPTX
Microarray technology and applications
PPTX
Pathways and genomes databases in bioinformatics
PPTX
PDF
Tech Talk: UCSC Genome Browser
PPTX
Introduction to Bioinformatics
PPTX
Proteomics
PPTX
Functional proteomics, and tools
PPT
Protein Structure, Databases and Structural Alignment
PDF
Sequence Alignment
PDF
Sequence analysis - Bioinformatics
PPT
EST Clustering.ppt
PPT
PPTX
Methods of Protein structure determination
PPTX
Sequence similarity tools.pptx
PPTX
DNA MICRO ARRAY TECHNIQUE
Gene prediction strategies
Proteomics ppt
Cytoscape Network Visualization and Analysis
Basics of Genome Assembly
Sequence alignment
Microarray technology and applications
Pathways and genomes databases in bioinformatics
Tech Talk: UCSC Genome Browser
Introduction to Bioinformatics
Proteomics
Functional proteomics, and tools
Protein Structure, Databases and Structural Alignment
Sequence Alignment
Sequence analysis - Bioinformatics
EST Clustering.ppt
Methods of Protein structure determination
Sequence similarity tools.pptx
DNA MICRO ARRAY TECHNIQUE
Ad

Viewers also liked (20)

PPT
Gene Ontology Enrichment Network Analysis -Tutorial
PDF
Pathway analysis 2012
PPTX
6 metabolite enrichment analysis
PDF
Cytoscape Talk 2010
PDF
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
PDF
Introduction
PDF
Gary bader fged_toronto_2012
PDF
NetBioSIG2012 anyatsalenko-en-viz
ODP
Genes associated with genotype-specific DNA methylation in squamous cell carc...
PDF
Integrative Analysis of Gene Expression and Promoter Methylation during Repro...
PPT
In search of tissue specific regulators in periodontium - a bioinformatic ap...
PDF
Quatitative assessment of chromatin sturcture
PPT
Plegable biología molecular
PDF
Dna methylation field guide 20130806
PDF
TINAGL1 and B3GALNT1 are potential therapy target genes to suppress metastasi...
PDF
Identification of aberrant gene expression associated with aberrant promoter ...
PDF
Семинар ДНК 16/05/2014 Сибэнзим
PDF
<마더리스크> biomarkers of methylation
Gene Ontology Enrichment Network Analysis -Tutorial
Pathway analysis 2012
6 metabolite enrichment analysis
Cytoscape Talk 2010
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Introduction
Gary bader fged_toronto_2012
NetBioSIG2012 anyatsalenko-en-viz
Genes associated with genotype-specific DNA methylation in squamous cell carc...
Integrative Analysis of Gene Expression and Promoter Methylation during Repro...
In search of tissue specific regulators in periodontium - a bioinformatic ap...
Quatitative assessment of chromatin sturcture
Plegable biología molecular
Dna methylation field guide 20130806
TINAGL1 and B3GALNT1 are potential therapy target genes to suppress metastasi...
Identification of aberrant gene expression associated with aberrant promoter ...
Семинар ДНК 16/05/2014 Сибэнзим
<마더리스크> biomarkers of methylation
Ad

Similar to Functional And Pathway Analysis 2010 (20)

PDF
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
PDF
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
PDF
Integration of knowledge for personalized medicine: a pharmacogenomics case-s...
PPTX
Omic Data Integration Strategies
PDF
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
PDF
RNA-seq for DE analysis: the biology behind observed changes - part 6
PPT
Gene Ontology Network Enrichment Analysis
DOCX
BioIn_Pap1dramaprevenbtionsakcnnshejjsja
PDF
Computational Analysis in an extended model of E. Coli
PPTX
Pathway analysis for genomics data
PPTX
Cool Informatics Tools and Services for Biomedical Research
PDF
N-of-1-pathways MixEnrich
PDF
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
PDF
Algal Functional Annotation Tool
PPT
Kishor Presentation
DOCX
Statistical SignificancePieceFinal
PPTX
Eisai-2
PDF
Tutorial 1.3 - Run Enrichment Analysis
PPTX
Bioinformatics and functional genomics
Introduction to Functional Analysis with IPA (UEB-UAT Bioinformatics Course -...
A Review of Various Methods Used in the Analysis of Functional Gene Expressio...
Integration of knowledge for personalized medicine: a pharmacogenomics case-s...
Omic Data Integration Strategies
COMPUTATIONAL METHODS FOR FUNCTIONAL ANALYSIS OF GENE EXPRESSION
RNA-seq for DE analysis: the biology behind observed changes - part 6
Gene Ontology Network Enrichment Analysis
BioIn_Pap1dramaprevenbtionsakcnnshejjsja
Computational Analysis in an extended model of E. Coli
Pathway analysis for genomics data
Cool Informatics Tools and Services for Biomedical Research
N-of-1-pathways MixEnrich
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Algal Functional Annotation Tool
Kishor Presentation
Statistical SignificancePieceFinal
Eisai-2
Tutorial 1.3 - Run Enrichment Analysis
Bioinformatics and functional genomics

Recently uploaded (20)

PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Empowerment Technology for Senior High School Guide
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Complications of Minimal Access Surgery at WLH
PDF
Indian roads congress 037 - 2012 Flexible pavement
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
Classroom Observation Tools for Teachers
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
advance database management system book.pdf
PDF
1_English_Language_Set_2.pdf probationary
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PPTX
History, Philosophy and sociology of education (1).pptx
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
Orientation - ARALprogram of Deped to the Parents.pptx
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Empowerment Technology for Senior High School Guide
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Complications of Minimal Access Surgery at WLH
Indian roads congress 037 - 2012 Flexible pavement
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
Paper A Mock Exam 9_ Attempt review.pdf.
Classroom Observation Tools for Teachers
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
advance database management system book.pdf
1_English_Language_Set_2.pdf probationary
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
History, Philosophy and sociology of education (1).pptx
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين

Functional And Pathway Analysis 2010

  • 1. Functional and Pathway Analysis Stewart MacArthur Bioinformatics Core March 18th, 2010 Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 1 / 19
  • 2. Introduction The Problem The Problem • High-throughput genomics methods: • microarrays • next generation sequencing • Generate large lists of “interesting” genes • How to we summarize? • What are the themes of the lists? Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 2 / 19
  • 3. Introduction The Solution The Solution • Functional Analysis • Determine common functions • Find groups of functionally related genes • Pathways Analysis • Determine common pathways • Determine potential up/down stream regulators Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 3 / 19
  • 4. Enrichment Analysis Methods The Methods Enrichment Analysis Are there more of the genes in my list in functional category X than we could expect by chance? • SEA - Singular Enrichment Analysis • MEA - Modular Enrichment Analysis • GSEA - Gene Set Enrichment Analysis Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 4 / 19
  • 5. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 6. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 7. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 8. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 9. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 10. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 11. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 12. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 13. Enrichment Analysis Methods Hypergeometric Brief Aside: Hypergeometric Test The hypergeometric test calculates the probability that the number of genes in our gene list that are in functional category/pathway X occured by chance Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 5 / 19
  • 14. Enrichment Analysis Methods SEA - Singular Enrichment Analysis SEA - Singular Enrichment Analysis Inputs: • List of “interesting” genes, e.g. DE genes • List of functional annotations e.g. GO annotations Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 6 / 19
  • 15. Enrichment Analysis Methods SEA - Singular Enrichment Analysis SEA - Singular Enrichment Analysis Inputs: • List of “interesting” genes, e.g. DE genes • List of functional annotations e.g. GO annotations Method: For each annotation • Are more of the genes in our list present than would be expected by chance • Calculate p-value Next annotation • Correction for multiple testing Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 6 / 19
  • 16. Enrichment Analysis Methods SEA - Singular Enrichment Analysis SEA - Singular Enrichment Analysis Inputs: • List of “interesting” genes, e.g. DE genes • List of functional annotations e.g. GO annotations Method: For each annotation • Are more of the genes in our list present than would be expected by chance • Calculate p-value Next annotation • Correction for multiple testing Output: • Ranked list of annotations Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 6 / 19
  • 17. Enrichment Analysis Methods MEA - Modular Enrichment Analysis MEA - Modular Enrichment Analysis • Extension of SEA • Incorporates network discovery algorithms • Considers term-to-term relationships • Terms not treated as separate tests • Uses co-occurrences of terms • More closely related to biology • Based on assumption that related functional groups have similar member genes Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 7 / 19
  • 18. Enrichment Analysis Methods GSEA -Gene Set Enrichment Analysis GSEA No cutoff, uses all genes ranked e.g. microarray experiment ranked by fold change or differential expression For each functional annotation • Are genes randomly distributed in ranked list? or • Are genes distributed towards the top/bottom of the list? • Calculate enrichment score (ES) • Calculate significance of ES Next annotation • Correct for multiple testing Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 8 / 19
  • 19. Enrichment Analysis Methods GSEA -Gene Set Enrichment Analysis GSEA Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 9 / 19
  • 20. Enrichment Analysis Methods GSEA -Gene Set Enrichment Analysis GSEA Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 9 / 19
  • 21. Enrichment Analysis Methods GSEA -Gene Set Enrichment Analysis GSEA Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 9 / 19
  • 22. Enrichment Analysis Methods GSEA -Gene Set Enrichment Analysis GSEA Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 9 / 19
  • 23. Annotation Resources Annotation Resources Where do the gene sets come from? • GO - Gene Ontology • KEGG - Kyoto Encyclopedia of Genes and Genomes • MSigDB - Molecular Signatures Database • Pathway Commons • ... • ... Choice of annotation often dictated by choice of tool Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 10 / 19
  • 24. Web based tools Tools • Approximately 68 enrichment tools Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 11 / 19
  • 25. Web based tools Tools • Here they are: Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 11 / 19
  • 26. Web based tools Tools • Mainly Web based Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 11 / 19
  • 27. Web based tools Tools • Mainly Hypergeometric based Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 11 / 19
  • 28. Web based tools Recommended Tools • SEA - ClueGO, GOStat, • MEA - DAVID, GOToolBox • GSEA - GeneTrail, FatiScan (Babelomics) See Bioinformatics Core Wiki Page for more tools http://criwiki.cancerresearchuk.org/ Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 12 / 19
  • 29. Web based tools David DAVID http://david.abcc.ncifcrf.gov The Database for Annotation, Visualization and Integrated Discovery • Over 1,600 DAVID citations • 37 nature-branded citations to date • Daily Usage: 1200 gene lists/sublists • Daily Usage: 400 unique researchers. Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 13 / 19
  • 30. Web based tools David DAVID http://david.abcc.ncifcrf.gov The Database for Annotation, Visualization and Integrated Discovery • Identify enriched biological themes • Discover enriched functional-related gene groups • Cluster redundant annotation terms • Visualize genes on BioCarta & KEGG pathway maps • Search for other functionally related genes not in the list • Convert gene identifiers from one type to another. • And more Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 14 / 19
  • 31. Web based tools GeneTrail GeneTrail Annotations include • KEGG • TRANSPATH • TRANSFAC • GO Methods: • Over-Representation Analysis (ORA) • Gene Set Enrichment Analysis (GSEA) Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 15 / 19
  • 32. Commercial Tools Ingenuity Pathways Analysis (IPA) Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 16 / 19
  • 33. Commercial Tools GeneGo MetaCore Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 17 / 19
  • 34. Commercial Tools Suraj - GeneGO Demo Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 18 / 19
  • 35. Cytoscape Cytoscape Stewart MacArthur (Bioinformatics Core) Functional and Pathway Analysis March 18th, 2010 19 / 19