SlideShare a Scribd company logo
Basic bioinformatics concepts, databases and tools Module 5 Genome browsers and  interpretation of  gene lists Dr. Joachim Jacob http://www.bits.vib.be Updated 21 July 2011 http://dl.dropbox.com/u/18352887/BITS_training_material/Link%20to%20mod5-intro_H1_2011_genomebrowsers.pdf
Integrating biological information Genome databases and browsers Integration on a species basis all biological information: Ensembl Genome Browser http://www.ensembl.org/ Table Browsers Retrieving biological (not only sequence) data applying various criteria: Biomart http://www.biomart.org/ Interpreting gene lists 'What is the biology behind my gene list': DAVID http://david.abcc.ncifcrf.gov/
Reference genome sequences provide a standard genome sequence per species  Genomes  From various sequence sources, a genome is  assembled By NCBI: currently assembly 37 in human (or 'build') (2010)  By Celera: commercial Each build differs! 1. Data freeze: all data for assembling (ignoring new data from that point) 2. Assembly process and annotation 3. Release of the Build: Reference Sequence Genom e http://www.ncbi.nlm.nih.gov/Genomes/
 
Finding your way in genomes Annotation and terms See also  NCBI handbook Locus = place on the genome, ~ a gene (different alleles) Location: Rough location by staining of chromosomes e.g. 18q12.1 -> chromosome 18, long arm (=q, small arm is p) Exact bases on genomes (assembly must be mentioned!)
Genome Browsers: main players Three main players  MapViewer (NCBI) UCSC Genome Browser Ensembl Genome browser BITS UCSC Genome Browser training BITS Ensembl Genome Browser training
Ensembl Genome browser We will use this browser in this session Information is combination of   automatic  annotation and  manually curated  s ources (ENS >< Havana (Vega) genes) All entries can be accessed through the browser, each with its own clear identifiers
28 November 2009 [email_address] /10 http://www.ensembl.org Information about the genomes
http://www.ensemblgenomes.org
[email_address] /10 ! …  or click on the figure feature!
28 November 2009 [email_address] /10
28 November 2009 [email_address] /10 [email_address]
TAB SUMMARY DETAILED INFORMATION INFOR-MATION SELEC-TOR DATA MANAGER tab DAS
Ensembl Genome browser Usefulness: One place for all information on a particular gene / structure / location / variation But also:  Comparison to other species The Ensembl Team has a lot of training movies and examples available. Check them out! http://www.ensembl.org/info/index.html http://www.ensembl.org/Help/Movie?id=188
Ensembl Genome browser Usefulness: One place for all information on a particular gene / structure / location / variation But also:  Comparison to other species The Ensembl Team has a lot of training movies and examples available. Check them out! http://www.ensembl.org/info/index.html http://www.ensembl.org/Help/Movie?id=188
Tracks are a way to display information on a genome sequence The annotation on a genome-wide scale is displayed in tracks.  Relevant database content can be formatted in tracks and displayed on a reference genome Genome reference tracks Screenshot of Ensembl genome browser
Tracks are a way to display information on a genome sequence The annotation on a genome-wide scale is displayed in tracks, most used formats: - each base receives a value: dense continuous data:  WIG format  (e.g. %GC) - annotation has a start and a stop coordinate:  bed format  (e.g. gene annotations) Example Variations in genomes are reported in vcf format http://www.ensembl.org/info/website/upload/bed.html http://www.bits.vib.be/wiki/index.php/.vcf #CHROM POS  ID  REF  ALT  QUAL FILTER INFO  FORMAT  20  14370  rs6054257 G  A  29  PASS  NS=3;DP=14;AF=0.5;DB;H2  GT:GQ:DP:HQ 20  17330  .  T  A  3  q10  NS=3;DP=11;AF=0.017  GT:GQ:DP:HQ
Biomart, your one stop portal to fetch information Biomart  http://www.biomart.org/   These questions are easy: Hey, can you tell me how many genes in mouse  exist which regulate transcription and are located on  Chromosome 19 ?
Biomart, your one stop portal to fetch information Biomart  http://www.biomart.org/   These questions are easy: Hey, can you tell me  how many   genes  in  mouse   exist which  regulate transcription  and are located on  Chromosome 19  ? Ensembl  Genes Genome sequence (Ensembl) Gene Ontology GO:0009299
Biomart, your one stop portal to fetch information Biomart  http://www.biomart.org/   Translated questions reflect in database choice and  Filters Resulting genes are counted and the output set via  Attributes
Biomart is available for an increasing number of databases Biomart http://www.biomart.org/
Gene lists resulting from different analyses can reveal their biology  DAVID -  http://david.abcc.ncifcrf.gov/
Gene lists resulting from different analyses can reveal their biology  DAVID -  http://david.abcc.ncifcrf.gov/   DEMO Alternatives g:Profiler http://biit.cs.ut.ee/gprofiler/ Babelomics http://www.babelomics.org/
Galaxy allows you to store your data and to (re)analyse it conveniently Galaxy -  http://usegalaxy.org
Galaxy allows you to store your data and to (re)analyse it conveniently Galaxy -  http://usegalaxy.org   DEMO TOOLS RESULTS DATA SETS

More Related Content

DOCX
Bioinformatics on internet
PPTX
Secondary protein structure prediction
PPTX
Protein database
PPTX
Sequence alignment
PPTX
Multiple sequence alignment
PPTX
clustal omega.pptx
Bioinformatics on internet
Secondary protein structure prediction
Protein database
Sequence alignment
Multiple sequence alignment
clustal omega.pptx

What's hot (20)

PPTX
EMBL-EBI
PPTX
Biological databases
PDF
The ensembl database
PPT
Stem cell culture
PDF
Bioinformatics data mining
PPTX
PPTX
Nucleic acid and protein databanks
PPTX
Biological databases
PPTX
Uni prot presentation
PPTX
Primary Bioinformatics Database.pptx
PPTX
Sequence homology search and multiple sequence alignment(1)
PPT
The uni prot knowledgebase
PPTX
Cryopreservation ( methods and application)
PPT
Clustal
PPTX
Swiss prot database
PPTX
Protein structure analysis
DOCX
Notes for Cell Culture Basic Techniques
PDF
Bioinformatics biological databases
PPT
ENTREZ.ppt
EMBL-EBI
Biological databases
The ensembl database
Stem cell culture
Bioinformatics data mining
Nucleic acid and protein databanks
Biological databases
Uni prot presentation
Primary Bioinformatics Database.pptx
Sequence homology search and multiple sequence alignment(1)
The uni prot knowledgebase
Cryopreservation ( methods and application)
Clustal
Swiss prot database
Protein structure analysis
Notes for Cell Culture Basic Techniques
Bioinformatics biological databases
ENTREZ.ppt
Ad

Viewers also liked (20)

PDF
BITS: Basics of sequence databases
PDF
BITS: Basics of Sequence similarity
PDF
BITS: Basics of sequence analysis
PDF
BITS: Overview of important biological databases beyond sequences
PPT
Bioinformatics
PPT
L01 ecture 01-
PDF
GenomeBrowser
PDF
Bioinformatics in dermato-oncology
PPT
B.sc biochem i bobi u 3.2 algorithm + blast
PPTX
How to evaluate the usefulness of digital libraries
PPT
ARC VIEW GEOGRAPHICAL INFORMATION SYSTEM (GIS)
PDF
An introduction to geographic information systems (gis) m goulbourne 2007
PPTX
B.sc biochem i bobi u 4 gene prediction
PDF
September 1 Day Workshop
PPT
DRUG DESIGN BASED ON BIOINFORMATICS TOOLS
PPT
Dotplots for Bioinformatics
PPT
Bioinformatics and Drug Discovery
PPT
Multiple sequence alignment
PPTX
Geographical information system : GIS and Social Media
PPT
Computer aided drug designing
BITS: Basics of sequence databases
BITS: Basics of Sequence similarity
BITS: Basics of sequence analysis
BITS: Overview of important biological databases beyond sequences
Bioinformatics
L01 ecture 01-
GenomeBrowser
Bioinformatics in dermato-oncology
B.sc biochem i bobi u 3.2 algorithm + blast
How to evaluate the usefulness of digital libraries
ARC VIEW GEOGRAPHICAL INFORMATION SYSTEM (GIS)
An introduction to geographic information systems (gis) m goulbourne 2007
B.sc biochem i bobi u 4 gene prediction
September 1 Day Workshop
DRUG DESIGN BASED ON BIOINFORMATICS TOOLS
Dotplots for Bioinformatics
Bioinformatics and Drug Discovery
Multiple sequence alignment
Geographical information system : GIS and Social Media
Computer aided drug designing
Ad

Similar to BITs: Genome browsers and interpretation of gene lists. (20)

PPTX
Understanding Genome
PDF
BITS: UCSC genome browser - Part 1
PDF
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
PPT
Ensembl genome
PPTX
Role of ensembl in genome browsing
PDF
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
 
PPT
RML NCBI Resources
PPTX
Web based servers and softwares for genome analysis
PPTX
Genomic databases
PPTX
Ncbi basic intro_v_pitt_kent_osu
PDF
Publicly available tools and open resources in Bioinformatics
PDF
Genes and Transcripts: Ensembl Online Webinar series
PDF
Ensembl Browser Workshop
PDF
Browsing Genes, Variation and Regulation data with Ensembl
PPTX
Nucleic acid database
PPTX
Genomic Databases-.pptx
PDF
Vb tutorial-genome browser2010
PPT
PPT
Bioinformatics - Discovering the Bio Logic Of Nature
PPTX
BITS training - UCSC Genome Browser - Part 2
Understanding Genome
BITS: UCSC genome browser - Part 1
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Ensembl genome
Role of ensembl in genome browsing
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
 
RML NCBI Resources
Web based servers and softwares for genome analysis
Genomic databases
Ncbi basic intro_v_pitt_kent_osu
Publicly available tools and open resources in Bioinformatics
Genes and Transcripts: Ensembl Online Webinar series
Ensembl Browser Workshop
Browsing Genes, Variation and Regulation data with Ensembl
Nucleic acid database
Genomic Databases-.pptx
Vb tutorial-genome browser2010
Bioinformatics - Discovering the Bio Logic Of Nature
BITS training - UCSC Genome Browser - Part 2

More from BITS (20)

PDF
RNA-seq for DE analysis: detecting differential expression - part 5
PDF
RNA-seq for DE analysis: extracting counts and QC - part 4
PDF
RNA-seq for DE analysis: the biology behind observed changes - part 6
PDF
RNA-seq: analysis of raw data and preprocessing - part 2
PDF
RNA-seq: general concept, goal and experimental design - part 1
PDF
RNA-seq: Mapping and quality control - part 3
PDF
Productivity tips - Introduction to linux for bioinformatics
PDF
Text mining on the command line - Introduction to linux for bioinformatics
PDF
The structure of Linux - Introduction to Linux for bioinformatics
PDF
Managing your data - Introduction to Linux for bioinformatics
PDF
Introduction to Linux for bioinformatics
PDF
BITS - Genevestigator to easily access transcriptomics data
PDF
BITS - Comparative genomics: the Contra tool
PDF
BITS - Comparative genomics on the genome level
PDF
BITS - Comparative genomics: gene family analysis
PDF
BITS - Introduction to comparative genomics
PDF
BITS - Protein inference from mass spectrometry data
PDF
BITS - Overview of sequence databases for mass spectrometry data analysis
PDF
BITS - Search engines for mass spec data
PDF
BITS - Introduction to proteomics
RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: extracting counts and QC - part 4
RNA-seq for DE analysis: the biology behind observed changes - part 6
RNA-seq: analysis of raw data and preprocessing - part 2
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: Mapping and quality control - part 3
Productivity tips - Introduction to linux for bioinformatics
Text mining on the command line - Introduction to linux for bioinformatics
The structure of Linux - Introduction to Linux for bioinformatics
Managing your data - Introduction to Linux for bioinformatics
Introduction to Linux for bioinformatics
BITS - Genevestigator to easily access transcriptomics data
BITS - Comparative genomics: the Contra tool
BITS - Comparative genomics on the genome level
BITS - Comparative genomics: gene family analysis
BITS - Introduction to comparative genomics
BITS - Protein inference from mass spectrometry data
BITS - Overview of sequence databases for mass spectrometry data analysis
BITS - Search engines for mass spec data
BITS - Introduction to proteomics

Recently uploaded (20)

PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Pharma ospi slides which help in ospi learning
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Insiders guide to clinical Medicine.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
master seminar digital applications in india
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
VCE English Exam - Section C Student Revision Booklet
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
human mycosis Human fungal infections are called human mycosis..pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
Cell Structure & Organelles in detailed.
Module 4: Burden of Disease Tutorial Slides S2 2025
Pharma ospi slides which help in ospi learning
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
O5-L3 Freight Transport Ops (International) V1.pdf
Renaissance Architecture: A Journey from Faith to Humanism
O7-L3 Supply Chain Operations - ICLT Program
Insiders guide to clinical Medicine.pdf
RMMM.pdf make it easy to upload and study
master seminar digital applications in india
102 student loan defaulters named and shamed – Is someone you know on the list?
VCE English Exam - Section C Student Revision Booklet

BITs: Genome browsers and interpretation of gene lists.

  • 1. Basic bioinformatics concepts, databases and tools Module 5 Genome browsers and interpretation of gene lists Dr. Joachim Jacob http://www.bits.vib.be Updated 21 July 2011 http://dl.dropbox.com/u/18352887/BITS_training_material/Link%20to%20mod5-intro_H1_2011_genomebrowsers.pdf
  • 2. Integrating biological information Genome databases and browsers Integration on a species basis all biological information: Ensembl Genome Browser http://www.ensembl.org/ Table Browsers Retrieving biological (not only sequence) data applying various criteria: Biomart http://www.biomart.org/ Interpreting gene lists 'What is the biology behind my gene list': DAVID http://david.abcc.ncifcrf.gov/
  • 3. Reference genome sequences provide a standard genome sequence per species Genomes From various sequence sources, a genome is assembled By NCBI: currently assembly 37 in human (or 'build') (2010) By Celera: commercial Each build differs! 1. Data freeze: all data for assembling (ignoring new data from that point) 2. Assembly process and annotation 3. Release of the Build: Reference Sequence Genom e http://www.ncbi.nlm.nih.gov/Genomes/
  • 4.  
  • 5. Finding your way in genomes Annotation and terms See also NCBI handbook Locus = place on the genome, ~ a gene (different alleles) Location: Rough location by staining of chromosomes e.g. 18q12.1 -> chromosome 18, long arm (=q, small arm is p) Exact bases on genomes (assembly must be mentioned!)
  • 6. Genome Browsers: main players Three main players MapViewer (NCBI) UCSC Genome Browser Ensembl Genome browser BITS UCSC Genome Browser training BITS Ensembl Genome Browser training
  • 7. Ensembl Genome browser We will use this browser in this session Information is combination of automatic annotation and manually curated s ources (ENS >< Havana (Vega) genes) All entries can be accessed through the browser, each with its own clear identifiers
  • 8. 28 November 2009 [email_address] /10 http://www.ensembl.org Information about the genomes
  • 10. [email_address] /10 ! … or click on the figure feature!
  • 11. 28 November 2009 [email_address] /10
  • 12. 28 November 2009 [email_address] /10 [email_address]
  • 13. TAB SUMMARY DETAILED INFORMATION INFOR-MATION SELEC-TOR DATA MANAGER tab DAS
  • 14. Ensembl Genome browser Usefulness: One place for all information on a particular gene / structure / location / variation But also: Comparison to other species The Ensembl Team has a lot of training movies and examples available. Check them out! http://www.ensembl.org/info/index.html http://www.ensembl.org/Help/Movie?id=188
  • 15. Ensembl Genome browser Usefulness: One place for all information on a particular gene / structure / location / variation But also: Comparison to other species The Ensembl Team has a lot of training movies and examples available. Check them out! http://www.ensembl.org/info/index.html http://www.ensembl.org/Help/Movie?id=188
  • 16. Tracks are a way to display information on a genome sequence The annotation on a genome-wide scale is displayed in tracks. Relevant database content can be formatted in tracks and displayed on a reference genome Genome reference tracks Screenshot of Ensembl genome browser
  • 17. Tracks are a way to display information on a genome sequence The annotation on a genome-wide scale is displayed in tracks, most used formats: - each base receives a value: dense continuous data: WIG format (e.g. %GC) - annotation has a start and a stop coordinate: bed format (e.g. gene annotations) Example Variations in genomes are reported in vcf format http://www.ensembl.org/info/website/upload/bed.html http://www.bits.vib.be/wiki/index.php/.vcf #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT 20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ
  • 18. Biomart, your one stop portal to fetch information Biomart http://www.biomart.org/ These questions are easy: Hey, can you tell me how many genes in mouse exist which regulate transcription and are located on Chromosome 19 ?
  • 19. Biomart, your one stop portal to fetch information Biomart http://www.biomart.org/ These questions are easy: Hey, can you tell me how many genes in mouse exist which regulate transcription and are located on Chromosome 19 ? Ensembl Genes Genome sequence (Ensembl) Gene Ontology GO:0009299
  • 20. Biomart, your one stop portal to fetch information Biomart http://www.biomart.org/ Translated questions reflect in database choice and Filters Resulting genes are counted and the output set via Attributes
  • 21. Biomart is available for an increasing number of databases Biomart http://www.biomart.org/
  • 22. Gene lists resulting from different analyses can reveal their biology DAVID - http://david.abcc.ncifcrf.gov/
  • 23. Gene lists resulting from different analyses can reveal their biology DAVID - http://david.abcc.ncifcrf.gov/ DEMO Alternatives g:Profiler http://biit.cs.ut.ee/gprofiler/ Babelomics http://www.babelomics.org/
  • 24. Galaxy allows you to store your data and to (re)analyse it conveniently Galaxy - http://usegalaxy.org
  • 25. Galaxy allows you to store your data and to (re)analyse it conveniently Galaxy - http://usegalaxy.org DEMO TOOLS RESULTS DATA SETS