PHYLOGENETICS
An introduction to the concepts and analysis
using MEGA 6.0
Today’s Objectives
• To introduce the basic concepts involved in phylogenetic
analysis.
• To learn the usage of the phylogenetic package MEGA
6.0
• To discuss the manner in which you can apply
phylogenetic analysis in your research approach, thesis
and publications.
Why use Phylogenetics?
• The human mind is naturally inclined to classify
information.
• Classification facilitates logical understanding as well as
the detection of heuristic patterns within data sets.
• Logical understanding of a process facilitates the process
of discovery.
Where will it be of use to
me?
• Classifying my sequence data within a global
perspective.
• Finding unique regions within my sequence data by
comparison with a global data set.
• Identification of genes which have not yet been widely
characterized.
• Infinitely many possibilities
Traditional Classification
schemes
• Based on Phenotypic traits (Phenetic) and taxonomic
classifiers (TU)
• Low level of resolution
• Not applicable to molecular data
• Difficult to resolve taxonomic ambiguities at higher
levels.
From TUs to Genomic
databases
• DNA technology prompted a quantum shift in the
resolving power of phylogenetics.
• TU: < 100 classifiers
• Amino Acids: Millions of combinations of AAs
• Genomic level: Billions of bp of nucleotide data
Does more information solve the problem?
[Chart: RESOLUTION for taxonomic unit, protein, and nucleic acid data; axis scale 0 to 1,000,000.]
Species trees
• A species tree establishes the hierarchy of a species
within a globally accepted framework of classification.
• ITS: 16S
• ITS: rDNA
• ITS: chloroplast and mitochondria
• Genes: rbcL, ADH, cytC, Ig(SC)
Crab rRNA sequence data used to construct a UPGMA tree. Note the out-group
species that has been added to establish a perspective scale.
Gene trees
• Gene trees facilitate the understanding of evolutionary
processes occurring within genes across taxa or within a
species.
• The rates of evolution offer insights into the manner in
which genes evolve as a family.
• Gene trees can be transformed into species trees if they
conform to evolutionary criteria.
Species vs. Gene trees
• Which one do we select?
The choice is determined by what we intend to characterize:
Is it the organism within a genus / species? OR
Is it a gene which is distributed across taxa?
Molecular taxonomy
based on genes
• Prokaryotes: 16S rDNA
• Higher organisms: ITS rDNA, Cp, Mt
• Do you want an evolutionary tree?
• Does your “molecular tree” corroborate your “taxonomic
tree”?
[Figure: Gene tree constructed using the Alcohol Dehydrogenase (ADH) gene from
Drosophila spp. (UPGMA); scale 0.00 to 0.25.
Taxa shown: D. affinidisjuncta, D. heteroneura, D. mimica, D. adiastola, D. nigra,
S. albovittata, D. crassifemur, S. lebanonensis, D. mulleri, D. melanogaster,
D. pseudoobscura.]
The molecular clock
• A digital clock displays time as the cumulative count of the
oscillations of a quartz crystal.
• A molecular clock depicts evolution as the accumulation of
nucleotide / amino acid changes as a function of time.
A highly simplified and idealized
molecular clock ! The red bar is a
gene, the colored bars represent
nucleotide positions which change as
a function of time.
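The clock idea can be made concrete with a small calculation. Under a strict (constant-rate) clock, the distance d between two sequences grows as d = 2rt, where r is the substitution rate along one lineage and t is the time since divergence. The sketch below is only an illustration: the function name and the numbers are assumptions, not values from these slides.

```python
def divergence_time(distance, rate):
    """Estimate time since two lineages split under a strict molecular clock.

    distance: substitutions per site between the two sequences
    rate:     substitutions per site per year along ONE lineage (assumed constant)
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    # Both lineages accumulate changes independently, hence the factor of 2.
    return distance / (2.0 * rate)


# Illustrative numbers only (hypothetical distance and rate).
t = divergence_time(0.02, 1e-9)
print(f"{t:.3g} years")
```

Real data rarely follow a single constant rate, which is why MEGA offers clock tests before such an estimate is trusted.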
Phylogenetic trees
• Distance based methods: inclusive
• Maximum parsimony methods: assumptive
NJT
• Constructed purely on the basis of pairwise genetic
distance.
• No prior assumptions are made pertaining to tree
topology and branch lengths
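Since distance methods start from a matrix of pairwise genetic distances, it may help to see how such a matrix could be computed from an alignment. The following is a minimal sketch, assuming nucleotide sequences that are already aligned and using the p-distance with a Jukes-Cantor correction. MEGA offers many other distance models, and the function names and toy sequences here are hypothetical.

```python
import math


def p_distance(seq_a, seq_b):
    """Proportion of differing sites, skipping positions with a gap in either sequence."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def jukes_cantor(p):
    """Jukes-Cantor correction for multiple substitutions at the same site."""
    if p >= 0.75:
        raise ValueError("p-distance too large for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(alignment):
    """alignment: dict of name -> aligned sequence. Returns (names, square matrix)."""
    names = list(alignment)
    n = len(names)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = jukes_cantor(p_distance(alignment[names[i]], alignment[names[j]]))
            matrix[i][j] = matrix[j][i] = d
    return names, matrix


# Toy alignment, invented for illustration.
toy = {"A": "ACGTACGTAC", "B": "ACGTACGTAT", "C": "ACGAACGTTT"}
names, dm = distance_matrix(toy)
for name, row in zip(names, dm):
    print(name, " ".join(f"{value:.3f}" for value in row))
```

A tree-building method such as NJ or UPGMA then works on this matrix alone, without looking at the sequences again.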
[Figure: Neighbor Joining Tree (NJT) based on human genetic distance matrix:
compares Pairwise Genetic Distances only; scale bar 0.01.
Populations shown: Japanese, Korean, Southern Chinese, Australian, Papuan,
North Amerind, South Amerind, Finn, Italian, German, English, San, Bantu,
Pygmy, Nigerian.]
UPGMA
• Originally developed for Phenogram construction (Sokal &
Michener, 1958)
• Adapted for Dendrogram construction
• Can be used when there is a correlation between the distance
measure used and the evolutionary timescale.
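The UPGMA procedure is simple enough to sketch in a few lines: repeatedly join the two closest clusters, place the new node at half their distance, and average the distances to the remaining clusters weighted by cluster size. This is a minimal illustration rather than MEGA's implementation; the function name and the Newick formatting choices are assumptions.

```python
def upgma(names, matrix):
    """Cluster taxa by UPGMA and return a Newick string with branch lengths.

    names:  list of taxon labels
    matrix: symmetric list-of-lists of pairwise distances, same order as names
    """
    # Each active cluster: id -> (newick_text, size, height_above_tips)
    clusters = {i: (name, 1, 0.0) for i, name in enumerate(names)}
    dist = {(i, j): matrix[i][j]
            for i in range(len(names)) for j in range(i + 1, len(names))}
    next_id = len(names)

    while len(clusters) > 1:
        # Join the closest pair of clusters.
        (a, b), d = min(dist.items(), key=lambda item: item[1])
        text_a, size_a, h_a = clusters.pop(a)
        text_b, size_b, h_b = clusters.pop(b)
        height = d / 2.0  # constant-rate assumption: node sits halfway between the pair
        newick = f"({text_a}:{height - h_a:.4f},{text_b}:{height - h_b:.4f})"

        # Distance from the merged cluster to each remaining cluster is the
        # size-weighted average of the two old distances.
        new_dist = {}
        for c in clusters:
            d_ac = dist[tuple(sorted((a, c)))]
            d_bc = dist[tuple(sorted((b, c)))]
            new_dist[(c, next_id)] = (size_a * d_ac + size_b * d_bc) / (size_a + size_b)
        dist = {k: v for k, v in dist.items() if a not in k and b not in k}
        dist.update(new_dist)
        clusters[next_id] = (newick, size_a + size_b, height)
        next_id += 1

    tree = next(iter(clusters.values()))[0]
    return tree + ";"


# Toy distances, invented for illustration.
print(upgma(["A", "B", "C"], [[0, 2, 6], [2, 0, 6], [6, 6, 0]]))
```

Because every tip ends up at the same distance from the root, the result is only meaningful when the distance measure tracks evolutionary time, as the last bullet above notes.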
[Figure: UPGMA tree based on human genetic distance matrix:
Assumes a constant rate molecular clock; scale 0.00 to 0.05.
Populations shown: Japanese, Korean, Southern Chinese, North Amerind,
South Amerind, Italian, Finn, German, English, Australian, Papuan, San,
Pygmy, Nigerian, Bantu.]
VALIDATION:
Bootstrapping
• The concept of parsimony.
• This is a method of re-sampling with replacement from the
same data matrix.
• It allows calculation of standard deviations and variances.
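The re-sampling step can be sketched as follows: each pseudo-replicate is built by drawing alignment columns at random, with replacement, until it is as long as the original alignment. This is an illustrative sketch under the assumption that the data matrix is a sequence alignment stored as a dictionary; the function name and the seed parameter are hypothetical.

```python
import random


def bootstrap_alignments(alignment, replicates, seed=None):
    """Yield pseudo-replicate alignments by resampling columns with replacement.

    alignment: dict of name -> aligned sequence (all the same length)
    """
    rng = random.Random(seed)
    length = len(next(iter(alignment.values())))
    if any(len(seq) != length for seq in alignment.values()):
        raise ValueError("all sequences must have the same aligned length")
    for _ in range(replicates):
        # Draw as many column indices as the alignment has, with replacement.
        columns = [rng.randrange(length) for _ in range(length)]
        yield {name: "".join(seq[c] for c in columns)
               for name, seq in alignment.items()}


# Toy alignment, invented for illustration.
toy = {"A": "ACGTAC", "B": "ACGAAC", "C": "TCGAAT"}
for replicate in bootstrap_alignments(toy, replicates=3, seed=42):
    print(replicate)
```

A tree is then built from every replicate, and the support value at a node is the percentage of replicate trees in which that grouping appears.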
[Figure: Bootstrap consensus tree constructed using the NJT algorithm.
Based on chloroplast DNA protein coding regions; scale bar 0.05.
Taxa shown: Zea, Oryza, Nicotiana, Pinus, Marchantia, Odontella, Porphyra,
Synechocys, Cyanophora, Euglena. Bootstrap values at the nodes: 100, except
one node at 91.]
[Figure: Bootstrap consensus tree constructed using the UPGMA algorithm.
Based on chloroplast DNA protein coding regions; scale 0.00 to 0.20.
Taxa shown: Zea, Oryza, Nicotiana, Marchantia, Pinus, Odontella, Synechocys,
Porphyra, Cyanophora, Euglena. Bootstrap values at the nodes: all 100.]
Why use MEGA 6.0?
• Single platform, combines the functions of BIOEDIT, CLUSTALW,
PAUP and TREEDIST
• Imports FASTA files directly from GenBank: No editing!
• Publication quality output / statistical corroboration.
• Executes on your laptop / desktop.
• User friendly GUI
• Versatile / Flexible
• Highest number of citations
• Freeware
• No codes to memorize
What can MEGA 6.0 do
for you?
• Download data from a Database / File / Sequencer
• Align data using CLUSTAL W
• Perform phylogenetic analysis using various Algorithms
• Graphically depict phylogenetic trees
• Perform evolutionary tests: Tajima’s Molecular Clock,
Tajima’s neutrality, Z-test, Fisher’s exact test,
Nei-Gojobori distance
Getting started with
MEGA
• Input file
• Processing commands
• Output file
PHYLOGENETICS WITH MEGA
THE INPUT FILE
• FASTA format
• ABI format
• Distance matrix files
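For readers who have not looked inside a FASTA file: each record is a header line starting with ">" followed by one or more lines of sequence. The sketch below shows how such a file could be read outside MEGA. It is a minimal illustration, the function name is hypothetical, and it keeps only the identifier before the first space in each header.

```python
def read_fasta(lines):
    """Parse FASTA-formatted lines into a dict of identifier -> sequence."""
    records = {}
    header = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            fields = line[1:].split()
            if not fields:
                raise ValueError("header line without an identifier")
            header = fields[0]  # identifier is the text before the first space
            if header in records:
                raise ValueError(f"duplicate identifier: {header}")
            records[header] = []
        elif header is None:
            raise ValueError("sequence data found before the first '>' header")
        else:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {name: "".join(parts) for name, parts in records.items()}


# Toy records, invented for illustration.
example = """>seq1 example description
ACGTAC
GTTA
>seq2
GGCCTA
""".splitlines()
print(read_fasta(example))
```

The same function accepts an open file handle, since iterating over a handle yields its lines.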
THE ALIGNMENT
COMMAND
• This step requires discretion. After sequences have been
aligned using CLUSTALW, 5’ and 3’ ends must be
trimmed to develop a blunt composite set.
• Save your output as an XXXXX.MAS file
• Before exiting, save as an XXXXX.MEG file
The ends of the composite sequence should be trimmed after
CLUSTALW alignment as they can contribute significantly to error
in determining true evolutionary divergence / sequence similarity
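In MEGA the trimming is done by hand in the alignment editor, but the rule itself is easy to state in code: drop columns from each end until the first and last remaining columns have a residue in every sequence. The sketch below illustrates that rule only. The function name and the use of "-" as the gap character are assumptions, and internal gaps are deliberately left untouched.

```python
def trim_ragged_ends(alignment, gap="-"):
    """Trim leading and trailing columns until every sequence has a residue there.

    alignment: dict of name -> aligned sequence (all the same length)
    """
    seqs = list(alignment.values())
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same aligned length")

    def complete(col):
        # True when no sequence has a gap in this column.
        return all(s[col] != gap for s in seqs)

    start = 0
    while start < length and not complete(start):
        start += 1
    end = length
    while end > start and not complete(end - 1):
        end -= 1
    return {name: seq[start:end] for name, seq in alignment.items()}


# Toy alignment with ragged 5' and 3' ends, invented for illustration.
ragged = {"A": "--ACG-TA-", "B": "GGACGTTAC", "C": "-GACGTTA-"}
print(trim_ragged_ends(ragged))
```

This is where the discretion mentioned above comes in: a single short sequence can force a large trim, so it may be better to drop that sequence than to lose many informative columns.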
DEFINING YOUR OUTPUT
• Distance Matrix File
• Phylogenies: NJT / UPGMA / MP / ME
• Parsimony trees
• Evolutionary parameters
• Molecular clocks
Some concepts to think
about:
• Gene clusters
• Genes across geographical boundaries
• Why does genetic evolution transcend species
boundaries?
• Why do some genes evolve faster than others?
• Why do some genes evolve concurrently?
Some concepts to think
about:
• RNA families: clustering of ESTs
• Comparative genomics within a supra genome
• Evolutionary linkages within human genes
CITATION
MEGA should be cited as:
Tamura K, Dudley J, Nei M & Kumar S (2007) MEGA4: Molecular
Evolutionary Genetics Analysis (MEGA) software version 4.0.
Molecular Biology and Evolution 24:1596-1599. (Publication PDF
at http://www.kumarlab.net/publications)
BIOINFORMATICS
SESSION
Follow the instructions on the screen and obtain your tree.
If you have WIFI access to NCBI, you can develop your
own unique alignments
THANK YOU
“In the greater scheme of things, all systems tend to unity... all of
human understanding and logic is based on this underlying
principle... and the genome is no exception...”
