SlideShare a Scribd company logo
Phylogenetic Tree
➔ Types
➔ Construction methods
Submitted by: Garima. M.Sc. Biotechnology (04)
Introduction:
● A phylogenetic tree, is a diagram that depicts the lines of evolutionary
descent of different species, organisms, or genes from a common
ancestor.
1. Branches: The branches of a phylogenetic tree represent evolutionary
lineages or the descent line. They connect the nodes or divergence sites.
2. Nodes: Nodes represent a common ancestor of the species or groups that
diverged from it. Nodes can be referred to as internal nodes or internal
branches when they represent non-observable common ancestors.
3. Tips or Leaves: The tips or leaves of a phylogenetic tree represent extant
or alive species or groups. Terminal taxa are taxa that are located at the
extremities of branches.
4. Root: The root of the phylogenetic tree represents the most recent
common ancestor of all the included species or groups. Typically, it is
depicted at the base of the tree.
Basic terminology :
5. Branch Length: In a phylogenetic tree, the branch length represents the
quantity of evolutionary change that has occurred along a particular branch. In
terms of time (e.g., millions of years) or genetic variation (e.g., DNA substitutions),
the duration can be quantified.
6. Phylogenetic Distance: The phylogenetic distance between two species or
groups quantifies how closely they are related. Typically, it is estimated using
genetic or morphological differences.
7. Taxa: Taxa are the categories of organisms or species that comprise the
phylogenetic tree. Individual species to higher taxonomic levels such as genera,
families, orders, and even larger groups.
8. Clades: Clades are monophyletic entities in a phylogenetic tree, composed of
an ancestor and all of its descendants. They share unique characteristics that are
derived from a common ancestor.
Types :
On the basis of presence or absence of a common root -
No common root
means no common
ancestor
Common root
representing
common ancestor
On the basis of topology : displays only the
branching pattern of
evolutionary relationships
among organisms.
Cladograms are unscaled,
which means that the
branch lengths do not
reflect the amount of
evolutionary divergence
between taxa or
operational taxonomic
units (OTUs).
that represents the
evolutionary relationships
among organisms by
showing both the
branching pattern and the
amount of evolutionary
divergence. Phylograms
are scaled, which means
that the branch lengths
Construction :
Choice of molecular markers and taxon sampling
Amplification / Sequencing
Alignment
Choice of evolutionary model
Phylogenetic analysis
Tree
User defined trees and topology
testing
Results
Selecting sequences for phylogenetic analysis -
● The rate of mutation is assumed to be same in both coding and non coding
regions. However, there is a difference in substitution rate.
● Non coding DNA regions have more substitution than coding regions.
● Proteins are much more conserved since they ‘need’ to conserve their
function.
● So, it is better to use sequences that mutate slowly (proteins) than DNA.
However, it genes are very small, or they mutate slowly, we can use them for
building the trees.
Basic Steps -
1. Choice of Molecular Markers
2. Alignment
3. Determining the substitution Model
4. Tree- building
5. Tree- evaluation
1. Choice of Molecular Markers
A molecular markers is a molecule contained within a sample taken from an organism or
other matter.
An ideal marker should be-
● A single copy gene may be more useful than multiple copy gene, this condition is
satisfied by mitochondrial and nuclear genes.
● Their alignment should be easy.
● The substitution rate should be optimum so as to provide informative sites.
● Too much of base variation among taxa is not preferable which may not reflect true
ancestry.
● Ribosomal RNA is considered as best target for phylogenetic as it is universal and is
composed of highly reserved and variable regions.
2. Alignment
After obtaining DNA from organism. The chosen markers are the amplified
using a isolated DNA template and markers specific oligonucleotides as
primers by PCR method.
The amplified PCR products are then sequenced.
1. Only a successful sequence alignment produces a genealogically related
tree.
2. A typical alignment procedure applied in phylogenetic studies should
involves the application of CLUSTAL W followed by manual alignment editing
and submission to tree- building program.
CLUSTAL W - It is a general purpose
multiple alignment program for DNA
or proteins. The sensitivity of the
commonly used progressive multiple
sequence alignment method has
been greatly improved for the
alignment of divergent protein
sequences.
If a tree is used to generate
alignment for phylogenetic analysis,
then the tree inferred from alignment
logically should have same topology.
3. Determining the Substitution Model -
To correct homology, statistical methods
known as substitution models or
evolutionary models, are needed to infer the
true evolutionary distances between
sequences, 2 main important substitution
models :
1. Jukes - Cantor model - Jukes - Cantor
model assumes that purines as well as
pyrimidines are substituted with equal
probability. This model can only analyse
reasonably closely related sequences.
2. Kimura model - Kimura-2 parameter
model assumes that transition
mutations should occur more often that
transversion.
● This is a model that takes in to
account the differential mutation
rates of transitions & transversion
and is more realistic.
● For protein sequences, the
evolutionary distances from an
alignment can be corrected using a
PAM or other amino amino acid
substitution matrix.
4. Tree - Building Methods -
Character - Based Methods
1. Maximum Parsimony
2. Maximum Likelihood
Method
Distance - Based Methods
1. Neighbor Joining ( NJ )
2. UPGMA
● Popular technique used in cladistics to infer a phylogenetic tree for a
set of taxa on basis of some observed data on similarities and
differences among taxa.
● Principle - Searches tree that requires smallest no. of evolutionary
changes to explain difference among OTUs.
● Invariant sites are no used in Parsimony ( they yield no information
on character state changes.)
● Informative sites ( at least 2 different kinds of residues - each present
at least 2 times) are used by Parsimony because they discriminate
between topologies - different topologies require different no. of
changes between residues.
Maximum - Parsimony Method
Character - Based Methods
Presentation about phylogenetic tree and its construction methods.
Maximum Likelihood Method
Maximum Likelihood Method create all the possible trees containing the set of
organisms considered, and then use the statistics to evaluate the most likely tree.
For a number of organisms, this is possible.
● Perform it's analysis on each position of the multiple alignment.
● Using a tree model for nucleotide substitutions, it will try to find most likely
tree.
● Maximum Likelihood methods are very slow and computer expensive.
Character - Based Methods
Neighbor Joining Method
● This algorithm is commonly applied with distance tree building,
regardless of optimisation criterion.
● The fully resolved tree is ‘decomposed’ from fully unresolved star tree
by successfully inserting branches between a pair of closest (actually,
most isolated) neighbors and remaining terminals in the tree.
● This neighbor pair is then consolidated, effectively reforming a star
tree, and this process is repeated.
● Method is rapid, requiring only few seconds or less for 50 sequences
tree.
Distance - Based Methods
Presentation about phylogenetic tree and its construction methods.
UPGMA
● Unweighted Pair Group Method with Arithmetic Mean
● It is a clustering algorithm- it joins tree branches on the criterion
of greater similarity among pairs and averages of joining pairs.
● It is not strictly an evolutionary method.
● UPGMA is expected to generate an accurate topology with true
branch length only when the divergence is according to a
molecular clock or approximately equal to raw sequence
dissimilarity.
Distance - Based Methods
Presentation about phylogenetic tree and its construction methods.
5. Tree Evaluation Methods -
Bootstrapping
● Method for testing how good a dataset fits a evolutionary model.
● This method can check branch arrangement or topology of
phylogenetic tree.
● In bootstrapping, the program re-samples columns in multiple
aligned group of sequences, and creates many new alignments
replacing the original dataset.
● These new sets represent the population.
● Process is done at least 100 times and phylogenetic trees are
generated from all sets.
● Part of the results will show the deviation of times a particular
Jackknife Method
● It is also a resampling technique.
● It resamples the original dataset by dropping one or more
alignment positions in each replicate.
● As a consequence, each jackknife replicate is smaller than the
original dataset and cannot contain duplicated data points.
● In practice jackknife is used much less frequently than bootstrap
approach.
Presentation about phylogenetic tree and its construction methods.

More Related Content

PPTX
Mendel´s third law; Law of Independent Assortment
PDF
Phylogenetic Tree Construction
PPTX
Association mapping
PPT
B.sc. agri i pog unit 4 population genetics
PPT
Hardy weinberg law
PPTX
Genome evolution discussion questions
PDF
Sequence Alignment
PPTX
Why need to study population genetics & applications of population genetics
Mendel´s third law; Law of Independent Assortment
Phylogenetic Tree Construction
Association mapping
B.sc. agri i pog unit 4 population genetics
Hardy weinberg law
Genome evolution discussion questions
Sequence Alignment
Why need to study population genetics & applications of population genetics

What's hot (20)

PPT
NATIONAL BIODIVERSITY AUTHORITY
PDF
Population genetics
PPTX
PDF
Population Genetics & Hardy - Weinberg Principle.pdf
PDF
Population Genetics AQA
PPTX
Ecological speciation - kashmeera
PPTX
PAM matrices evolution
PPTX
Hybridization based molecular markers 1
PPTX
History of genetics
PPTX
SCoT and RAPD
PPTX
Genotyping by Sequencing
PPT
Population genetic ppt
PPTX
Molecular Marker and It's Applications
PDF
Introduction to Phylogenetics
PPTX
selection in clonally propagated crops assumtions and realities
PPTX
QTL mapping
PPTX
Qtl and its mapping
PPTX
Linkage mapping and QTL analysis_Lecture
PPT
Genome Mapping
PPTX
Forward and reverse genetics
NATIONAL BIODIVERSITY AUTHORITY
Population genetics
Population Genetics & Hardy - Weinberg Principle.pdf
Population Genetics AQA
Ecological speciation - kashmeera
PAM matrices evolution
Hybridization based molecular markers 1
History of genetics
SCoT and RAPD
Genotyping by Sequencing
Population genetic ppt
Molecular Marker and It's Applications
Introduction to Phylogenetics
selection in clonally propagated crops assumtions and realities
QTL mapping
Qtl and its mapping
Linkage mapping and QTL analysis_Lecture
Genome Mapping
Forward and reverse genetics
Ad

Similar to Presentation about phylogenetic tree and its construction methods. (20)

PPTX
Bioinformatics presentation shabir .pptx
PPTX
Phylogenetic tree construction
PPTX
human phylogetic contrution of evolution tree.pptx
PDF
Phylogenetic analysis
PPTX
Phylogenetic Tree evolution
PPTX
PHYLOGENETIC ANALYSIS_CSS2.pptx
PPTX
Tree building
PPTX
BTC 506 Phylogenetic Analysis.pptx
PPT
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
PPT
Phylogenetic alignment analysis an important tool in computational biology
PPTX
Molecular phylogenetics
PDF
phylogenetics.pdf
PPTX
PHYLOGENETIC TREE CONSTRUCTION.pptx
DOCX
Humans, it would seem, have a great love of categorizing, organi
PPTX
Phylogenetic data analysis
PPT
Phylogenetic analysis & their methods.ppt
PDF
phylogenetictreeanditsconstructionandphylogenyof-191208102256.pdf
PPTX
Phylogenetic tree and its construction and phylogeny of
PPTX
Phylogenetic tree by Dr. Amrita Saxena.pptx
DOCX
Report on Phylogenetic tree
Bioinformatics presentation shabir .pptx
Phylogenetic tree construction
human phylogetic contrution of evolution tree.pptx
Phylogenetic analysis
Phylogenetic Tree evolution
PHYLOGENETIC ANALYSIS_CSS2.pptx
Tree building
BTC 506 Phylogenetic Analysis.pptx
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
Phylogenetic alignment analysis an important tool in computational biology
Molecular phylogenetics
phylogenetics.pdf
PHYLOGENETIC TREE CONSTRUCTION.pptx
Humans, it would seem, have a great love of categorizing, organi
Phylogenetic data analysis
Phylogenetic analysis & their methods.ppt
phylogenetictreeanditsconstructionandphylogenyof-191208102256.pdf
Phylogenetic tree and its construction and phylogeny of
Phylogenetic tree by Dr. Amrita Saxena.pptx
Report on Phylogenetic tree
Ad

Recently uploaded (20)

PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PPTX
BIOMOLECULES PPT........................
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
The scientific heritage No 166 (166) (2025)
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PDF
Sciences of Europe No 170 (2025)
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPT
protein biochemistry.ppt for university classes
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Taita Taveta Laboratory Technician Workshop Presentation.pptx
7. General Toxicologyfor clinical phrmacy.pptx
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Microbiology with diagram medical studies .pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
The KM-GBF monitoring framework – status & key messages.pptx
bbec55_b34400a7914c42429908233dbd381773.pdf
POSITIONING IN OPERATION THEATRE ROOM.ppt
BIOMOLECULES PPT........................
HPLC-PPT.docx high performance liquid chromatography
The scientific heritage No 166 (166) (2025)
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Sciences of Europe No 170 (2025)
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
protein biochemistry.ppt for university classes
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
2. Earth - The Living Planet Module 2ELS
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud

Presentation about phylogenetic tree and its construction methods.

  • 1. Phylogenetic Tree ➔ Types ➔ Construction methods Submitted by: Garima. M.Sc. Biotechnology (04)
  • 2. Introduction: ● A phylogenetic tree, is a diagram that depicts the lines of evolutionary descent of different species, organisms, or genes from a common ancestor.
  • 3. 1. Branches: The branches of a phylogenetic tree represent evolutionary lineages or the descent line. They connect the nodes or divergence sites. 2. Nodes: Nodes represent a common ancestor of the species or groups that diverged from it. Nodes can be referred to as internal nodes or internal branches when they represent non-observable common ancestors. 3. Tips or Leaves: The tips or leaves of a phylogenetic tree represent extant or alive species or groups. Terminal taxa are taxa that are located at the extremities of branches. 4. Root: The root of the phylogenetic tree represents the most recent common ancestor of all the included species or groups. Typically, it is depicted at the base of the tree. Basic terminology :
  • 4. 5. Branch Length: In a phylogenetic tree, the branch length represents the quantity of evolutionary change that has occurred along a particular branch. In terms of time (e.g., millions of years) or genetic variation (e.g., DNA substitutions), the duration can be quantified. 6. Phylogenetic Distance: The phylogenetic distance between two species or groups quantifies how closely they are related. Typically, it is estimated using genetic or morphological differences. 7. Taxa: Taxa are the categories of organisms or species that comprise the phylogenetic tree. Individual species to higher taxonomic levels such as genera, families, orders, and even larger groups. 8. Clades: Clades are monophyletic entities in a phylogenetic tree, composed of an ancestor and all of its descendants. They share unique characteristics that are derived from a common ancestor.
  • 5. Types : On the basis of presence or absence of a common root - No common root means no common ancestor Common root representing common ancestor
  • 6. On the basis of topology : displays only the branching pattern of evolutionary relationships among organisms. Cladograms are unscaled, which means that the branch lengths do not reflect the amount of evolutionary divergence between taxa or operational taxonomic units (OTUs). that represents the evolutionary relationships among organisms by showing both the branching pattern and the amount of evolutionary divergence. Phylograms are scaled, which means that the branch lengths
  • 7. Construction : Choice of molecular markers and taxon sampling Amplification / Sequencing Alignment Choice of evolutionary model Phylogenetic analysis Tree User defined trees and topology testing Results
  • 8. Selecting sequences for phylogenetic analysis - ● The rate of mutation is assumed to be same in both coding and non coding regions. However, there is a difference in substitution rate. ● Non coding DNA regions have more substitution than coding regions. ● Proteins are much more conserved since they ‘need’ to conserve their function. ● So, it is better to use sequences that mutate slowly (proteins) than DNA. However, it genes are very small, or they mutate slowly, we can use them for building the trees.
  • 9. Basic Steps - 1. Choice of Molecular Markers 2. Alignment 3. Determining the substitution Model 4. Tree- building 5. Tree- evaluation
  • 10. 1. Choice of Molecular Markers A molecular markers is a molecule contained within a sample taken from an organism or other matter. An ideal marker should be- ● A single copy gene may be more useful than multiple copy gene, this condition is satisfied by mitochondrial and nuclear genes. ● Their alignment should be easy. ● The substitution rate should be optimum so as to provide informative sites. ● Too much of base variation among taxa is not preferable which may not reflect true ancestry. ● Ribosomal RNA is considered as best target for phylogenetic as it is universal and is composed of highly reserved and variable regions.
  • 11. 2. Alignment After obtaining DNA from organism. The chosen markers are the amplified using a isolated DNA template and markers specific oligonucleotides as primers by PCR method. The amplified PCR products are then sequenced. 1. Only a successful sequence alignment produces a genealogically related tree. 2. A typical alignment procedure applied in phylogenetic studies should involves the application of CLUSTAL W followed by manual alignment editing and submission to tree- building program.
  • 12. CLUSTAL W - It is a general purpose multiple alignment program for DNA or proteins. The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. If a tree is used to generate alignment for phylogenetic analysis, then the tree inferred from alignment logically should have same topology.
  • 13. 3. Determining the Substitution Model - To correct homology, statistical methods known as substitution models or evolutionary models, are needed to infer the true evolutionary distances between sequences, 2 main important substitution models : 1. Jukes - Cantor model - Jukes - Cantor model assumes that purines as well as pyrimidines are substituted with equal probability. This model can only analyse reasonably closely related sequences.
  • 14. 2. Kimura model - Kimura-2 parameter model assumes that transition mutations should occur more often that transversion. ● This is a model that takes in to account the differential mutation rates of transitions & transversion and is more realistic. ● For protein sequences, the evolutionary distances from an alignment can be corrected using a PAM or other amino amino acid substitution matrix.
  • 15. 4. Tree - Building Methods - Character - Based Methods 1. Maximum Parsimony 2. Maximum Likelihood Method Distance - Based Methods 1. Neighbor Joining ( NJ ) 2. UPGMA
  • 16. ● Popular technique used in cladistics to infer a phylogenetic tree for a set of taxa on basis of some observed data on similarities and differences among taxa. ● Principle - Searches tree that requires smallest no. of evolutionary changes to explain difference among OTUs. ● Invariant sites are no used in Parsimony ( they yield no information on character state changes.) ● Informative sites ( at least 2 different kinds of residues - each present at least 2 times) are used by Parsimony because they discriminate between topologies - different topologies require different no. of changes between residues. Maximum - Parsimony Method Character - Based Methods
  • 18. Maximum Likelihood Method Maximum Likelihood Method create all the possible trees containing the set of organisms considered, and then use the statistics to evaluate the most likely tree. For a number of organisms, this is possible. ● Perform it's analysis on each position of the multiple alignment. ● Using a tree model for nucleotide substitutions, it will try to find most likely tree. ● Maximum Likelihood methods are very slow and computer expensive. Character - Based Methods
  • 19. Neighbor Joining Method ● This algorithm is commonly applied with distance tree building, regardless of optimisation criterion. ● The fully resolved tree is ‘decomposed’ from fully unresolved star tree by successfully inserting branches between a pair of closest (actually, most isolated) neighbors and remaining terminals in the tree. ● This neighbor pair is then consolidated, effectively reforming a star tree, and this process is repeated. ● Method is rapid, requiring only few seconds or less for 50 sequences tree. Distance - Based Methods
  • 21. UPGMA ● Unweighted Pair Group Method with Arithmetic Mean ● It is a clustering algorithm- it joins tree branches on the criterion of greater similarity among pairs and averages of joining pairs. ● It is not strictly an evolutionary method. ● UPGMA is expected to generate an accurate topology with true branch length only when the divergence is according to a molecular clock or approximately equal to raw sequence dissimilarity. Distance - Based Methods
  • 23. 5. Tree Evaluation Methods - Bootstrapping ● Method for testing how good a dataset fits a evolutionary model. ● This method can check branch arrangement or topology of phylogenetic tree. ● In bootstrapping, the program re-samples columns in multiple aligned group of sequences, and creates many new alignments replacing the original dataset. ● These new sets represent the population. ● Process is done at least 100 times and phylogenetic trees are generated from all sets. ● Part of the results will show the deviation of times a particular
  • 24. Jackknife Method ● It is also a resampling technique. ● It resamples the original dataset by dropping one or more alignment positions in each replicate. ● As a consequence, each jackknife replicate is smaller than the original dataset and cannot contain duplicated data points. ● In practice jackknife is used much less frequently than bootstrap approach.