SlideShare a Scribd company logo
Clustering the Temporal Sequences of
3D Protein Structure
Mayumi Kamada+*
, Sachi Kimura,
Mikito Toda‡
, Masami Takata+
, Kazuki Joe+
+:Graduate School of Humanities and Science,
Information and Computer Sciences,
Nara Women’s University
‡:Departments of physics, Nara Women’s University
Outline
• Motivation
• Flexibility Docking
• Feature Extraction using Motion
• Analysis
• Conclusions and Future Work
Motivation
• Protein in biological molecules
“Docking”
– Transform oneself and Combine with other
materials
• Prediction of Docking
 Prediction of resultant functions
Existing Docking Simulation
Predicted structures
from docking
structure
A
structure
B
Docking simulation
PDB*
Rigid
structures
* Protein Data Bank
Fluctuating in living cells
 Low prediction accuracy
Docking simulation
Considering fluctuations
Flexibility Docking
Predicted structures
from docking
structure
A
structure
B
Docking simulation
PDB
Flexibility handling
• Considering fluctuation of proteins in living cells
Extraction of fluctuated
structures
Consideration of
structural fluctuation of proteins
Flexibility Handling
Flexibility handling
MD
Filter
・・・
output
file
Representative
structure
・・・
・・・
・・・
Filtering
• Selection of representative structures
from similar structures
Molecular dynamic
simulation(MD)
• Simulation of motion of
molecules in a polyatomic system
output
file
output
file
output
file
output
file
Representative
structure
Create filters by using RMSD
Filters using RMSD
• RMSD(Root Mean Square Deviation)
– Comparison of the similarity of two structures
• Propose two filtering algorithms
<Filter-1> Maximum RMSD selection filter
<Filter-2> Below RMSD 1Å deletion filter
• Result
– Useful for the heat fluctuation condition
– RMSD
Unification of topology information
Lapse of information
Feature extraction
focusing on Protein Motion
not Structure
Capture Protein Motion
MD
・・・ ・・・
Wavelet transform
・・・ ・・・
Clustering
・・・ ・・・
Continuous wavelet
transform:
Morlet wavelet
Clustering algorithm:
Affinity Propagation
Selection of representative motions
Feature extraction
The frequency
may change
momentarily!
Target Protein
• 1TIB
– Residue length: 269
• MD simulation
– Software: AMBER
– Simulation run time: 2 nsec
– Result data files: 200
• Space coordinates of Cα atoms
Singular Value Decomposition
• SVD(Singular value decomposition)
• Definition:
*
V
U
A 

Unitary matrix U:
Left-singular vectors
Spatial motion
Unitary matrix V:
Right-singular vectors
Frequency fluctuation
Matrix A:
At time step i (ti)
Components column : Cα
row : Frequency
★matrix-size of A: 807×199
Singular Value Decomposition
• SVD(Singular value decomposition)
• Definition:
*
V
U
A 

Unitary matrix U:
Left-singular vectors
Spatial motion
Unitary matrix V:
Right-singular vectors
Frequency fluctuation
Matrix A:
At time step i (ti)
Components column : Cα
row : Frequency
★matrix-size of A: 807×199
Verification of Reproducibility
• Singular values and principal components
• N=1 • N=4
• N=6 • N=8
• M=1 • M=4
• M=6 • M=8
2
1
2
2
807
1

 

N
j
j
j
i
'
i u
vs
a 
2
1
2
2
199
1

 

M
j
j
j
i
i v
vs
a 
Left Singular Vectors
(Spatial motion)
Right Singular Vectors
(Frequency fluctuation)
Reproducibility
Using the eight principal components,
the motion expressed by 199 components
can be reproduced !
Almost adjusted !
Examination
(1) Each of singular values
(2)The first singular value
– Accounted for about 30% over
Expression of the original motion
 Possible by the six singular values
The first singular value is useful
Clustering Analysis
• Focus on the first principal component
• Definition
– Similarities and Preference
 Clustering by using the above values
Similarities (1)
• For left singular vectors
– Difference of spatial directs
 Inner products
– Similarity :
Same direction Differential direction
Kij :Value
1 0
)
(
)
( 1
1 j
i
ij t
e
t
e
K 

ij
j
i
j
i K
t
t
S 


)
,
(
Cα
Similarities (2)
• For right singular vectors
– Difference between distributions of spectrum
 Hellinger Distance
– Similarity:
 
 


n
j
i
j
i
i
i
n
P
n
P
P
P
d
n
x
n
P
2
2
)
(
)
(
2
)
,
(
)
(
)
(
)
,
(
)
,
( j
i
j
i P
P
d
P
P
S 

Clustering Method
• Affinity propagation(AP)
– Brendan J. Frey and Delbert Dueck
– “Clustering by Passing Messages Between Data
Points”. Science 315, 972–976.2007
– Obtain “Exemplars”: cluster centers
• Preference
– Left singular vectors
• Average of similarities
– Right singular vectors
• minimum of similarities-(maximum of similarities-minimum)
Data point i
Candidate
exemplar k
Competing candidate
exemplar k’
Supporting
data point i’
)
,
( k
i
r
)
,
( k
i
r 
)
,
( k
i
a 
)
,
( k
i
a
Data point i
Candidate
exemplar k
Competing candidate
exemplar k’
Supporting
data point i’
)
,
( k
i
r
)
,
( k
i
r 
)
,
( k
i
a 
)
,
( k
i
a
Similarities
between Left Singular Vectors
Clustering
of Left Singular Vectors
Similarities
between Right Singular Vectors
Clustering
of Right Singular Vectors
Discussions
• Each of motions
– Spatial motion
• Repetition of several similar spatial motions
in time variation
– Frequency fluctuation
• Repetition of similar frequency patterns
in time variation
• Relationship
Characteristic Frequency fluctuation
Group transition on spatial motion
Conclusions and Future Work
• Flexibility docking
– Flexibility handling: MD and Filter
• Feature extraction based motion
– Wavelet analysis
– Analysis of motions
– Clustering
• Future work
– Collective motion
– Relationship
– Perform the docking simulation
Conclusions and Future Work
• Flexibility docking
– Flexibility handling: MD and Filter
• Feature extraction based motion
– Wavelet analysis
– Analysis of motions
– Clustering
• Future work
– Collective motion
– Relationship
– Perform the docking simulation

More Related Content

PDF
BIM_2010_20_Bioinformatics_Project
PDF
So sánh cấu trúc protein_Protein structure comparison
PDF
Tomasi and Kanade - Structure from Motion
PPTX
TSIndexingIndexacao De Série ttemporal.pptx
PPTX
Structure alignment methods
PPT
Protein Structure Alignment
PPT
Powerpoint
PDF
Latent Factor Model For Collaborative Filtering
BIM_2010_20_Bioinformatics_Project
So sánh cấu trúc protein_Protein structure comparison
Tomasi and Kanade - Structure from Motion
TSIndexingIndexacao De Série ttemporal.pptx
Structure alignment methods
Protein Structure Alignment
Powerpoint
Latent Factor Model For Collaborative Filtering

Similar to Kamada-filehhhhhhhhhhhhhhhhhhhhhhhhhhhh.ppt (20)

PPTX
Data Mining Lecture_9.pptx
PDF
PCB_Lect02_Pairwise_allign (1).pdf
PDF
BITS: Basics of sequence analysis
PDF
SVD-1 (1).pdf
PDF
An Introduction to Spectral Graph Theory
PPT
Sequence alignment belgaum
PDF
Blast fasta
PPT
Template Free Protein Structure Modeling
PDF
Nonlinear Manifolds in Computer Vision
PDF
Content Based Image Retrieval Using Gray Level Co-Occurance Matrix with SVD a...
PDF
Efficient method for finding nearest neighbors in flocking behaviors using k-...
PDF
Object Recognition with Deformable Models
PDF
asm_pca_posit
PDF
Data Smashing
PPTX
Time series data mining techniques
PPTX
HACKSing heterogeneity in cell motility
PDF
esmaeili-2016-ijca-911399
PDF
A Frequency Domain Approach to Protein Sequence Similarity Analysis and Funct...
PDF
Joint3DShapeMatching
PDF
Protein structure prediction by means
Data Mining Lecture_9.pptx
PCB_Lect02_Pairwise_allign (1).pdf
BITS: Basics of sequence analysis
SVD-1 (1).pdf
An Introduction to Spectral Graph Theory
Sequence alignment belgaum
Blast fasta
Template Free Protein Structure Modeling
Nonlinear Manifolds in Computer Vision
Content Based Image Retrieval Using Gray Level Co-Occurance Matrix with SVD a...
Efficient method for finding nearest neighbors in flocking behaviors using k-...
Object Recognition with Deformable Models
asm_pca_posit
Data Smashing
Time series data mining techniques
HACKSing heterogeneity in cell motility
esmaeili-2016-ijca-911399
A Frequency Domain Approach to Protein Sequence Similarity Analysis and Funct...
Joint3DShapeMatching
Protein structure prediction by means
Ad

More from taoufikakabli1 (20)

PPTX
paludisme-ppt-dr-berbwxdqqdqsadj-2022.pptx
PPTX
parasito06-introdqsssssssssssssduction.pptx
PPTX
Douves-M2017-zerrrrrrrrrrrrrrrrrrrr18.pptx
PPTX
Amibes-et-amibiase.jjjjjjjjjjjjjpdf.pptx
PPT
723684839-Cours-de-Microbiologie-generale(1).ppt
PPT
pharmaco3an-gr;;;;ossesse_medicaments.ppt
PPT
3542_Presentationsssssddddd_Orchidee.ppt
PPTX
471408731-B7-Diagnostic-en-Bacteriologie-1-pptx.pptx
PPTX
556140767-Examen-Bacteriologique-Des-Selles-Coproculture-LAPTOP-L6BKA9FM.pptx
PPTX
LA PHARMACODYNAMIkklklklklkklklklklklE&é.pptx
PPT
dyslipidemies_nouvellesjjjjjjjjjjjjjjjjjjjjjjj_therapeutiques_2006.ppt
PPTX
pkp_cours.puuuuuuuuuuuuuuuuuuuuuiiiiiiiiptx
PPTX
728913926-Bilan-hepjkjjjjjjjjjjjjjjjjjjjjjatique.pptx
PPTX
647961846-EXkjjjjjjjjjkjjjjjjjjAMEN-LABO.pptx
PPTX
5671974kkkkkkkkkkkkkkkk18-BILAN-HEPATIQUE.pptx
PPTX
89873214-marquekkkkkkkkkkkkkkkkkkurs.pptx
PPTX
46966626-IONOGdffffffffffsfRAMME-SANGUIN.pptx
PPT
Prof-Kebiefdddddddddddddffffffffffdche.ppt
PPT
Equilibre_hydro_electrklllllllllolytique.ppt
PPTX
TD équilibrehhhhhhhhhhhhhhhhhhhhhhhh AB.pptx
paludisme-ppt-dr-berbwxdqqdqsadj-2022.pptx
parasito06-introdqsssssssssssssduction.pptx
Douves-M2017-zerrrrrrrrrrrrrrrrrrrr18.pptx
Amibes-et-amibiase.jjjjjjjjjjjjjpdf.pptx
723684839-Cours-de-Microbiologie-generale(1).ppt
pharmaco3an-gr;;;;ossesse_medicaments.ppt
3542_Presentationsssssddddd_Orchidee.ppt
471408731-B7-Diagnostic-en-Bacteriologie-1-pptx.pptx
556140767-Examen-Bacteriologique-Des-Selles-Coproculture-LAPTOP-L6BKA9FM.pptx
LA PHARMACODYNAMIkklklklklkklklklklklE&é.pptx
dyslipidemies_nouvellesjjjjjjjjjjjjjjjjjjjjjjj_therapeutiques_2006.ppt
pkp_cours.puuuuuuuuuuuuuuuuuuuuuiiiiiiiiptx
728913926-Bilan-hepjkjjjjjjjjjjjjjjjjjjjjjatique.pptx
647961846-EXkjjjjjjjjjkjjjjjjjjAMEN-LABO.pptx
5671974kkkkkkkkkkkkkkkk18-BILAN-HEPATIQUE.pptx
89873214-marquekkkkkkkkkkkkkkkkkkurs.pptx
46966626-IONOGdffffffffffsfRAMME-SANGUIN.pptx
Prof-Kebiefdddddddddddddffffffffffdche.ppt
Equilibre_hydro_electrklllllllllolytique.ppt
TD équilibrehhhhhhhhhhhhhhhhhhhhhhhh AB.pptx
Ad

Recently uploaded (20)

PDF
Western Pop Music: From Classics to Chart-Toppers
PDF
MAGNET STORY- Coaster Sequence (Rough Version 2).pdf
PPTX
The Pearl - project of Advanced Reading course
PPTX
BULAN K3 NASIONAL PowerPt Templates.pptx
PDF
How Old Radio Shows in the 1940s and 1950s Helped Ella Fitzgerald Grow.pdf
PDF
Benben maam tess of the report on somthing
PPTX
Kulipari: Army of Frogs Movie - OVFX Story Internship 2023
DOC
UNG毕业证学历认证,阿莱恩特国际大学毕业证文凭证书
PPTX
Hacking Movie – Best Films on Cybercrime & Digital Intrigue
PDF
WKA #29: "FALLING FOR CUPID" TRANSCRIPT.pdf
PDF
Watch Eddington (2025) – A Town Torn in Two
PDF
High-Quality PDF Backlinking for Better Rankings
PDF
Best IPTV Service Providers in the UK (2025) – Honest Reviews & Top Picks
PPTX
Other Dance Forms - G10 MAPEH Reporting.pptx
PPTX
Introduction to NGO’s098765789709876.pptx
PPTX
understanding the Human DNA components database design
PPTX
What Makes an Entertainment App Addictive?
PDF
Can You Imagine? Read along and let’s see!
PDF
WKA #29: "FALLING FOR CUPID" TRANSCRIPT.pdf
PDF
Hip Hop Culture – More Than Just Music & Style
Western Pop Music: From Classics to Chart-Toppers
MAGNET STORY- Coaster Sequence (Rough Version 2).pdf
The Pearl - project of Advanced Reading course
BULAN K3 NASIONAL PowerPt Templates.pptx
How Old Radio Shows in the 1940s and 1950s Helped Ella Fitzgerald Grow.pdf
Benben maam tess of the report on somthing
Kulipari: Army of Frogs Movie - OVFX Story Internship 2023
UNG毕业证学历认证,阿莱恩特国际大学毕业证文凭证书
Hacking Movie – Best Films on Cybercrime & Digital Intrigue
WKA #29: "FALLING FOR CUPID" TRANSCRIPT.pdf
Watch Eddington (2025) – A Town Torn in Two
High-Quality PDF Backlinking for Better Rankings
Best IPTV Service Providers in the UK (2025) – Honest Reviews & Top Picks
Other Dance Forms - G10 MAPEH Reporting.pptx
Introduction to NGO’s098765789709876.pptx
understanding the Human DNA components database design
What Makes an Entertainment App Addictive?
Can You Imagine? Read along and let’s see!
WKA #29: "FALLING FOR CUPID" TRANSCRIPT.pdf
Hip Hop Culture – More Than Just Music & Style

Kamada-filehhhhhhhhhhhhhhhhhhhhhhhhhhhh.ppt

  • 1. Clustering the Temporal Sequences of 3D Protein Structure Mayumi Kamada+* , Sachi Kimura, Mikito Toda‡ , Masami Takata+ , Kazuki Joe+ +:Graduate School of Humanities and Science, Information and Computer Sciences, Nara Women’s University ‡:Departments of physics, Nara Women’s University
  • 2. Outline • Motivation • Flexibility Docking • Feature Extraction using Motion • Analysis • Conclusions and Future Work
  • 3. Motivation • Protein in biological molecules “Docking” – Transform oneself and Combine with other materials • Prediction of Docking  Prediction of resultant functions
  • 4. Existing Docking Simulation Predicted structures from docking structure A structure B Docking simulation PDB* Rigid structures * Protein Data Bank Fluctuating in living cells  Low prediction accuracy Docking simulation Considering fluctuations
  • 5. Flexibility Docking Predicted structures from docking structure A structure B Docking simulation PDB Flexibility handling • Considering fluctuation of proteins in living cells Extraction of fluctuated structures Consideration of structural fluctuation of proteins
  • 6. Flexibility Handling Flexibility handling MD Filter ・・・ output file Representative structure ・・・ ・・・ ・・・ Filtering • Selection of representative structures from similar structures Molecular dynamic simulation(MD) • Simulation of motion of molecules in a polyatomic system output file output file output file output file Representative structure Create filters by using RMSD
  • 7. Filters using RMSD • RMSD(Root Mean Square Deviation) – Comparison of the similarity of two structures • Propose two filtering algorithms <Filter-1> Maximum RMSD selection filter <Filter-2> Below RMSD 1Å deletion filter • Result – Useful for the heat fluctuation condition – RMSD Unification of topology information Lapse of information Feature extraction focusing on Protein Motion not Structure
  • 8. Capture Protein Motion MD ・・・ ・・・ Wavelet transform ・・・ ・・・ Clustering ・・・ ・・・ Continuous wavelet transform: Morlet wavelet Clustering algorithm: Affinity Propagation Selection of representative motions Feature extraction The frequency may change momentarily!
  • 9. Target Protein • 1TIB – Residue length: 269 • MD simulation – Software: AMBER – Simulation run time: 2 nsec – Result data files: 200 • Space coordinates of Cα atoms
  • 10. Singular Value Decomposition • SVD(Singular value decomposition) • Definition: * V U A   Unitary matrix U: Left-singular vectors Spatial motion Unitary matrix V: Right-singular vectors Frequency fluctuation Matrix A: At time step i (ti) Components column : Cα row : Frequency ★matrix-size of A: 807×199
  • 11. Singular Value Decomposition • SVD(Singular value decomposition) • Definition: * V U A   Unitary matrix U: Left-singular vectors Spatial motion Unitary matrix V: Right-singular vectors Frequency fluctuation Matrix A: At time step i (ti) Components column : Cα row : Frequency ★matrix-size of A: 807×199
  • 12. Verification of Reproducibility • Singular values and principal components • N=1 • N=4 • N=6 • N=8 • M=1 • M=4 • M=6 • M=8 2 1 2 2 807 1     N j j j i ' i u vs a  2 1 2 2 199 1     M j j j i i v vs a  Left Singular Vectors (Spatial motion) Right Singular Vectors (Frequency fluctuation)
  • 13. Reproducibility Using the eight principal components, the motion expressed by 199 components can be reproduced ! Almost adjusted !
  • 14. Examination (1) Each of singular values (2)The first singular value – Accounted for about 30% over Expression of the original motion  Possible by the six singular values The first singular value is useful
  • 15. Clustering Analysis • Focus on the first principal component • Definition – Similarities and Preference  Clustering by using the above values
  • 16. Similarities (1) • For left singular vectors – Difference of spatial directs  Inner products – Similarity : Same direction Differential direction Kij :Value 1 0 ) ( ) ( 1 1 j i ij t e t e K   ij j i j i K t t S    ) , ( Cα
  • 17. Similarities (2) • For right singular vectors – Difference between distributions of spectrum  Hellinger Distance – Similarity:       n j i j i i i n P n P P P d n x n P 2 2 ) ( ) ( 2 ) , ( ) ( ) ( ) , ( ) , ( j i j i P P d P P S  
  • 18. Clustering Method • Affinity propagation(AP) – Brendan J. Frey and Delbert Dueck – “Clustering by Passing Messages Between Data Points”. Science 315, 972–976.2007 – Obtain “Exemplars”: cluster centers • Preference – Left singular vectors • Average of similarities – Right singular vectors • minimum of similarities-(maximum of similarities-minimum) Data point i Candidate exemplar k Competing candidate exemplar k’ Supporting data point i’ ) , ( k i r ) , ( k i r  ) , ( k i a  ) , ( k i a Data point i Candidate exemplar k Competing candidate exemplar k’ Supporting data point i’ ) , ( k i r ) , ( k i r  ) , ( k i a  ) , ( k i a
  • 23. Discussions • Each of motions – Spatial motion • Repetition of several similar spatial motions in time variation – Frequency fluctuation • Repetition of similar frequency patterns in time variation • Relationship Characteristic Frequency fluctuation Group transition on spatial motion
  • 24. Conclusions and Future Work • Flexibility docking – Flexibility handling: MD and Filter • Feature extraction based motion – Wavelet analysis – Analysis of motions – Clustering • Future work – Collective motion – Relationship – Perform the docking simulation
  • 25. Conclusions and Future Work • Flexibility docking – Flexibility handling: MD and Filter • Feature extraction based motion – Wavelet analysis – Analysis of motions – Clustering • Future work – Collective motion – Relationship – Perform the docking simulation