SlideShare a Scribd company logo
A method to improve predictivity of Net-Cox
using Mutual Information based network
Abstract
Cancer genes
Gene co-expression Network
NetCox
(Network-constrained Cox regression)
Predict Survivability
1g
2g
3g
Dataset
Correlation between gene pairs
Abstract
CNA
mRNA
METH
TCGA data
10,022 genes
340 patients
Censored : 0
Observed : 1
Clinical outcome
M.I. based Network
NetCox
(Network-constrained Cox regression)
Predict Survivability
3g
1g
2g
𝐼(𝑔𝑖, 𝑔𝑗 ; 𝑌)
Mutual Information between gene pairs and clinical outcome of patients
M.I. based network
gene pair clinical outcome of patients
Joint entropy of X, YEntropy of X
Mutual information of X, Y
Mutual information of two genes and
clinical outcome of patient
0, 0.1, 0.2, …, 0.7
Threshold from permutation strategy
1g
2g
3g
NetCox (Network-constrained Cox regression)
Network information
1e-4, 1e-3, 1e-2, 1e-1
0.1, 0.3, 0.5, 0.7, 0.9, 1
L2-Cox
The importance of network
information increases
Regression coefficients
Prognostic index
Profile data
Which
parameter is
the best?
Predict Survivability
Cross validation for optimal parameters
profile
G
10022
10022
network
profileX
Data
340 patients
10022 genes
1
1
0
1
…
340
patients
Clinical outcome
uncensored
(deceased)
censored
(living)
Test set Training set
5-fold cross validation
Sort in ascending order
by survival time of patients
test
profileX 
NetCox training
'XPI 
Evaluation
40%
40%
High-risk
group
low-risk
group
PIrank the
patients • Log-rank test • Time dependent ROC
𝑠𝑒𝑛𝑠𝑖𝑡𝑖𝑣𝑖𝑡𝑦 𝑐, 𝑡 𝑓 𝑋 = Pr 𝑓 𝑋 > 𝑐|𝛿 𝑡 = 1 ,
𝑠𝑝𝑒𝑐𝑖𝑓𝑖𝑐𝑖𝑡𝑦 𝑐, 𝑡 𝑓 𝑋 = Pr 𝑓 𝑋 ≤ 𝑐|𝛿 𝑡 = 0
• 𝑓 𝑋 = 𝑋′ 𝛽
• 𝑐 ∶ 𝑐𝑢𝑡𝑜𝑓𝑓 𝑝𝑜𝑖𝑛𝑡
• 𝑡 ∶ 𝑡𝑖𝑚𝑒
• 𝛿 𝑡 : 𝑒𝑣𝑒𝑛𝑡 𝑖𝑛𝑑𝑖𝑐𝑎𝑡𝑜𝑟 𝑎𝑡 𝑡𝑖𝑚𝑒 𝑡
The area under the ROC curve for any time t
-> how well PI classifies the patients into
high or low risk group
−log(𝑝𝑣𝑎𝑙𝑢𝑒)
Evaluate whether the patients
are assigned to the right group.
Results - Time-dependent ROC & log-rank test
Results
Mutual
Information
Correlation
Functional
Linkage
L2-Cox
CNA 0.6276 0.6265 0.6192 0.6229
mRNA 0.6427 0.6481 0.6366 0.6418
METH 0.5876 0.5845 0.5855 0.5663
Mutual
Information
Correlation
Functional
Linkage
L2-Cox
CNA 1.4068 1.4335 1.0968 1.2020
mRNA 1.8107 2.0748 1.2346 1.5018
METH 0.9714 0.8576 0.7596 0.7407
(a) Best Performance Comparison with time AUC
(b) Best Performance Comparison with log-rank test
Results
Results
Genomic
Profile
Test lambda alpha theta threshold
No. of gene pairs
above threshold
Percentage
CNA
log-rank
1E-03
0.3 0.4 0.0930 66,754 0.13%
AUC 0.7 0.3 0.0863 179,963 0.36%
mRNA
log-rank
1E-04 0.1 0.6 0.1221 6,487 0.01%
AUC
METH
log-rank
1E-04
0.1 0.3 0.1017 32,282 0.06%
AUC 0.3 0.5 0.1173 3,641 0.01%
Results – CNA network
Results – mRNA network
Results – Methylation network
Results
Results
Profile Category ID description count p-value corr p-value
CNA BP 42335 cuticle development 2 1.91E-06 2.53E-03
CNA MF 16174 NAD(P)H oxidase activity 2 2.26E-05 5.13E-03
meth BP 97479 synaptic vesicle localization 4 1.86E-05 5.82E-03
meth BP 48489 synaptic vesicle transport 4 1.74E-05 5.82E-03
mRNA BP 6915 apoptotic process 16 1.55E-07 1.81E-04
mRNA BP 30162 regulation of proteolysis 10 1.04E-06 3.87E-04
P-value < 0.01

More Related Content

PPT
Quantitative Medicine Feb 2009
PPT
Gene Express Jaima Presentation September 04, 2008 Chiba
PDF
Presentation july 31_2015
PPTX
NetBioSIG2014-Talk by David Amar
PPTX
Project_702
PPTX
Selene_Hess_Deciphering Antibiotic Resistance_EDITED
PDF
Identification of novel potential anti cancer agents using network pharmacolo...
PDF
Presentation july 28_2015
Quantitative Medicine Feb 2009
Gene Express Jaima Presentation September 04, 2008 Chiba
Presentation july 31_2015
NetBioSIG2014-Talk by David Amar
Project_702
Selene_Hess_Deciphering Antibiotic Resistance_EDITED
Identification of novel potential anti cancer agents using network pharmacolo...
Presentation july 28_2015

What's hot (6)

PPT
PDF
Bioinformatics
PDF
Genomics2 Phenomics Complete
PPTX
AI in Bioinformatics
PPTX
Big Data Challenges for Real-Time Personalized Medicine
PDF
Big Data and Genomic Medicine by Corey Nislow
Bioinformatics
Genomics2 Phenomics Complete
AI in Bioinformatics
Big Data Challenges for Real-Time Personalized Medicine
Big Data and Genomic Medicine by Corey Nislow
Ad

Similar to A method to improve survival prediction using mutual information based network (20)

PPTX
Outcome-guided mutual information networks for investigating gene-gene intera...
PPT
Myths in science & statistics s ha h.ppt
PDF
PPT
probabilistic ranking
PDF
presentation
PDF
PICS: Pathway Informed Classification System for cancer analysis using gene e...
PDF
InSyBio at Open Coffee Athens CI
PDF
Prof. Dr. Hamdi Akan, 6th Clinical Research Conference
PPTX
TNBC Research Presentation and medical virology .pptx
PDF
Abidemi - ICSA-Final
PPTX
Towards Replicable and Genereralizable Genomic Prediction Models
PDF
Apresentação Netsci 09
PPTX
PSB 2018 presentation
PDF
Design of an Intelligent System for Improving Classification of Cancer Diseases
PPTX
Assessing the clinical utility of cancer genomic and proteomic data across tu...
PDF
Gene Expression Mining for Predicting Survivability of Patients in Earlystage...
PPTX
selection of endpoints in oncology research studies.pptx
PPTX
Personalized Therapies for OA: Can Biomarkers Get Us There?
PDF
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
PDF
Revolutionizing cancer classification: the snr-ogscc method for improved gene...
Outcome-guided mutual information networks for investigating gene-gene intera...
Myths in science & statistics s ha h.ppt
probabilistic ranking
presentation
PICS: Pathway Informed Classification System for cancer analysis using gene e...
InSyBio at Open Coffee Athens CI
Prof. Dr. Hamdi Akan, 6th Clinical Research Conference
TNBC Research Presentation and medical virology .pptx
Abidemi - ICSA-Final
Towards Replicable and Genereralizable Genomic Prediction Models
Apresentação Netsci 09
PSB 2018 presentation
Design of an Intelligent System for Improving Classification of Cancer Diseases
Assessing the clinical utility of cancer genomic and proteomic data across tu...
Gene Expression Mining for Predicting Survivability of Patients in Earlystage...
selection of endpoints in oncology research studies.pptx
Personalized Therapies for OA: Can Biomarkers Get Us There?
2014 Gene expressionmicroarrayclassification usingPCA–BEL.
Revolutionizing cancer classification: the snr-ogscc method for improved gene...
Ad

More from SOYEON KIM (20)

PDF
Network-based machine learning approach for aggregating multi-modal data
PPTX
Revealing disease-associated pathways by network integration of untargeted me...
PPTX
Systems genetics approaches to understand complex traits
PPTX
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
PDF
Network embedding
PPTX
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
PPTX
Deep learning based multi-omics integration, a survey
PPTX
DeepWalk: Online Learning of Social Representations
PPTX
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
PPTX
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
PPTX
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
PDF
A survey of heterogeneous information network analysis
PPTX
Translated learning
PPTX
Self taught clustering
PPTX
Semi-automatic ground truth generation using unsupervised clustering and limi...
PPTX
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
PPTX
Text extraction from natural scene image, a survey
PPTX
Opinion Fraud Detection in Online Reviews by Network Effects
PPTX
Evaluating color descriptors for object and scene recognition
PPTX
Spectral clustering
Network-based machine learning approach for aggregating multi-modal data
Revealing disease-associated pathways by network integration of untargeted me...
Systems genetics approaches to understand complex traits
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Network embedding
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
Deep learning based multi-omics integration, a survey
DeepWalk: Online Learning of Social Representations
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
A survey of heterogeneous information network analysis
Translated learning
Self taught clustering
Semi-automatic ground truth generation using unsupervised clustering and limi...
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
Text extraction from natural scene image, a survey
Opinion Fraud Detection in Online Reviews by Network Effects
Evaluating color descriptors for object and scene recognition
Spectral clustering

Recently uploaded (20)

PPTX
Database Infoormation System (DBIS).pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Business Analytics and business intelligence.pdf
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
1_Introduction to advance data techniques.pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
climate analysis of Dhaka ,Banglades.pptx
Database Infoormation System (DBIS).pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Business Analytics and business intelligence.pdf
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Introduction to Knowledge Engineering Part 1
Galatica Smart Energy Infrastructure Startup Pitch Deck
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Supervised vs unsupervised machine learning algorithms
Fluorescence-microscope_Botany_detailed content
1_Introduction to advance data techniques.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
.pdf is not working space design for the following data for the following dat...
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
[EN] Industrial Machine Downtime Prediction
climate analysis of Dhaka ,Banglades.pptx

A method to improve survival prediction using mutual information based network

  • 1. A method to improve predictivity of Net-Cox using Mutual Information based network
  • 2. Abstract Cancer genes Gene co-expression Network NetCox (Network-constrained Cox regression) Predict Survivability 1g 2g 3g Dataset Correlation between gene pairs
  • 3. Abstract CNA mRNA METH TCGA data 10,022 genes 340 patients Censored : 0 Observed : 1 Clinical outcome M.I. based Network NetCox (Network-constrained Cox regression) Predict Survivability 3g 1g 2g 𝐼(𝑔𝑖, 𝑔𝑗 ; 𝑌) Mutual Information between gene pairs and clinical outcome of patients
  • 4. M.I. based network gene pair clinical outcome of patients Joint entropy of X, YEntropy of X Mutual information of X, Y Mutual information of two genes and clinical outcome of patient 0, 0.1, 0.2, …, 0.7 Threshold from permutation strategy 1g 2g 3g
  • 5. NetCox (Network-constrained Cox regression) Network information 1e-4, 1e-3, 1e-2, 1e-1 0.1, 0.3, 0.5, 0.7, 0.9, 1 L2-Cox The importance of network information increases Regression coefficients Prognostic index Profile data Which parameter is the best? Predict Survivability
  • 6. Cross validation for optimal parameters profile G 10022 10022 network profileX Data 340 patients 10022 genes 1 1 0 1 … 340 patients Clinical outcome uncensored (deceased) censored (living) Test set Training set 5-fold cross validation Sort in ascending order by survival time of patients test profileX  NetCox training 'XPI 
  • 7. Evaluation 40% 40% High-risk group low-risk group PIrank the patients • Log-rank test • Time dependent ROC 𝑠𝑒𝑛𝑠𝑖𝑡𝑖𝑣𝑖𝑡𝑦 𝑐, 𝑡 𝑓 𝑋 = Pr 𝑓 𝑋 > 𝑐|𝛿 𝑡 = 1 , 𝑠𝑝𝑒𝑐𝑖𝑓𝑖𝑐𝑖𝑡𝑦 𝑐, 𝑡 𝑓 𝑋 = Pr 𝑓 𝑋 ≤ 𝑐|𝛿 𝑡 = 0 • 𝑓 𝑋 = 𝑋′ 𝛽 • 𝑐 ∶ 𝑐𝑢𝑡𝑜𝑓𝑓 𝑝𝑜𝑖𝑛𝑡 • 𝑡 ∶ 𝑡𝑖𝑚𝑒 • 𝛿 𝑡 : 𝑒𝑣𝑒𝑛𝑡 𝑖𝑛𝑑𝑖𝑐𝑎𝑡𝑜𝑟 𝑎𝑡 𝑡𝑖𝑚𝑒 𝑡 The area under the ROC curve for any time t -> how well PI classifies the patients into high or low risk group −log(𝑝𝑣𝑎𝑙𝑢𝑒) Evaluate whether the patients are assigned to the right group.
  • 8. Results - Time-dependent ROC & log-rank test
  • 9. Results Mutual Information Correlation Functional Linkage L2-Cox CNA 0.6276 0.6265 0.6192 0.6229 mRNA 0.6427 0.6481 0.6366 0.6418 METH 0.5876 0.5845 0.5855 0.5663 Mutual Information Correlation Functional Linkage L2-Cox CNA 1.4068 1.4335 1.0968 1.2020 mRNA 1.8107 2.0748 1.2346 1.5018 METH 0.9714 0.8576 0.7596 0.7407 (a) Best Performance Comparison with time AUC (b) Best Performance Comparison with log-rank test
  • 11. Results Genomic Profile Test lambda alpha theta threshold No. of gene pairs above threshold Percentage CNA log-rank 1E-03 0.3 0.4 0.0930 66,754 0.13% AUC 0.7 0.3 0.0863 179,963 0.36% mRNA log-rank 1E-04 0.1 0.6 0.1221 6,487 0.01% AUC METH log-rank 1E-04 0.1 0.3 0.1017 32,282 0.06% AUC 0.3 0.5 0.1173 3,641 0.01%
  • 12. Results – CNA network
  • 13. Results – mRNA network
  • 16. Results Profile Category ID description count p-value corr p-value CNA BP 42335 cuticle development 2 1.91E-06 2.53E-03 CNA MF 16174 NAD(P)H oxidase activity 2 2.26E-05 5.13E-03 meth BP 97479 synaptic vesicle localization 4 1.86E-05 5.82E-03 meth BP 48489 synaptic vesicle transport 4 1.74E-05 5.82E-03 mRNA BP 6915 apoptotic process 16 1.55E-07 1.81E-04 mRNA BP 30162 regulation of proteolysis 10 1.04E-06 3.87E-04 P-value < 0.01