SlideShare a Scribd company logo
1
Clinical Risk Prediction with Temporal Probabilistic
Asymmetric Multi-Task Learning
1School of Computing, 2Graduate School of AI,
Korea Advanced Institute of Science and Technology,
3Aitrics, 4Department of Computer Science, University of Oxford
Tuan Nguyen* 1,4, Hyewon Jeong* 1, Eunho Yang 1,2,3, and Sung Ju Hwang 1,2,3
Clinical Risk Prediction with Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
2
Introduction
Heart Rate (HR)
Respiratory Rate (RR)
Oxygen saturation (SpO2)
Body Temperature (BT)
White Blood Cell Count (WBC)
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Features Tasks
Task1: Fever
Task2: Infection
Task3: Mortality
Negative Transfer
MTL: clinical setting (MIMIC III-Infection)
Clinical Risk Prediction with Multi-Task Learning
Negative Transfer Problem in Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
3
Introduction
Heart Rate (HR)
Respiratory Rate (RR)
Oxygen saturation (SpO2)
Body Temperature (BT)
White Blood Cell Count (WBC)
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Features Tasks
Task1: Fever
Task2: Infection
Task3: Mortality
Negative Transfer
MTL: clinical setting (MIMIC III-Infection)
Unreliable Predictor
Clinical Risk Prediction with Multi-Task Learning
Asymmetric Knowledge Transfer Across Timesteps
4
Introduction
𝑓!
𝑓"
𝑓#
…
Fever
𝑖!
𝑖"
𝑖#
Step 1
𝑚!
𝑚"
𝑚#
…
…
Step 2 Step T
Infection
Mortality
낮은
불확실성
높은
불확실성
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Deep AMTFL
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
MTL: clinical setting (MIMIC III-Infection)
Probabilistic Asymmetric Multi-Task Learning (P-AMTL)
Introduction
Uncertainty-Aware Asymmetric Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
Probabilistic Asymmetric Multi-Task Learning (P-AMTL)
6
0.3
0.4
0.5
0.6
0.7
0
0.02
0.04
0.06
0.08
0.1
0.12
Task 0 Task 1
Knowledge
Transfer
Loss
KT in Loss-based AMTL
Loss
KT
0
0.02
0.04
0.06
0.08
0.1
0.12
0
0.2
0.4
0.6
0.8
Task 0 Task 1
Knowledge
Transfer
Uncertainty
KT in P-AMTL
UC
KT
2000 200
instances 2000 200
instances
-0.02
-0.015
-0.01
-0.005
0
0.005
0.01
0.015
Accuracy
Improvement
over
STL
Loss-based AMTL
TPAMTL
…
…
Step 1 Step 2 Step T
Loss
Loss
Loss-Based AMTL (Lee et al., 2018)
fd
(1) fd
(2) fd
(3)
fj
(1) fj
(2) fj
(3)
…
Low UC
High UC
𝑓!
(#)
UC-aware AMTL
…
Step 1 Step 2 Step T
fd
(1) fd
(2) fd
(3)
fj
(1) fj
(2) fj
(3)
𝑍!, 𝑍" :	High level latent feature
𝑓!, 𝑓" : Multiple features
across timesteps
(𝑍! = 𝑓!
#
, 𝑓!
$
, … , 𝑓!
%
)
(𝑍" = 𝑓"
#
, 𝑓"
$
, … , 𝑓"
%
)
Task J
Task D
Approach
Failure of Loss-based Asymmetric Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
Multiple Features
Across Timesteps
Failure of Loss-based AMTL
7
Approach
Table 1. Task Performance of MNIST-variation Experiment
(AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens
inter-task(in order to capture task relatedness) and across-timestep.
Uncertainty Aware Knowledge Transfer: example case
!
Multiple Features
(zj for Task j)
+ Gj
2
αd,j
Gd
1
!
fd
(1)
Multiple Features
(zd for Task d)
αj,d
Gj
1
+
Gj
1 Gj
1
fd
(3)
fd
(1)
fj
(1)
fd
(3)
fd
(1)
fj
(1)
Transform from more reliable to less reliable latent features.
Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task
!!,# = #!,# $!,#, $#, &!,#
$ , &#
$
!#,! = ##,! $#,!, $!, &#,!
$
, &!
$
"!
(#)
= $!
(#)
+ &!(∑ ∑ )%,!
',#
∗ &%
#
'()
*
%+) $%
'
)	∀. ∈ {1,2, … , !}	
* Same also happens for intra-task, inter-timestep knowledge transfer
TP-AMTL: Uncertainty-Aware Knowledge Transfer
Approach
TP-AMTL: Uncertainty-Aware Knowledge Transfer
Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens
inter-task(in order to capture task relatedness) and across-timestep.
Uncertainty Aware Knowledge Transfer: example case
𝑇
Multiple Features
(zj for Task j)
+ Gj
2
αd,j
Gd
1
𝑇
fd
(1)
Multiple Features
(zd for Task d)
αj,d
Gj
1
+
Gj
1 Gj
1
fd
(3)
fd
(1)
fj
(1)
fd
(3)
fd
(1)
fj
(1)
Transform from more reliable to less reliable latent features.
Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task
Approach
𝛼!,# = 𝐹!,# 𝑍!,#, 𝑍#, 𝜎!,#
$
, 𝜎#
$
𝛼#,! = 𝐹#,! 𝑍#,!, 𝑍!, 𝜎#,!
$
, 𝜎!
$
𝐶%
(&)
= 𝑓%
(&)
+ 𝐺%(∑!'(
)
∑*+(
&
𝛼!,%
*,&
∗ 𝐺! 𝑓!
*
) ∀𝑡 ∈ {1,2, … , 𝑇}
* Same also happens for intra-task, inter-timestep knowledge transfer
𝑧# ∼ 𝑝% 𝑧# 𝑥, 𝜔
𝑝% 𝑧# 𝑥, 𝜔 ∼ 𝒩(𝑧#; 𝜇#, 𝑑𝑖𝑎𝑔 𝜎#
$
)
Complexity Analysis
10
Approach
Supplementary Table 1. Time Complexity of the Baseline Models
Tasks and Datasets
11
Task 1 : Stay < 3
Length of ICU Stay
Task 2 : Cardiac
Recovering from
Cardiac Surgery
Task 4 : Mortality
Task 3 : Recovery
Recovering from
general surgery
PhysioNet2012
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
MIMIC - III Infection
2,000 data points
Tasks : Fever à Infection à Mortality
Features: 12 Infection related features : including heart rate,
arterial blood pressure, and Glasgow Coma Scale(GCS) etc.
4,000 distinct hospital (ICU) records
Tasks: Stay < 3 / Cardiac / Recovery à Mortality
Features: 31 physiological signs including heart rate,
respiratory rate, temperature, etc.
Experiments
Information on MIMIC - III Respiratory Failure, Heart Failure can be found in the supplementary file
Quantitative Results
12
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
13
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
1
Quantitative Results
14
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
15
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
16
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
17
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Source features with low uncertainties transfer knowledge more, while at the target,
features with high uncertainties receive more knowledge transfer.
Qualitative Results: Knowledge Transfer Graph
Normalized amount of knowledge transfer from
multiple sources (task 𝑗 at time 𝑡) to task 𝑑
(normalized over the number of targets)
18
Normalized amount of knowledge transfer to multiple
targets (task 𝑑 at time 𝑡) from task 𝑗
(normalized over the number of sources)
Incoming Transfer to different Targets
Outgoing Transfer from different Sources
𝛼!,#
&,&
+ 𝛼!,#
&,&'(
+ ⋯ + 𝛼!,#
&,)
𝑇 − 𝑡 + 1
− (1)
𝛼!,%
(,&
+ 𝛼!,%
-,&
+ ⋯ + 𝛼!,%
&,&
𝑡
− (2)
Experiments
Qualitative Results: Medical Interpretation
19
Interpretation of the Learned Knowledge Graph
By analyzing selected clinical case studies, we could identify steps where knowledge transferred as we
designed and meaningful medical events occur, which correlates with interactions between selected tasks.
MechVent - Mechanical Ventilation, FiO2 - Fractional inspired Oxygen, SBP - Systolic arterial blood pressure,
DBP - Diastolic arterial blood pressure, HR - Heart Rate, Temp - Body Temperature, Urine - Urine output,
GCS - Glasgow Coma Score, WBC - White Blood Cell Count, Culture - Culture Results.
Experiments
Ablation Study
20
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
21
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
22
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
23
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
• We proposed a novel probabilistic asymmetric multi-task learning framework
that allows asymmetric knowledge transfer between tasks at different timesteps,
based on the uncertainty.
• We use a probabilistic Bayesian formulation for asymmetric knowledge transfer,
where the amount of knowledge transfer depends on the uncertainty at the
feature level.
• We validate our model on clinical risk prediction tasks, on which it achieves
significant improvements over baselines and provides meaningful interpretations,
including temporal relationships between tasks.
Conclusions
24
Thank you
25

More Related Content

PDF
rob 537 final paper(fourth modify)
PDF
Ag044216224
PDF
Transfer Learning for the Detection and Classification of traditional pneumon...
PPTX
harsh final ppt (2).pptx
PDF
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
PDF
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
PDF
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
PDF
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
rob 537 final paper(fourth modify)
Ag044216224
Transfer Learning for the Detection and Classification of traditional pneumon...
harsh final ppt (2).pptx
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...
AN INVESTIGATION INTO DETECTING PNEUMONIA THROUGH IMAGE PROCESSING AND OBJECT...

Similar to Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning (20)

PPTX
Detection and Classification of Pneumonia with Chest X-Ray Images using Deep ...
PDF
Deep Learning-based Diagnosis of Pneumonia using X-Ray Scans
PDF
AIMS Block Presentation]{Deep Transfer Learning for Magnetic Resonance Image ...
PPTX
Deep Convolutional Neural Networks and Covid19 by Dr.Sana Komal
PPTX
Rapid COVID-19 Diagnosis Using Deep Learning of the Computerized Tomography ...
PDF
Qt7355g8v8
PDF
Lung Cancer Detection using Convolutional Neural Network
PDF
AN AUTOMATED FRAMEWORK FOR DIAGNOSING LUNGS RELATED ISSUES USING ML AND DATA ...
PDF
Prediction for Pulmonary Disease Based on Diagnostic Reciepes and Classification
PDF
Pneumonia Classification using Transfer Learning
PDF
Deep Learning for Pneumonia Diagnosis: A Comprehensive Analysis of CNN and Tr...
PDF
CovidAID: COVID-19 Detection using Chest X-Ray Images
PDF
Health Risk Prediction Using Support Vector Machine with Gray Wolf Optimizati...
PPTX
Corona prediction from symptoms v1.4
PDF
Predicting disease from several symptoms using machine learning approach.
PPTX
CMPE 258 - Short Story ppt.pptx
PDF
2018 IMSM: Identifying Precision Treatment for Rheumatoid Arthritis with Rein...
PDF
ICU MORTALITY PREDICTION
PDF
Deep learning for episodic interventional data
PPTX
Batch -13.pptx lung cancer detection using transfer learning
Detection and Classification of Pneumonia with Chest X-Ray Images using Deep ...
Deep Learning-based Diagnosis of Pneumonia using X-Ray Scans
AIMS Block Presentation]{Deep Transfer Learning for Magnetic Resonance Image ...
Deep Convolutional Neural Networks and Covid19 by Dr.Sana Komal
Rapid COVID-19 Diagnosis Using Deep Learning of the Computerized Tomography ...
Qt7355g8v8
Lung Cancer Detection using Convolutional Neural Network
AN AUTOMATED FRAMEWORK FOR DIAGNOSING LUNGS RELATED ISSUES USING ML AND DATA ...
Prediction for Pulmonary Disease Based on Diagnostic Reciepes and Classification
Pneumonia Classification using Transfer Learning
Deep Learning for Pneumonia Diagnosis: A Comprehensive Analysis of CNN and Tr...
CovidAID: COVID-19 Detection using Chest X-Ray Images
Health Risk Prediction Using Support Vector Machine with Gray Wolf Optimizati...
Corona prediction from symptoms v1.4
Predicting disease from several symptoms using machine learning approach.
CMPE 258 - Short Story ppt.pptx
2018 IMSM: Identifying Precision Treatment for Rheumatoid Arthritis with Rein...
ICU MORTALITY PREDICTION
Deep learning for episodic interventional data
Batch -13.pptx lung cancer detection using transfer learning
Ad

More from MLAI2 (20)

PDF
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
PDF
Online Hyperparameter Meta-Learning with Hypergradient Distillation
PDF
Online Coreset Selection for Rehearsal-based Continual Learning
PDF
Representational Continuity for Unsupervised Continual Learning
PDF
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
PDF
Skill-Based Meta-Reinforcement Learning
PDF
Edge Representation Learning with Hypergraphs
PDF
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
PDF
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
PDF
Task Adaptive Neural Network Search with Meta-Contrastive Learning
PDF
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
PDF
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
PDF
Accurate Learning of Graph Representations with Graph Multiset Pooling
PDF
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
PDF
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
PDF
Adversarial Self-Supervised Contrastive Learning
PDF
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
PDF
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
PDF
Cost-effective Interactive Attention Learning with Neural Attention Process
PDF
Adversarial Neural Pruning with Latent Vulnerability Suppression
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Online Hyperparameter Meta-Learning with Hypergradient Distillation
Online Coreset Selection for Rehearsal-based Continual Learning
Representational Continuity for Unsupervised Continual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Skill-Based Meta-Reinforcement Learning
Edge Representation Learning with Hypergraphs
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Accurate Learning of Graph Representations with Graph Multiset Pooling
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
Adversarial Self-Supervised Contrastive Learning
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
Cost-effective Interactive Attention Learning with Neural Attention Process
Adversarial Neural Pruning with Latent Vulnerability Suppression
Ad

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
cuic standard and advanced reporting.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Electronic commerce courselecture one. Pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Big Data Technologies - Introduction.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Approach and Philosophy of On baking technology
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
Network Security Unit 5.pdf for BCA BBA.
cuic standard and advanced reporting.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Programs and apps: productivity, graphics, security and other tools
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Electronic commerce courselecture one. Pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Big Data Technologies - Introduction.pptx
Spectral efficient network and resource selection model in 5G networks
Approach and Philosophy of On baking technology
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Review of recent advances in non-invasive hemoglobin estimation
NewMind AI Weekly Chronicles - August'25 Week I
The Rise and Fall of 3GPP – Time for a Sabbatical?

Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

  • 1. 1 Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning 1School of Computing, 2Graduate School of AI, Korea Advanced Institute of Science and Technology, 3Aitrics, 4Department of Computer Science, University of Oxford Tuan Nguyen* 1,4, Hyewon Jeong* 1, Eunho Yang 1,2,3, and Sung Ju Hwang 1,2,3
  • 2. Clinical Risk Prediction with Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. 2 Introduction Heart Rate (HR) Respiratory Rate (RR) Oxygen saturation (SpO2) Body Temperature (BT) White Blood Cell Count (WBC) Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Features Tasks Task1: Fever Task2: Infection Task3: Mortality Negative Transfer MTL: clinical setting (MIMIC III-Infection)
  • 3. Clinical Risk Prediction with Multi-Task Learning Negative Transfer Problem in Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. 3 Introduction Heart Rate (HR) Respiratory Rate (RR) Oxygen saturation (SpO2) Body Temperature (BT) White Blood Cell Count (WBC) Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Features Tasks Task1: Fever Task2: Infection Task3: Mortality Negative Transfer MTL: clinical setting (MIMIC III-Infection) Unreliable Predictor
  • 4. Clinical Risk Prediction with Multi-Task Learning Asymmetric Knowledge Transfer Across Timesteps 4 Introduction 𝑓! 𝑓" 𝑓# … Fever 𝑖! 𝑖" 𝑖# Step 1 𝑚! 𝑚" 𝑚# … … Step 2 Step T Infection Mortality 낮은 불확실성 높은 불확실성 Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Deep AMTFL Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. MTL: clinical setting (MIMIC III-Infection)
  • 5. Probabilistic Asymmetric Multi-Task Learning (P-AMTL) Introduction Uncertainty-Aware Asymmetric Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
  • 6. Probabilistic Asymmetric Multi-Task Learning (P-AMTL) 6 0.3 0.4 0.5 0.6 0.7 0 0.02 0.04 0.06 0.08 0.1 0.12 Task 0 Task 1 Knowledge Transfer Loss KT in Loss-based AMTL Loss KT 0 0.02 0.04 0.06 0.08 0.1 0.12 0 0.2 0.4 0.6 0.8 Task 0 Task 1 Knowledge Transfer Uncertainty KT in P-AMTL UC KT 2000 200 instances 2000 200 instances -0.02 -0.015 -0.01 -0.005 0 0.005 0.01 0.015 Accuracy Improvement over STL Loss-based AMTL TPAMTL … … Step 1 Step 2 Step T Loss Loss Loss-Based AMTL (Lee et al., 2018) fd (1) fd (2) fd (3) fj (1) fj (2) fj (3) … Low UC High UC 𝑓! (#) UC-aware AMTL … Step 1 Step 2 Step T fd (1) fd (2) fd (3) fj (1) fj (2) fj (3) 𝑍!, 𝑍" : High level latent feature 𝑓!, 𝑓" : Multiple features across timesteps (𝑍! = 𝑓! # , 𝑓! $ , … , 𝑓! % ) (𝑍" = 𝑓" # , 𝑓" $ , … , 𝑓" % ) Task J Task D Approach Failure of Loss-based Asymmetric Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. Multiple Features Across Timesteps
  • 7. Failure of Loss-based AMTL 7 Approach Table 1. Task Performance of MNIST-variation Experiment (AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 8. Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens inter-task(in order to capture task relatedness) and across-timestep. Uncertainty Aware Knowledge Transfer: example case ! Multiple Features (zj for Task j) + Gj 2 αd,j Gd 1 ! fd (1) Multiple Features (zd for Task d) αj,d Gj 1 + Gj 1 Gj 1 fd (3) fd (1) fj (1) fd (3) fd (1) fj (1) Transform from more reliable to less reliable latent features. Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task !!,# = #!,# $!,#, $#, &!,# $ , &# $ !#,! = ##,! $#,!, $!, &#,! $ , &! $ "! (#) = $! (#) + &!(∑ ∑ )%,! ',# ∗ &% # '() * %+) $% ' ) ∀. ∈ {1,2, … , !} * Same also happens for intra-task, inter-timestep knowledge transfer TP-AMTL: Uncertainty-Aware Knowledge Transfer Approach
  • 9. TP-AMTL: Uncertainty-Aware Knowledge Transfer Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens inter-task(in order to capture task relatedness) and across-timestep. Uncertainty Aware Knowledge Transfer: example case 𝑇 Multiple Features (zj for Task j) + Gj 2 αd,j Gd 1 𝑇 fd (1) Multiple Features (zd for Task d) αj,d Gj 1 + Gj 1 Gj 1 fd (3) fd (1) fj (1) fd (3) fd (1) fj (1) Transform from more reliable to less reliable latent features. Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task Approach 𝛼!,# = 𝐹!,# 𝑍!,#, 𝑍#, 𝜎!,# $ , 𝜎# $ 𝛼#,! = 𝐹#,! 𝑍#,!, 𝑍!, 𝜎#,! $ , 𝜎! $ 𝐶% (&) = 𝑓% (&) + 𝐺%(∑!'( ) ∑*+( & 𝛼!,% *,& ∗ 𝐺! 𝑓! * ) ∀𝑡 ∈ {1,2, … , 𝑇} * Same also happens for intra-task, inter-timestep knowledge transfer 𝑧# ∼ 𝑝% 𝑧# 𝑥, 𝜔 𝑝% 𝑧# 𝑥, 𝜔 ∼ 𝒩(𝑧#; 𝜇#, 𝑑𝑖𝑎𝑔 𝜎# $ )
  • 10. Complexity Analysis 10 Approach Supplementary Table 1. Time Complexity of the Baseline Models
  • 11. Tasks and Datasets 11 Task 1 : Stay < 3 Length of ICU Stay Task 2 : Cardiac Recovering from Cardiac Surgery Task 4 : Mortality Task 3 : Recovery Recovering from general surgery PhysioNet2012 Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality MIMIC - III Infection 2,000 data points Tasks : Fever à Infection à Mortality Features: 12 Infection related features : including heart rate, arterial blood pressure, and Glasgow Coma Scale(GCS) etc. 4,000 distinct hospital (ICU) records Tasks: Stay < 3 / Cardiac / Recovery à Mortality Features: 31 physiological signs including heart rate, respiratory rate, temperature, etc. Experiments Information on MIMIC - III Respiratory Failure, Heart Failure can be found in the supplementary file
  • 12. Quantitative Results 12 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 13. Quantitative Results 13 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red) 1
  • 14. Quantitative Results 14 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 15. Quantitative Results 15 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 16. Quantitative Results 16 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 17. Quantitative Results 17 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 18. Source features with low uncertainties transfer knowledge more, while at the target, features with high uncertainties receive more knowledge transfer. Qualitative Results: Knowledge Transfer Graph Normalized amount of knowledge transfer from multiple sources (task 𝑗 at time 𝑡) to task 𝑑 (normalized over the number of targets) 18 Normalized amount of knowledge transfer to multiple targets (task 𝑑 at time 𝑡) from task 𝑗 (normalized over the number of sources) Incoming Transfer to different Targets Outgoing Transfer from different Sources 𝛼!,# &,& + 𝛼!,# &,&'( + ⋯ + 𝛼!,# &,) 𝑇 − 𝑡 + 1 − (1) 𝛼!,% (,& + 𝛼!,% -,& + ⋯ + 𝛼!,% &,& 𝑡 − (2) Experiments
  • 19. Qualitative Results: Medical Interpretation 19 Interpretation of the Learned Knowledge Graph By analyzing selected clinical case studies, we could identify steps where knowledge transferred as we designed and meaningful medical events occur, which correlates with interactions between selected tasks. MechVent - Mechanical Ventilation, FiO2 - Fractional inspired Oxygen, SBP - Systolic arterial blood pressure, DBP - Diastolic arterial blood pressure, HR - Heart Rate, Temp - Body Temperature, Urine - Urine output, GCS - Glasgow Coma Score, WBC - White Blood Cell Count, Culture - Culture Results. Experiments
  • 20. Ablation Study 20 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 21. Ablation Study 21 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 22. Ablation Study 22 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 23. Ablation Study 23 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 24. • We proposed a novel probabilistic asymmetric multi-task learning framework that allows asymmetric knowledge transfer between tasks at different timesteps, based on the uncertainty. • We use a probabilistic Bayesian formulation for asymmetric knowledge transfer, where the amount of knowledge transfer depends on the uncertainty at the feature level. • We validate our model on clinical risk prediction tasks, on which it achieves significant improvements over baselines and provides meaningful interpretations, including temporal relationships between tasks. Conclusions 24