SlideShare a Scribd company logo
Knowledge Engineering in Oncology



           Andre Dekker, PhD
            Medical Physicist
            MAASTRO Clinic
2

The five components of Radiation
            Oncology

            Clinic               Biology


                     Radiation
                     Oncology
                                      Molecular
     Physics                          Imaging


                     Computer
                      science




    © MAASTRO 2013
Contents                                                            3




 09:00-09:45 Knowledge Engineering: The need and the data
 • Problem: “Current quality of Medical Decisions”
 • Rapid Learning – How to get the data?

 09:45-10:00 Break

 10:00-10:45 Knowledge Engineering: From data to decision support
 • Methodology & an example
 • Demo of machine learning
 • Evaluating decision support

 10:45-11:00 Q&R


              © MAASTRO 2013
Quality of medical decisions
Experiment                                 5




       Lowest                 Highest
       Survival               Survival
      Probability            Probability




            © MAASTRO 2013
Experiment                  6




                         AUC
                         1.00




                         AUC
                         0.72




                         AUC
                         0.50



        © MAASTRO 2013
Prediction by MDs?                                                           7



                                                Non Small Cell Lung Cancer
                                                2 year survival
                                                30 patients
                                                8 MDs
                                                Retrospective
                                                AUC: 0.57


                                                Non Small Cell Lung Cancer
                                                2 year survival
                                                158 patients
                                                5 MDs
                                                Prospective
                                                AUC: 0.56
         © MAASTRO 2013   Cary Oberije et al.
Question                                                     8




 Why do you think the doctor‟s are bad at predicting outcomes?




           © MAASTRO 2013
The doctor is drowning                                                                 9




                                                         • Explosion of data
                                                         • Explosion of decisions
                                                         • Explosion of „evidence‟*
                                                            • 3 % in trials, bias
                                                            • Sharp knife




 *2010: 1574 & 1354 articles on lung cancer & radiotherapy = 7.5 per day
 Half-life of knowledge estimated at 7 years
                                                                    J Clin Oncol 2010;28:4268
                                                                    JMI 2012 Friedman, Rigby
               © MAASTRO 2013
Problem definition                                            11




  • Personalized medicine is the future
  • Prediction of the outcome for treatment a, b or c is
    needed
  • The current method of getting evidence is too costly
    and unsustainable
  • It is unethical to ask a person to make that prediction


                                J Clin Oncol 28:4268-4274


           © MAASTRO 2013
Improve medical decisions
Rapid Learning                                 13




 In [..] rapid-learning [..] data routinely
 generated through patient care and
 clinical research feed into an ever-
 growing [..] set of coordinated
 databases.
 J Clin Oncol 2010;28:4268

 [..] rapid learning [..] where we can
 learn from each patient to guide
 practice, is [..] crucial to guide rational
 health policy and to contain costs [..].
 Lancet Oncol 2011;12:933




                    © MAASTRO 2013
Engineering Knowledge                                                14




         © MAASTRO 2013
         Lambin, P. et al. Nat. Rev. Clin. Oncol. 10, 27–40 (2013)
15




© MAASTRO 2013
CAT ~ 2005 = MAASTRO Knowledge Engineering          16




     Build Decision Support Systems to
           individualize patient care         Theme 2
          by using machine learning           Learning
                                              (5%)
            to extract multifactorial
       personalized prediction models
           from existing databases
                                              Theme 1
      containing all data on all patients     Data
    that are validated in external datasets   (95%)

          © MAASTRO 2013
Barriers to sharing data
and a way to overcome these
Question                                                     18




 What do you think would prevent people from sharing data?




           © MAASTRO 2013
Barriers to sharing data                                      19




  [..] the problem is not really technical […]. Rather, the
      problems are ethical, political, and administrative.
       Lancet Oncol 2011;12:933



  1. Administrative (time to capture, time to curate)
  2.    Political (value, authorship)
  3.    Ethical (privacy)

  4.    Technical




                    © MAASTRO 2013
Basic concept in data sharing   20




  • Syntax and Semantics
  • Centralized and Federated




           © MAASTRO 2013
Syntactics and Semantics – A story
Semantic interoperability – Patrick story                           22




  To explain and distinguish the 4 different levels, consider the
  following scenario:

  56 year old Patrick recently moved from Ireland to Spain to
  take up his new job in a multinational IT company. A few
  weeks after arriving, he falls ill, consults his local (Spanish) GP
  and is transferred to the next hospital for further tests.




             © MAASTRO 2013
             SemanticHEALTH Report, January 2009
Semantic interoperability – Level 0                                 23




  Level 0 (no interoperability at all)

  Patrick has to undergo a full set of lengthy investigations for
  the doctors to find out the cause of his severe pain.

  Unfortunately, results from the local GP as well as from his
  Irish GP are not available at the point of care within the
  hospital due to the missing technical equipment.



             © MAASTRO 2013
              SemanticHEALTH Report, January 2009
Semantic interoperability – Level 1                                          24




  Level 1 (technical and syntactical interoperability):

  Patrick’s doctor in the hospital is able to receive electronic documents
  that were released from the Irish GP as well as his local GP upon
  request. Widely available applications supporting syntactical
  interoperability (such as web browsers and email clients), allow the
  download of patient data and provide immediate access.

  Unfortunately, none of the available doctors in the hospital is able to
  translate the Irish document, and only human intervention allows
  interpreting the information submitted by the local GP for adding into
  the hospitals information system.
              © MAASTRO 2013
              SemanticHEALTH Report, January 2009
Semantic interoperability – Level 2                                           25




  Level 2 (partial semantic interoperability):

  The Spanish hospital doctor is able to securely access via the Internet
  parts of Patrick’s Electronic Health Record released by his Irish GP as
  well as the local GP that he visited just hours earlier.

  Although both documents contain mostly free text, fragments of high
  importance (such as demographics, allergies, diagnoses, and parts of
  medical history) are encoded using international coding schemes,
  which the hospital information system can automatically detect, interpret
  and meaningfully present to the attending physician.


              © MAASTRO 2013
              SemanticHEALTH Report, January 2009
Semantic interoperability – Level 3                                        26




  Level 3 (full semantic interoperability, co-operability)

  In this ideal situation and after thorough authentication took place, the
  Spanish hospital information system is able to automatically access,
  interpret and present all necessary medical information about Patrick to
  the physician at the point of care.

  Neither language nor technological differences prevent the system to
  seamlessly integrate the received information into the local record
  and provide a complete picture of Patrick’s health as if it would have
  been collected locally. Further, the anonymised data feeds directly
  into the tools of public health authorities and researchers.
              © MAASTRO 2013
              SemanticHEALTH Report, January 2009
Question                                                        27




 What semantic interoperability level do you think medicine is at
 the moment in NL & Europe?




           © MAASTRO 2013
Central versus Federated Data Sharing
Centralized Data (e.g. for Research)                                      29


      Hospital 1
               HIS                           Research System
                               data domains
              PACS
                                 clinical               integrated data
               LIS                 Open
                                   Clinica


                                 imaging
      Hospital 2
                                   NBIA
               HIS                                             e.g.
                                                           tranSMART
                               biobanking
              PACS
                                    e.g.
                                  caTissue
               LIS




              © MAASTRO 2013
Federated Data (e.g. for Research)            30


             Hospital 1
                            integrated data
           HIS


           PACS                   e.g.
                                euroCAT

           LIS




             Hospital 2     integrated data
           HIS

                                  e.g.
           PACS                 euroCAT



           LIS




           © MAASTRO 2013
Question                                                       31




 What do you think are the pros and cons for centralized vs.
 federated?




           © MAASTRO 2013
An example data sharing project: euroCAT
Example: MAASTRO‟s euroCAT approach                                                  33




 euroCAT is a research project in which

 we develop an IT infrastructure -> technical

 to make radiotherapy centers (Maastricht, Liege, Aachen, Eindhoven, Hasselt)

 semantic interoperable (SIOp*) / machine readable -> administrative

 while the data stays inside your hospital -> ethical

 under your full control -> political
 * SIOp level 3 = Machine Readable ->Data in common syntax and with common meaning

                   © MAASTRO 2013
Components                                             34




                           CTMS
                           PACS     ETL
                           Export

             Distributed                    Deident.
              Learning                      & Filter




            Application
                                            Ontology
             Sharing


                            XML
                                    Query
                           DICOM




        © MAASTRO 2013
Ontology – International Coding System                                                         35




                                                                     2. Search the ontology
                                                                    for the matching concept




                      1. Select the local term




     声门下区




                                                           3. Map the local term to
                                                                the ontology




                               4. See the result of your
            © MAASTRO 2013             mapping
Ontology use                         36




Ontology is a set of terms & their
relationships

Retrospective analysis
“Xerostomia”

All head & neck cancer patients




            © MAASTRO 2013
Data extraction system - Federated   37




          © MAASTRO 2013
Distributed                                         Final Model Created


    Learning                                                                               Update Model
    Architecture                                                                 Central Server




                                            Send Average                                           Send Average
                                                                    Send Average                   Consensus Model
                                            Consensus Model
                                                                    Consensus Model

       Send Model
       Parameters




                                           Send Model
                                           Parameters
                       Model Server RTOG                                              Send Model
                                                                                      Parameters

                                                                                                                       Model Server Roma
                                                                  Model Server
        Learn Model from
                                                                  MAASTRO
           Local Data




                                                                                                        Learn Model from
                                               Learn Model from                                            Local Data
                                                  Local Data
Only aggregate data is exchanged between the Central Server and the local Servers
Lecture key points                                                                            39




  •   The problem of decision is caused by
       • Too much data on an individual patient to process for a human being
       • Not enough evidence in literature to make a decision in an individual patient
       • Not enough patients in trials
       • Patients that are selected in these trials do not represent the patient in which a
          decision needs to be taken
  •   This problem will get worse as the number of decisions is rising in personalized
      medicine

  •   Rapid learning or learning from each patient is hampered by barriers to share data
  •   The most important barrier is the administrative effort needed to capture & share
      data
  •   Semantic interoperability and federation may overcome some of these barriers


                 © MAASTRO 2013
BREAK
How to go
from data to decision support
Engineering: Data>Model>Decision Support   42




         © MAASTRO 2013
Data>Model>Decision Support                      43




 1. Modeling
    “Learn a model from data”

 2. Validation
    “Estimate model performance”

 3. Decision Support
    “Impact of the model on clinical practice”



           © MAASTRO 2013
Learn a model from data                                   44




    Training cohort
         – 322 patients (MAASTRO)
    Clinical variables
    Support Vector Machines
    Nomogram




Dehing-Oberije (MAASTRO), IJROBP 2009;74:355
                    © MAASTRO 2013   Cary Oberije et al.
Estimate model performance                                                             45




                                                           • INDEPENDENT Validation
                                                             cohort
                                                             – 36 patients (Leuven)
                                                             – 65 patients (Ghent)
                                                           • Discrimination, Calibration,
                                                             Reclassification

                                                           • AUC 0.75

Dehing-Oberije (MAASTRO), IJROBP 2009;74:355
                    © MAASTRO 2013   Cary Oberije et al.
Decision Support                                   46



                             Stage IIIA 10 (14%)
                             Stage IIIB 13 (19%)
                             T4 12 (17%)




         © MAASTRO 2013   Cary Oberije et al.
How to learn better?          47




• More patients
• Diversity
• More variables
  •   Overfitting


• Different machine
  learning methods


             © MAASTRO 2013
Let‟s build a model from data
Lung cancer staging system   49




         © MAASTRO 2013
Hugin                    50




        © MAASTRO 2013
Discussion model                                              51




 • There is always missing data
 • Some states (N2) have low number of observations
 • Machine learning without input from domain experts gives
   models that do not make sense
   • T->N->M & TNM->Stage
 • There is always bias in data (e.g. T4N0M0->IIIB, T2N3M0-
   >IIIA) -> Most difficult too solve
 • You learn something new T->N Relation
 • (Low numbers give bad models)

          © MAASTRO 2013
Why is there a bias, any ideas?   52




          © MAASTRO 2013
Validation
Validation of a survival model                                       54




  • Discrimination: Is the model able to classify the population
    into two or more groups with different observed survival in
    an external validation set?
  • Calibration: Is the estimated probability of survival equal to
    the observed survival probability in an external validation
    set?
  • Clinical usefulness: Are the training and external validation
    set representative for my patient and is the predicted
    outcome clinically relevant for my patient?


            © MAASTRO 2013
Laryngeal carcinoma model          55




994 MAASTRO patients
1990-2005
www.predictcancer.org
Input parameters
    –   Age
    –   Hemoglobin
    –   T-stage
    –   Radiotherapy Dose (Gy)
    –   Gender
    –   N+
    –   Tumor location
Output parameters
    – Overall survival



                  © MAASTRO 2013
Validation @ RTOG (Trial 0522) - Result   56




         © MAASTRO 2013
Data>Model>Decision Support                                      57




 Prediction Models: Revolutionary in Principle, But Do They Do
 More Good Than Harm?

 “we are drowning in prediction models [..] more than 100
 prediction models on prostate cancer alone”

 “currently [..] a large number of models [..] are not
 independently validated at all”

 J Clin Oncol 2011;29:2951


                © MAASTRO 2013
Models built & validated                                   58




Lung cancer                    Rectal cancer
   – Survival                     –   Tumor response
   – Lung dyspnea                 –   Local recurrences
   – Lung dysphagia               –   Distant metastases
                                  –   Overall survival
Larynx cancer
   – Local recurrences
   – Overall survival



    www.predictcancer.org
              © MAASTRO 2013
Lecture key points                                                                           59




  •   The process to go from data to decision support is through Modeling & External!
      Validation
  •   Adding more patients and more diverse patients to your training set is always good
  •   Beware of adding more variables (overfitting) without adding more patients
  •   When evaluating decision support systems look at
       • Discrimination: Classify into two or more groups with different event probability
       • Calibration: Observed event probability in the external validation
       • Clinical Useful: Representative/relevant patient population




                 © MAASTRO 2013
Thank you for your attention

            More info on:
          www.eurocat.info
        www.predictcancer.org
         www.cancerdata.org
           www.mistir.info
          www.maastro.nl

More Related Content

PDF
Puplication-Assesssment of Vitritis On OCT-Journel TVST
PDF
Neural Network Based Classification and Diagnosis of Brain Hemorrhages
PDF
DISCOVERING ABNORMAL PATCHES AND TRANSFORMATIONS OF DIABETICS RETINOPATHY IN ...
PDF
[ASGO 2019] Artificial Intelligence in Medicine
PDF
Big data, big knowledge big data for personalized healthcare
PDF
인공지능은 의료를 어떻게 혁신하는가 (2019년 7월) (하)
PDF
Case Based Medical Diagnosis of Occupational Chronic Lung Diseases From Their...
PDF
Study on Physicians Request for Computed Tomography Examinations for Patients...
Puplication-Assesssment of Vitritis On OCT-Journel TVST
Neural Network Based Classification and Diagnosis of Brain Hemorrhages
DISCOVERING ABNORMAL PATCHES AND TRANSFORMATIONS OF DIABETICS RETINOPATHY IN ...
[ASGO 2019] Artificial Intelligence in Medicine
Big data, big knowledge big data for personalized healthcare
인공지능은 의료를 어떻게 혁신하는가 (2019년 7월) (하)
Case Based Medical Diagnosis of Occupational Chronic Lung Diseases From Their...
Study on Physicians Request for Computed Tomography Examinations for Patients...

What's hot (20)

PDF
[대한병리학회] 의료 인공지능 101: 병리를 중심으로
PDF
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
PDF
Hybrid lwt svd watermarking optimized using metaheuristic algorithms along wi...
PDF
Thinking about Data Strategy: for Ophthalmologists
PDF
3024507
PDF
Interpretability and informatics of deep learning in medical images3
PPTX
TADAA - Towards Automated Detection of Anaesthetic Activity
PPTX
MedicalResearch.com: Medical Research Exclusive Interviews December 14 2014
PDF
AI in Ophthalmology | Startup Landscape
PDF
Raai 2019 clinical unmet needs and its solutions of deep learning in medicine3
PDF
Sense Net Mmvr2005
PDF
APPLY MACHINE LEARNING METHODS TO PREDICT FAILURE OF GLAUCOMA DRAINAGE
PDF
Early Detection of High Blood Pressure and Diabetic Retinopathy on Retinal Fu...
PDF
Data-driven Ophthalmology
PDF
IRJET - Automatic Detection of Diabetic Retinopathy in Retinal Image
PDF
IFMIA 2019 Plenary Talk : Deep Learning in Medicine; Engineers' Perspectives
PDF
CLASSIFICATION OF OCT IMAGES FOR DETECTING DIABETIC RETINOPATHY DISEASE USING...
PPTX
ARTIFICIAL NEURAL NETWORKING in Dentistry
PDF
Biomedical Signal Origin and Dynamics
PDF
A COMPARATIVE STUDY OF MACHINE LEARNING ALGORITHMS FOR EEG SIGNAL CLASSIFICATION
[대한병리학회] 의료 인공지능 101: 병리를 중심으로
인공지능은 의료를 어떻게 혁신할 것인가 (ver 2)
Hybrid lwt svd watermarking optimized using metaheuristic algorithms along wi...
Thinking about Data Strategy: for Ophthalmologists
3024507
Interpretability and informatics of deep learning in medical images3
TADAA - Towards Automated Detection of Anaesthetic Activity
MedicalResearch.com: Medical Research Exclusive Interviews December 14 2014
AI in Ophthalmology | Startup Landscape
Raai 2019 clinical unmet needs and its solutions of deep learning in medicine3
Sense Net Mmvr2005
APPLY MACHINE LEARNING METHODS TO PREDICT FAILURE OF GLAUCOMA DRAINAGE
Early Detection of High Blood Pressure and Diabetic Retinopathy on Retinal Fu...
Data-driven Ophthalmology
IRJET - Automatic Detection of Diabetic Retinopathy in Retinal Image
IFMIA 2019 Plenary Talk : Deep Learning in Medicine; Engineers' Perspectives
CLASSIFICATION OF OCT IMAGES FOR DETECTING DIABETIC RETINOPATHY DISEASE USING...
ARTIFICIAL NEURAL NETWORKING in Dentistry
Biomedical Signal Origin and Dynamics
A COMPARATIVE STUDY OF MACHINE LEARNING ALGORITHMS FOR EEG SIGNAL CLASSIFICATION
Ad

Similar to Knowledge Engineering in Oncology (20)

PPTX
Future Diagnostics, In Vitro Diagnostiek "Bloedserieus"
PPTX
MAASTRO Knowledge Engineering: The Fun(ction) of Medical Physics in Cancer Care
PDF
SMi Group's 3D Cell Culture 2020 conference
PPTX
University of Toronto - Radiomics for Oncology - 2017
PDF
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
PDF
Predictions And Analytics In Healthcare: Advancements In Machine Learning
PPTX
Thomas Willkens-El impacto de las ciencias ómicas en la medicina, la nutrició...
PDF
Precision Oncology adoption: The future is now
PDF
Case Study: Advanced analytics in healthcare using unstructured data
PDF
Real-World Evidence: The Future of Data Generation and Usage
PDF
How does machine learning help in cancer detection
PPTX
Revolutionizing Clinical Trial Data Quality through Intelligent Query Management
PDF
Intracranial-Tumor Detection and Classification System using Convnet and Tran...
PDF
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
PDF
IRJET- Oral Cancer Detection using Machine Learning
PPTX
The Application of Data science in Healthcare
PDF
Fyp presentation
PPTX
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
PDF
Paper id 36201506
PDF
SMi Group's 3d Cell Culture 2019 conference
Future Diagnostics, In Vitro Diagnostiek "Bloedserieus"
MAASTRO Knowledge Engineering: The Fun(ction) of Medical Physics in Cancer Care
SMi Group's 3D Cell Culture 2020 conference
University of Toronto - Radiomics for Oncology - 2017
How AstraZeneca is Applying AI, Imaging & Data Analytics (AI-Driven Drug Deve...
Predictions And Analytics In Healthcare: Advancements In Machine Learning
Thomas Willkens-El impacto de las ciencias ómicas en la medicina, la nutrició...
Precision Oncology adoption: The future is now
Case Study: Advanced analytics in healthcare using unstructured data
Real-World Evidence: The Future of Data Generation and Usage
How does machine learning help in cancer detection
Revolutionizing Clinical Trial Data Quality through Intelligent Query Management
Intracranial-Tumor Detection and Classification System using Convnet and Tran...
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
IRJET- Oral Cancer Detection using Machine Learning
The Application of Data science in Healthcare
Fyp presentation
Big Databases and Outcome Research - Opportunities and Challenges for Radiati...
Paper id 36201506
SMi Group's 3d Cell Culture 2019 conference
Ad

More from MAASTRO clinic (16)

PPTX
Maastro corporate presentatie 2015
PPTX
e-Learning What, Why & How
PPTX
ARTFORCE - Radiation Protection Zr-cetuximab
PPTX
Multi-centric learning from medical data
PPTX
General information course 2.3
PPTX
Carbonic Anhydrase IX: regulation and role in cancer
PPT
Measuring of biological parameters of the tumor microenvironment – advantages...
PPT
Principles of 3D Fitting of Molecules
PPTX
Clinical Imaging Hypoxia
PPT
Hypoxia as a target for personalized medicine
PPT
Global & local oxygen control in in vitro systems
PPTX
Tumour Hypoxia - detection and prognostic significance
PPTX
Biological responses to tumor hypoxia & their potential as therapeutic targets
PPTX
HIF in cell Biology & Physiology
PDF
METOXIA Framework and Hypoxia and Acidosis in Human Physiology and Diseases
PDF
Epigenetic Mechanisms Regulating the Cellular Response to Hypoxia
Maastro corporate presentatie 2015
e-Learning What, Why & How
ARTFORCE - Radiation Protection Zr-cetuximab
Multi-centric learning from medical data
General information course 2.3
Carbonic Anhydrase IX: regulation and role in cancer
Measuring of biological parameters of the tumor microenvironment – advantages...
Principles of 3D Fitting of Molecules
Clinical Imaging Hypoxia
Hypoxia as a target for personalized medicine
Global & local oxygen control in in vitro systems
Tumour Hypoxia - detection and prognostic significance
Biological responses to tumor hypoxia & their potential as therapeutic targets
HIF in cell Biology & Physiology
METOXIA Framework and Hypoxia and Acidosis in Human Physiology and Diseases
Epigenetic Mechanisms Regulating the Cellular Response to Hypoxia

Recently uploaded (20)

PDF
01-Introduction-to-Information-Management.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Microbial disease of the cardiovascular and lymphatic systems
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Pharma ospi slides which help in ospi learning
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
Classroom Observation Tools for Teachers
PDF
Weekly quiz Compilation Jan -July 25.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
01-Introduction-to-Information-Management.pdf
Final Presentation General Medicine 03-08-2024.pptx
Anesthesia in Laparoscopic Surgery in India
Microbial disease of the cardiovascular and lymphatic systems
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
O7-L3 Supply Chain Operations - ICLT Program
STATICS OF THE RIGID BODIES Hibbelers.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Orientation - ARALprogram of Deped to the Parents.pptx
Complications of Minimal Access Surgery at WLH
Pharma ospi slides which help in ospi learning
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Chinmaya Tiranga quiz Grand Finale.pdf
Classroom Observation Tools for Teachers
Weekly quiz Compilation Jan -July 25.pdf
Microbial diseases, their pathogenesis and prophylaxis

Knowledge Engineering in Oncology

  • 1. Knowledge Engineering in Oncology Andre Dekker, PhD Medical Physicist MAASTRO Clinic
  • 2. 2 The five components of Radiation Oncology Clinic Biology Radiation Oncology Molecular Physics Imaging Computer science © MAASTRO 2013
  • 3. Contents 3 09:00-09:45 Knowledge Engineering: The need and the data • Problem: “Current quality of Medical Decisions” • Rapid Learning – How to get the data? 09:45-10:00 Break 10:00-10:45 Knowledge Engineering: From data to decision support • Methodology & an example • Demo of machine learning • Evaluating decision support 10:45-11:00 Q&R © MAASTRO 2013
  • 4. Quality of medical decisions
  • 5. Experiment 5 Lowest Highest Survival Survival Probability Probability © MAASTRO 2013
  • 6. Experiment 6 AUC 1.00 AUC 0.72 AUC 0.50 © MAASTRO 2013
  • 7. Prediction by MDs? 7 Non Small Cell Lung Cancer 2 year survival 30 patients 8 MDs Retrospective AUC: 0.57 Non Small Cell Lung Cancer 2 year survival 158 patients 5 MDs Prospective AUC: 0.56 © MAASTRO 2013 Cary Oberije et al.
  • 8. Question 8 Why do you think the doctor‟s are bad at predicting outcomes? © MAASTRO 2013
  • 9. The doctor is drowning 9 • Explosion of data • Explosion of decisions • Explosion of „evidence‟* • 3 % in trials, bias • Sharp knife *2010: 1574 & 1354 articles on lung cancer & radiotherapy = 7.5 per day Half-life of knowledge estimated at 7 years J Clin Oncol 2010;28:4268 JMI 2012 Friedman, Rigby © MAASTRO 2013
  • 10. Problem definition 11 • Personalized medicine is the future • Prediction of the outcome for treatment a, b or c is needed • The current method of getting evidence is too costly and unsustainable • It is unethical to ask a person to make that prediction J Clin Oncol 28:4268-4274 © MAASTRO 2013
  • 12. Rapid Learning 13 In [..] rapid-learning [..] data routinely generated through patient care and clinical research feed into an ever- growing [..] set of coordinated databases. J Clin Oncol 2010;28:4268 [..] rapid learning [..] where we can learn from each patient to guide practice, is [..] crucial to guide rational health policy and to contain costs [..]. Lancet Oncol 2011;12:933 © MAASTRO 2013
  • 13. Engineering Knowledge 14 © MAASTRO 2013 Lambin, P. et al. Nat. Rev. Clin. Oncol. 10, 27–40 (2013)
  • 15. CAT ~ 2005 = MAASTRO Knowledge Engineering 16 Build Decision Support Systems to individualize patient care Theme 2 by using machine learning Learning (5%) to extract multifactorial personalized prediction models from existing databases Theme 1 containing all data on all patients Data that are validated in external datasets (95%) © MAASTRO 2013
  • 16. Barriers to sharing data and a way to overcome these
  • 17. Question 18 What do you think would prevent people from sharing data? © MAASTRO 2013
  • 18. Barriers to sharing data 19 [..] the problem is not really technical […]. Rather, the problems are ethical, political, and administrative. Lancet Oncol 2011;12:933 1. Administrative (time to capture, time to curate) 2. Political (value, authorship) 3. Ethical (privacy) 4. Technical © MAASTRO 2013
  • 19. Basic concept in data sharing 20 • Syntax and Semantics • Centralized and Federated © MAASTRO 2013
  • 21. Semantic interoperability – Patrick story 22 To explain and distinguish the 4 different levels, consider the following scenario: 56 year old Patrick recently moved from Ireland to Spain to take up his new job in a multinational IT company. A few weeks after arriving, he falls ill, consults his local (Spanish) GP and is transferred to the next hospital for further tests. © MAASTRO 2013 SemanticHEALTH Report, January 2009
  • 22. Semantic interoperability – Level 0 23 Level 0 (no interoperability at all) Patrick has to undergo a full set of lengthy investigations for the doctors to find out the cause of his severe pain. Unfortunately, results from the local GP as well as from his Irish GP are not available at the point of care within the hospital due to the missing technical equipment. © MAASTRO 2013 SemanticHEALTH Report, January 2009
  • 23. Semantic interoperability – Level 1 24 Level 1 (technical and syntactical interoperability): Patrick’s doctor in the hospital is able to receive electronic documents that were released from the Irish GP as well as his local GP upon request. Widely available applications supporting syntactical interoperability (such as web browsers and email clients), allow the download of patient data and provide immediate access. Unfortunately, none of the available doctors in the hospital is able to translate the Irish document, and only human intervention allows interpreting the information submitted by the local GP for adding into the hospitals information system. © MAASTRO 2013 SemanticHEALTH Report, January 2009
  • 24. Semantic interoperability – Level 2 25 Level 2 (partial semantic interoperability): The Spanish hospital doctor is able to securely access via the Internet parts of Patrick’s Electronic Health Record released by his Irish GP as well as the local GP that he visited just hours earlier. Although both documents contain mostly free text, fragments of high importance (such as demographics, allergies, diagnoses, and parts of medical history) are encoded using international coding schemes, which the hospital information system can automatically detect, interpret and meaningfully present to the attending physician. © MAASTRO 2013 SemanticHEALTH Report, January 2009
  • 25. Semantic interoperability – Level 3 26 Level 3 (full semantic interoperability, co-operability) In this ideal situation and after thorough authentication took place, the Spanish hospital information system is able to automatically access, interpret and present all necessary medical information about Patrick to the physician at the point of care. Neither language nor technological differences prevent the system to seamlessly integrate the received information into the local record and provide a complete picture of Patrick’s health as if it would have been collected locally. Further, the anonymised data feeds directly into the tools of public health authorities and researchers. © MAASTRO 2013 SemanticHEALTH Report, January 2009
  • 26. Question 27 What semantic interoperability level do you think medicine is at the moment in NL & Europe? © MAASTRO 2013
  • 27. Central versus Federated Data Sharing
  • 28. Centralized Data (e.g. for Research) 29 Hospital 1 HIS Research System data domains PACS clinical integrated data LIS Open Clinica imaging Hospital 2 NBIA HIS e.g. tranSMART biobanking PACS e.g. caTissue LIS © MAASTRO 2013
  • 29. Federated Data (e.g. for Research) 30 Hospital 1 integrated data HIS PACS e.g. euroCAT LIS Hospital 2 integrated data HIS e.g. PACS euroCAT LIS © MAASTRO 2013
  • 30. Question 31 What do you think are the pros and cons for centralized vs. federated? © MAASTRO 2013
  • 31. An example data sharing project: euroCAT
  • 32. Example: MAASTRO‟s euroCAT approach 33 euroCAT is a research project in which we develop an IT infrastructure -> technical to make radiotherapy centers (Maastricht, Liege, Aachen, Eindhoven, Hasselt) semantic interoperable (SIOp*) / machine readable -> administrative while the data stays inside your hospital -> ethical under your full control -> political * SIOp level 3 = Machine Readable ->Data in common syntax and with common meaning © MAASTRO 2013
  • 33. Components 34 CTMS PACS ETL Export Distributed Deident. Learning & Filter Application Ontology Sharing XML Query DICOM © MAASTRO 2013
  • 34. Ontology – International Coding System 35 2. Search the ontology for the matching concept 1. Select the local term 声门下区 3. Map the local term to the ontology 4. See the result of your © MAASTRO 2013 mapping
  • 35. Ontology use 36 Ontology is a set of terms & their relationships Retrospective analysis “Xerostomia” All head & neck cancer patients © MAASTRO 2013
  • 36. Data extraction system - Federated 37 © MAASTRO 2013
  • 37. Distributed Final Model Created Learning Update Model Architecture Central Server Send Average Send Average Send Average Consensus Model Consensus Model Consensus Model Send Model Parameters Send Model Parameters Model Server RTOG Send Model Parameters Model Server Roma Model Server Learn Model from MAASTRO Local Data Learn Model from Learn Model from Local Data Local Data Only aggregate data is exchanged between the Central Server and the local Servers
  • 38. Lecture key points 39 • The problem of decision is caused by • Too much data on an individual patient to process for a human being • Not enough evidence in literature to make a decision in an individual patient • Not enough patients in trials • Patients that are selected in these trials do not represent the patient in which a decision needs to be taken • This problem will get worse as the number of decisions is rising in personalized medicine • Rapid learning or learning from each patient is hampered by barriers to share data • The most important barrier is the administrative effort needed to capture & share data • Semantic interoperability and federation may overcome some of these barriers © MAASTRO 2013
  • 39. BREAK
  • 40. How to go from data to decision support
  • 42. Data>Model>Decision Support 43 1. Modeling “Learn a model from data” 2. Validation “Estimate model performance” 3. Decision Support “Impact of the model on clinical practice” © MAASTRO 2013
  • 43. Learn a model from data 44 Training cohort – 322 patients (MAASTRO) Clinical variables Support Vector Machines Nomogram Dehing-Oberije (MAASTRO), IJROBP 2009;74:355 © MAASTRO 2013 Cary Oberije et al.
  • 44. Estimate model performance 45 • INDEPENDENT Validation cohort – 36 patients (Leuven) – 65 patients (Ghent) • Discrimination, Calibration, Reclassification • AUC 0.75 Dehing-Oberije (MAASTRO), IJROBP 2009;74:355 © MAASTRO 2013 Cary Oberije et al.
  • 45. Decision Support 46 Stage IIIA 10 (14%) Stage IIIB 13 (19%) T4 12 (17%) © MAASTRO 2013 Cary Oberije et al.
  • 46. How to learn better? 47 • More patients • Diversity • More variables • Overfitting • Different machine learning methods © MAASTRO 2013
  • 47. Let‟s build a model from data
  • 48. Lung cancer staging system 49 © MAASTRO 2013
  • 49. Hugin 50 © MAASTRO 2013
  • 50. Discussion model 51 • There is always missing data • Some states (N2) have low number of observations • Machine learning without input from domain experts gives models that do not make sense • T->N->M & TNM->Stage • There is always bias in data (e.g. T4N0M0->IIIB, T2N3M0- >IIIA) -> Most difficult too solve • You learn something new T->N Relation • (Low numbers give bad models) © MAASTRO 2013
  • 51. Why is there a bias, any ideas? 52 © MAASTRO 2013
  • 53. Validation of a survival model 54 • Discrimination: Is the model able to classify the population into two or more groups with different observed survival in an external validation set? • Calibration: Is the estimated probability of survival equal to the observed survival probability in an external validation set? • Clinical usefulness: Are the training and external validation set representative for my patient and is the predicted outcome clinically relevant for my patient? © MAASTRO 2013
  • 54. Laryngeal carcinoma model 55 994 MAASTRO patients 1990-2005 www.predictcancer.org Input parameters – Age – Hemoglobin – T-stage – Radiotherapy Dose (Gy) – Gender – N+ – Tumor location Output parameters – Overall survival © MAASTRO 2013
  • 55. Validation @ RTOG (Trial 0522) - Result 56 © MAASTRO 2013
  • 56. Data>Model>Decision Support 57 Prediction Models: Revolutionary in Principle, But Do They Do More Good Than Harm? “we are drowning in prediction models [..] more than 100 prediction models on prostate cancer alone” “currently [..] a large number of models [..] are not independently validated at all” J Clin Oncol 2011;29:2951 © MAASTRO 2013
  • 57. Models built & validated 58 Lung cancer Rectal cancer – Survival – Tumor response – Lung dyspnea – Local recurrences – Lung dysphagia – Distant metastases – Overall survival Larynx cancer – Local recurrences – Overall survival www.predictcancer.org © MAASTRO 2013
  • 58. Lecture key points 59 • The process to go from data to decision support is through Modeling & External! Validation • Adding more patients and more diverse patients to your training set is always good • Beware of adding more variables (overfitting) without adding more patients • When evaluating decision support systems look at • Discrimination: Classify into two or more groups with different event probability • Calibration: Observed event probability in the external validation • Clinical Useful: Representative/relevant patient population © MAASTRO 2013
  • 59. Thank you for your attention More info on: www.eurocat.info www.predictcancer.org www.cancerdata.org www.mistir.info www.maastro.nl