SlideShare a Scribd company logo
Connectome Classification:
     Statistical Connectomics for
    Analysis of Connectome Data

        Joshua T. Vogelstein, PhD
        d: Applied Math. & Stats
        u: Johns Hopkins
        w: jovo.me
        e: joshuav@jhu.edu
Statistical Connectomics

  Statistics   “the art of data collection and analysis”

Connectomics “the study of connectomes”
  Statistical “the art of connectome data collection
Connectomics and analysis”
Contributors
         Stats
   Carey E. Priebe        Data Collection
 Glen A. Coppersmith       Susan Resnick
    Mark Dredze

Connectome Inference
    Will R. Gray               Wisdom
    John Bogovic          R. Jacob Vogelstein
     Jerry Prince


          Support: various grants
Simplest. Example. Ever.
Blind People        Deaf People

    V1                  V1




  A1 M1              A1 M1
Simplest. Example. Ever.
Blind People                            Deaf People

    V1          No possible classifier       V1
                   based on graph
               invariants can perform
                 this insanely simple
                     classification
  A1 M1                                  A1 M1
                      problem!!!
Realest. Example. Ever.
MR Connectome Gender Classification


 statistical graph model   graph invariants

    > 83% accuracy         < 75% accuracy
Statistical              Connectomics
1. Collect Data                  Multi-Modal MR Imaging
2. Preprocess Data              MR Connectome Pipeline
3. Assumptions                       Signal Subgraph
4. Construct a Decision Rule   Robust Bayes Plugin Classifier
5. Evaluate Performance        Leave-One-Out X-Validation
6. Check Assumptions              Synthetic Data Analysis
7. Extensions                       Relax assumptions
Statistical              Connectomics
1. Collect Data                  Multi-Modal MR Imaging
2. Preprocess Data              MR Connectome Pipeline
3. Assumptions                       Signal Subgraph
4. Construct a Decision Rule   Robust Bayes Plugin Classifier
5. Evaluate Performance        Leave-One-Out X-Validation
6. Check Assumptions              Synthetic Data Analysis
7. Extensions                       Relax assumptions
1. Collect Data:
  Multi-Modal MR Imaging

• 49 senior individuals; 25 male, 24 female
 • diffusion: standard DTI protocol
 • structural: standard MPRAGE protocol
2. Preprocess Data:
MR Connectome Automated Pipeline


• coherent collection of code
• fully automatic and modular
• about 12 hrs/subject/core
• yields 70 vertex graph/subject


        http://www.nitrc.org/projects/mrcap/
3. Data Assumptions:
   Signal Subgraph
4. Construct a Decision Rule:
Robust Bayes Plugin Classifier

• asymptotically optimal and robust
• finite sample niceness
         
                  auv                   1−auv
 y=
 ˆ               puv|y (1
                 ˆ          − puv|y )
                              ˆ                 πy
                                                ˆ
             ˆ
       (u,v)∈S
5. Evaluate Performance:
    Leave-One-Out X-Validation
                                      incoherent estimator                                                  coherent estimator
                                                                                                                                      0.5
misclassification rate




                                                                                   # signal−vertices
                          0.5 L π
                              ˆˆ     = 0. 5                        ˆ
                                                                   L n b = 0. 41                                 ˆ
                                                                                                                 L c o h= 0. 16
                                                                                                       10                             0.4

                         0.25                                                                          20                             0.3
                                              ˆ
                                              L i n c= 0. 27

                                                                                                       30
                            0 0                1               2               3
                                                                                                                                      0.16
                            10             10          10       10                                           200 400 600 800 1000
                                    log size of signal subgraph                                           size of signal subgraph
                                     some coherent estimators                                          zoomed in coherent estimator
                                                                                                                                      0.5
lassification rate




                          0.5
                                                                                   star−vertices




                                                                                                       15
                                                                                                                                      0.4
                                                                                                       18
                         0.25                                                                                                         0.3
                         0.16                                                                          21
6. Check Assumptions:
Synthetic Data Analysis
                 Correlation Matrix
                                       1


           100                         0.5
  vertex




                                       0
           200

                                       −0.5
           300

                                       −1
                  100     200    300
                        vertex
7. Extensions


• relax the independent edge assumption
• relax binary edge assumption
Discussion


• 83%  75%
• yay statistical modeling!
Q(A)


• anything?
4. Construct a Decision Rule:
 Signal Subgraph Estimation
 •   for each edge, we compute the significance of
     the difference between the two classes using
     Fisher’s exact test
 •   the incoherent signal subgraph estimator finds
     the s edges that are most significant
 •   the coherent signal subgraph estimator finds the
     s edges that are most significant incident to m
     vertices
4. Construct a Decision Rule:
 Signal Subgraph Estimation

                   negative log                       incoherent                    coherent
                significance matrix                    estimate                     estimate




                                                                   # correct = 15
                                      # correct = 7
           20
  vertex
  n=64




           40

           60

                   20    40    60                                                              −4.4   −1.
                      vertex
6. Check Assumptions:
                                          incoherent estimator                                                     coherent estimator
                              1
misclassification rate




                                                                                   # star−vertices
                         0.75                                                                                                                             0.7
                                                                                                       10



                         0.25
                             0.5
                                       Synthetic Data Analysis                                         20                                                 0.5


                                                                                                       30                                                 0.3
                              0                                                                                                                           0.18
                                0                 1            2          3                                        200     400    600   800 1000
                              10                10         10           10
                                       log size of signal subgraph                                               size of signal subgraph

                              1                                                                        0.5




                                                                              misclassification rate
          missed−edge rate




                                                                                                                                                    coh
                                                                                                       0.4                                          inc
                                                                                                       0.3                                          nb
                             0.5
                                                                                                       0.2

                                                                                                       0.1

                              0
                                   0     20          40   60       80   100                                  0       20          40     60     80         100
                                              # training samples                                                          # training samples

More Related Content

PDF
White.p.johnson.k
PDF
Ei09 Opposite Green
PDF
Model-based analysis using generative embedding
PDF
Effective and Efficient Shape-Based Pattern Detection over Streaming Time Series
PDF
TALAT Lecture 2301: Design of Members Example 8.1: Torsion constants for open...
PDF
Open Source in Quant Finance - xlwings
PPTX
Multiple sclerosis as a simultaneous "2 components" disease
PPTX
how neurons connect to each others?
White.p.johnson.k
Ei09 Opposite Green
Model-based analysis using generative embedding
Effective and Efficient Shape-Based Pattern Detection over Streaming Time Series
TALAT Lecture 2301: Design of Members Example 8.1: Torsion constants for open...
Open Source in Quant Finance - xlwings
Multiple sclerosis as a simultaneous "2 components" disease
how neurons connect to each others?

Viewers also liked (14)

DOCX
VEU_CST499_FinalReport
PPTX
CMB Final Presentation
PPTX
Slide Presentation
PDF
Innovation Connectomics - Mapping Knowledge Flow in Innovation Networks
PPTX
From Brains to BRAINs: Neuroscience at the Cutting Edge
PDF
Robust Object Recognition with Cortex-Like Mechanisms
PPTX
Open science resources for `Big Data' Analyses of the human connectome
PDF
A tutorial in Connectome Analysis (3) - Marcus Kaiser
PDF
A tutorial in Connectome Analysis (2) - Marcus Kaiser
PPTX
Multiple Sclerosis and pregnancy: Guidelines from the literature
PPTX
Healthy Mind
PDF
Connectomics: Parcellations and Network Analysis Methods
PPTX
Connectome
PPTX
Computational approaches for mapping the human connectome
VEU_CST499_FinalReport
CMB Final Presentation
Slide Presentation
Innovation Connectomics - Mapping Knowledge Flow in Innovation Networks
From Brains to BRAINs: Neuroscience at the Cutting Edge
Robust Object Recognition with Cortex-Like Mechanisms
Open science resources for `Big Data' Analyses of the human connectome
A tutorial in Connectome Analysis (3) - Marcus Kaiser
A tutorial in Connectome Analysis (2) - Marcus Kaiser
Multiple Sclerosis and pregnancy: Guidelines from the literature
Healthy Mind
Connectomics: Parcellations and Network Analysis Methods
Connectome
Computational approaches for mapping the human connectome
Ad

Similar to Connectome Classification: Statistical Connectomics for Analysis of Connectome Data (20)

PDF
Why we don’t know how many colors there are
PPT
Qualifications And Experience Presentation
PDF
Sattose 2011
PDF
Studying the impact of dependency network measures on software quality
DOCX
Est3 tutorial3mejorado
PDF
Iirs Artificial Naural network based Urban growth Modeling
PDF
SPICE MODEL of MA4S111 (Standard Model) in SPICE PARK
PDF
SPICE MODEL of MA4S111 (Standard Model) in SPICE PARK
KEY
Wikipedia ws
KEY
Wikimedia Conference 2009 presentation
PDF
SPICE MODEL of MA4S111 (Professional Model) in SPICE PARK
PDF
SPICE MODEL of MA4S111 (Professional Model) in SPICE PARK
PDF
Session 5
PDF
ST.Monteiro-EmbeddedFeatureSelection.pdf
PPTX
RIT 101: Understanding Scores From MAP
PDF
Color Imaging Lab Research Interests 2010
PDF
Paper and pencil_cosmological_calculator
XLS
Boyles Law Sim Sc08
PPT
Image compression
PDF
MapReduce Debates and Schema-Free
Why we don’t know how many colors there are
Qualifications And Experience Presentation
Sattose 2011
Studying the impact of dependency network measures on software quality
Est3 tutorial3mejorado
Iirs Artificial Naural network based Urban growth Modeling
SPICE MODEL of MA4S111 (Standard Model) in SPICE PARK
SPICE MODEL of MA4S111 (Standard Model) in SPICE PARK
Wikipedia ws
Wikimedia Conference 2009 presentation
SPICE MODEL of MA4S111 (Professional Model) in SPICE PARK
SPICE MODEL of MA4S111 (Professional Model) in SPICE PARK
Session 5
ST.Monteiro-EmbeddedFeatureSelection.pdf
RIT 101: Understanding Scores From MAP
Color Imaging Lab Research Interests 2010
Paper and pencil_cosmological_calculator
Boyles Law Sim Sc08
Image compression
MapReduce Debates and Schema-Free
Ad

Recently uploaded (20)

DOC
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
PPTX
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
PPTX
ACID BASE management, base deficit correction
PPTX
SKIN Anatomy and physiology and associated diseases
PDF
Oral Aspect of Metabolic Disease_20250717_192438_0000.pdf
PPTX
LUNG ABSCESS - respiratory medicine - ppt
PPTX
Respiratory drugs, drugs acting on the respi system
PPTX
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
PPTX
anal canal anatomy with illustrations...
PPTX
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
PDF
شيت_عطا_0000000000000000000000000000.pdf
PPTX
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
PDF
NEET PG 2025 | 200 High-Yield Recall Topics Across All Subjects
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PPTX
Cardiovascular - antihypertensive medical backgrounds
PPTX
Important Obstetric Emergency that must be recognised
PDF
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPT
HIV lecture final - student.pptfghjjkkejjhhge
PPTX
neonatal infection(7392992y282939y5.pptx
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
ACID BASE management, base deficit correction
SKIN Anatomy and physiology and associated diseases
Oral Aspect of Metabolic Disease_20250717_192438_0000.pdf
LUNG ABSCESS - respiratory medicine - ppt
Respiratory drugs, drugs acting on the respi system
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
anal canal anatomy with illustrations...
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
شيت_عطا_0000000000000000000000000000.pdf
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
NEET PG 2025 | 200 High-Yield Recall Topics Across All Subjects
Medical Evidence in the Criminal Justice Delivery System in.pdf
Cardiovascular - antihypertensive medical backgrounds
Important Obstetric Emergency that must be recognised
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
HIV lecture final - student.pptfghjjkkejjhhge
neonatal infection(7392992y282939y5.pptx

Connectome Classification: Statistical Connectomics for Analysis of Connectome Data

  • 1. Connectome Classification: Statistical Connectomics for Analysis of Connectome Data Joshua T. Vogelstein, PhD d: Applied Math. & Stats u: Johns Hopkins w: jovo.me e: joshuav@jhu.edu
  • 2. Statistical Connectomics Statistics “the art of data collection and analysis” Connectomics “the study of connectomes” Statistical “the art of connectome data collection Connectomics and analysis”
  • 3. Contributors Stats Carey E. Priebe Data Collection Glen A. Coppersmith Susan Resnick Mark Dredze Connectome Inference Will R. Gray Wisdom John Bogovic R. Jacob Vogelstein Jerry Prince Support: various grants
  • 4. Simplest. Example. Ever. Blind People Deaf People V1 V1 A1 M1 A1 M1
  • 5. Simplest. Example. Ever. Blind People Deaf People V1 No possible classifier V1 based on graph invariants can perform this insanely simple classification A1 M1 A1 M1 problem!!!
  • 6. Realest. Example. Ever. MR Connectome Gender Classification statistical graph model graph invariants > 83% accuracy < 75% accuracy
  • 7. Statistical Connectomics 1. Collect Data Multi-Modal MR Imaging 2. Preprocess Data MR Connectome Pipeline 3. Assumptions Signal Subgraph 4. Construct a Decision Rule Robust Bayes Plugin Classifier 5. Evaluate Performance Leave-One-Out X-Validation 6. Check Assumptions Synthetic Data Analysis 7. Extensions Relax assumptions
  • 8. Statistical Connectomics 1. Collect Data Multi-Modal MR Imaging 2. Preprocess Data MR Connectome Pipeline 3. Assumptions Signal Subgraph 4. Construct a Decision Rule Robust Bayes Plugin Classifier 5. Evaluate Performance Leave-One-Out X-Validation 6. Check Assumptions Synthetic Data Analysis 7. Extensions Relax assumptions
  • 9. 1. Collect Data: Multi-Modal MR Imaging • 49 senior individuals; 25 male, 24 female • diffusion: standard DTI protocol • structural: standard MPRAGE protocol
  • 10. 2. Preprocess Data: MR Connectome Automated Pipeline • coherent collection of code • fully automatic and modular • about 12 hrs/subject/core • yields 70 vertex graph/subject http://www.nitrc.org/projects/mrcap/
  • 11. 3. Data Assumptions: Signal Subgraph
  • 12. 4. Construct a Decision Rule: Robust Bayes Plugin Classifier • asymptotically optimal and robust • finite sample niceness auv 1−auv y= ˆ puv|y (1 ˆ − puv|y ) ˆ πy ˆ ˆ (u,v)∈S
  • 13. 5. Evaluate Performance: Leave-One-Out X-Validation incoherent estimator coherent estimator 0.5 misclassification rate # signal−vertices 0.5 L π ˆˆ = 0. 5 ˆ L n b = 0. 41 ˆ L c o h= 0. 16 10 0.4 0.25 20 0.3 ˆ L i n c= 0. 27 30 0 0 1 2 3 0.16 10 10 10 10 200 400 600 800 1000 log size of signal subgraph size of signal subgraph some coherent estimators zoomed in coherent estimator 0.5 lassification rate 0.5 star−vertices 15 0.4 18 0.25 0.3 0.16 21
  • 14. 6. Check Assumptions: Synthetic Data Analysis Correlation Matrix 1 100 0.5 vertex 0 200 −0.5 300 −1 100 200 300 vertex
  • 15. 7. Extensions • relax the independent edge assumption • relax binary edge assumption
  • 16. Discussion • 83% 75% • yay statistical modeling!
  • 18. 4. Construct a Decision Rule: Signal Subgraph Estimation • for each edge, we compute the significance of the difference between the two classes using Fisher’s exact test • the incoherent signal subgraph estimator finds the s edges that are most significant • the coherent signal subgraph estimator finds the s edges that are most significant incident to m vertices
  • 19. 4. Construct a Decision Rule: Signal Subgraph Estimation negative log incoherent coherent significance matrix estimate estimate # correct = 15 # correct = 7 20 vertex n=64 40 60 20 40 60 −4.4 −1. vertex
  • 20. 6. Check Assumptions: incoherent estimator coherent estimator 1 misclassification rate # star−vertices 0.75 0.7 10 0.25 0.5 Synthetic Data Analysis 20 0.5 30 0.3 0 0.18 0 1 2 3 200 400 600 800 1000 10 10 10 10 log size of signal subgraph size of signal subgraph 1 0.5 misclassification rate missed−edge rate coh 0.4 inc 0.3 nb 0.5 0.2 0.1 0 0 20 40 60 80 100 0 20 40 60 80 100 # training samples # training samples