SlideShare a Scribd company logo
Sandeep Sharma ROC curve 2013
Procedure below:
1. convert number representation to number format using excel properties
2. copy data from data sheet into
Genuine into column Genuine and i16.dat imposter in column imposter
3. Classify genuine and imposter into range in this case 0-0.1,0.1-0.2 etc
As shown in excel sheet column D GenuineClass, ImposterClass
Formulae used:
For GenuineClass (range in column D in excel sheet, based on Genuine Score of Column B)
=IF(AND(B:B>=0,B:B<=0.1),"0.1" ,IF(AND(B:B>0.1,B:B<=0.2),"0.2" ,IF(AND(B:B>0.2,B:B<=0.3),"0.3",
IF(AND(B:B>0.3,B:B<=0.4),"0.4", IF(AND(B:B>0.4,B:B<=0.5),"0.5", IF(AND(B:B>0.5,B:B<=0.6),"0.6",
IF(AND(B:B>0.6,B:B<=0.7),"0.7", IF(AND(B:B>0.7,B:B<=0.8),"0.8", IF(AND(B:B>0.8,B:B<=0.9),"0.9",
"0.1")))))))))
For GenuineClass (range in column C in excel sheet, based on Imposter Score of Column C)
=IF(AND(C:C>=0,C:C<=0.1),"0.1" ,IF(AND(C:C>0.1,C:C<=0.2),"0.2" ,IF(AND(C:C>0.2,C:C<=0.3),"0.3",
IF(AND(C:C>0.3,C:C<=0.4),"0.4", IF(AND(C:C>0.4,C:C<=0.5),"0.5", IF(AND(C:C>0.5,C:C<=0.6),"0.6",
IF(AND(C:C>0.6,C:C<=0.7),"0.7", IF(AND(C:C>0.7,C:C<=0.8),"0.8", IF(AND(C:C>0.8,C:C<=0.9),"0.9",
"0.1")))))))))
4. Count the classification for each Imposter and Genuine
Imposter shown in Column F
0
200
400
600
800
1000
1200
0 0.2 0.4 0.6 0.8 1 1.2
Diagonal
Genuine
imposter
Sandeep Sharma ROC curve 2013
For 0.0 =COUNTIF(E:E,"0.0") , For 0.1 =COUNTIF(E:E,"0.1")……
Genuine:
For 0.1 =COUNTIF(D:D,"0.1"), for 0.2 =COUNTIF(D:D,"0.2") , and so on..
Draw Scatter curve : Series Diagonal, plot values (Scale X,ScaleY) as below
scaleX scaleY
0 0
0.1 110
0.2 220
0.3 330
0.4 440
0.5 550
0.6 660
0.7 770
0.8 880
0.9 990
1 1100
Similary draw values of imposter and Genuine as counted :
So Add 2 more series by name Imposter, Genuine
X -
Axis
Y-axis
imposter scaleX scaleY genuine
0 0 0 0
374 0.1 110 53
175 0.2 220 46
185 0.3 330 65
155 0.4 440 82
73 0.5 550 135
30 0.6 660 166
7 0.7 770 185
1 0.8 880 206
0 0.9 990 195
0 1 1100 1067
Sum:1000 5.5 6050 2200
We get following curve:
Sandeep Sharma ROC curve 2013
Question 2:
FP Rate = count(interval FP)/Total FP
TP Rate= count(interval TP count)/Total TP
We get this table Using this score we calculate accumulated FP, TP rate
scaleX scaleY FPRate TPRate imposter genuine Acc FP Acc TP
0 0 0 0 0 0 0 0
0.1 0.1 0.374 0.02409 374 53 0.374 0.024091
0.2 0.2 0.175 0.02091 175 46 0.549 0.045
0.3 0.3 0.185 0.02955 185 65 0.734 0.074545
0.4 0.4 0.155 0.03727 155 82 0.889 0.111818
0.5 0.5 0.073 0.06136 73 135 0.962 0.173182
0.6 0.6 0.03 0.07545 30 166 0.992 0.248636
0.7 0.7 0.007 0.08409 7 185 0.999 0.332727
0.8 0.8 0.001 0.09364 1 206 1 0.426364
0.9 0.9 0 0.08864 0 195 1 0.515
1 1 0 0.485 0 1067 1 1
1000 2200 1 1
We get curve:
0
200
400
600
800
1000
1200
0 0.2 0.4 0.6 0.8 1 1.2
Diagonal
Genuine
imposter
Sandeep Sharma ROC curve 2013
ROC Curve Question 2
Acc FP Acc TP
0 0
0.374 0.024091
0.549 0.045
0.734 0.074545
0.889 0.111818
0.962 0.173182
0.992 0.248636
0.999 0.332727
1 0.426364
1 0.515
1 1
1 1
0
0.2
0.4
0.6
0.8
1
1.2
0 0.2 0.4 0.6 0.8 1 1.2
Diagonal
FP Rate
TP Rate
Sandeep Sharma ROC curve 2013
3. Determine the EER
EER = nm/PN
The point where the FMR curve and FNMR curve intersect is known as the equal errorrate (EER). It is
called this because at this threshold, the FMR and FNMR are equal.With respect to the score
distributions, the EER occurs at the threshold where thearea under (i.e. the integration of) the genuine
distribution<t equals the area under theimpostor distribution≥t.
Point (0.1,0.9)
y scaleX scaleY FP Rate TP Rate
1 0 0 0 0
0
0.2
0.4
0.6
0.8
1
1.2
0 0.2 0.4 0.6 0.8 1 1.2
Acc TP
diagonal
0
0.2
0.4
0.6
0.8
1
1.2
0 0.2 0.4 0.6 0.8 1 1.2
ROC curve
diagonal
rev
Sandeep Sharma ROC curve 2013
0.9 0.1 0.1 0.374 0.0240909
0.8 0.2 0.2 0.549 0.045
0.7 0.3 0.3 0.734 0.0745455
0.6 0.4 0.4 0.889 0.1118182
0.5 0.5 0.5 0.962 0.1731818
0.4 0.6 0.6 0.992 0.2486364
0.3 0.7 0.7 0.999 0.3327273
0.2 0.8 0.8 1 0.4263636
0.1 0.9 0.9 1 0.515
0 1 1 1 1
4. If the cost of a false accept is 10 euro and a false reject is 30 euro, estimate a suitable
operating point on the ROC curve that minimises the overall cost. Assume equal apriori
probabilities.
scaleX scaleY FPRate TPRate imposter genuine Acc FP Acc TP
0 0 0 0 0 0 0 0
0.1 0.1 0.374 0.02409 374 53 0.374 0.024091
0.2 0.2 0.175 0.02091 175 46 0.549 0.045
0.3 0.3 0.185 0.02955 185 65 0.734 0.074545
0.4 0.4 0.155 0.03727 155 82 0.889 0.111818
0.5 0.5 0.073 0.06136 73 135 0.962 0.173182
0.6 0.6 0.03 0.07545 30 166 0.992 0.248636
0.7 0.7 0.007 0.08409 7 185 0.999 0.332727
0.8 0.8 0.001 0.09364 1 206 1 0.426364
0.9 0.9 0 0.08864 0 195 1 0.515
1 1 0 0.485 0 1067 1 1
1000 2200 1 1
Sandeep Sharma ROC curve 2013
= Consider a scenario in
which negatives outnumber positives by 10 to 1, but false
positives and false negatives have equal cost. By Eq. (1)
m = 10, and the most northwest line of slope m = 10 is a,
tangent to classifier A, which would be the best performing
classifier for these conditions.
Consider another scenario in which the positive and
negative example populations are evenly balanced but a
false negative is 10 times as expensive as a false positive.
By Eq. (1) m = 1/10. The most northwest line of slope 1/
10 would be line b, tangent to classifier C. C is the optimal
classifier for these conditions.
0
0.2
0.4
0.6
0.8
1
1.2
0 0.2 0.4 0.6 0.8 1 1.2
FRR
FAR

More Related Content

PDF
SF20LC30 LTspice Model (Free SPICE Model)
PDF
SPICE MODEL of 1N5818 (Standard Model) in SPICE PARK
PDF
SPICE MODEL of HRC0203C (Standard Model) in SPICE PARK
PDF
SPICE MODEL of 1SS367 (Standard Model) in SPICE PARK
PPTX
Device Modeling of 3INPUT COMPARATOR using PSpice
PDF
SCS110AG (LTspice Model)
PDF
SPICE MODEL of S25SC6M (Standard Model) in SPICE PARK
PDF
SPICE MODEL of DF30PC3M (Standard Model) in SPICE PARK
SF20LC30 LTspice Model (Free SPICE Model)
SPICE MODEL of 1N5818 (Standard Model) in SPICE PARK
SPICE MODEL of HRC0203C (Standard Model) in SPICE PARK
SPICE MODEL of 1SS367 (Standard Model) in SPICE PARK
Device Modeling of 3INPUT COMPARATOR using PSpice
SCS110AG (LTspice Model)
SPICE MODEL of S25SC6M (Standard Model) in SPICE PARK
SPICE MODEL of DF30PC3M (Standard Model) in SPICE PARK

What's hot (7)

PDF
SPICE MODEL of 1SS394 (Standard Model) in SPICE PARK
PDF
20GL2C41A 150C PSpice Model (Free SPICE Model)
PDF
20DL2CZ47A PSpice Model (Free SPICE Model)
PDF
20GL2C41A 110C PSpice Model (Free SPICE Model)
PDF
SPICE MODEL of DC005-06 , PSpice Model in SPICE PARK
PDF
SPICE MODEL of AM-1820 , PSpice Model in SPICE PARK
PDF
SPICE MODEL of 30GWJ2CZ47C (Standard Model) in SPICE PARK
SPICE MODEL of 1SS394 (Standard Model) in SPICE PARK
20GL2C41A 150C PSpice Model (Free SPICE Model)
20DL2CZ47A PSpice Model (Free SPICE Model)
20GL2C41A 110C PSpice Model (Free SPICE Model)
SPICE MODEL of DC005-06 , PSpice Model in SPICE PARK
SPICE MODEL of AM-1820 , PSpice Model in SPICE PARK
SPICE MODEL of 30GWJ2CZ47C (Standard Model) in SPICE PARK
Ad

Viewers also liked (8)

PPTX
How to read a receiver operating characteritic (ROC) curve
PDF
Receiver Operating Characteristic (ROC) curve analysis. 19.12
PPTX
PPT
Roc Search
PDF
General Introduction to ROC Curves
PDF
A Classification Problem of Credit Risk Rating Investigated and Solved by Opt...
PDF
ID3 Algorithm & ROC Analysis
PPT
05 powerpoint-alessandra young
How to read a receiver operating characteritic (ROC) curve
Receiver Operating Characteristic (ROC) curve analysis. 19.12
Roc Search
General Introduction to ROC Curves
A Classification Problem of Credit Risk Rating Investigated and Solved by Opt...
ID3 Algorithm & ROC Analysis
05 powerpoint-alessandra young
Ad

Similar to Roc curve, analytics (20)

PPT
Lecture11_ Evaluation Metrics for classification.ppt
PPTX
Module 3_ Classification.pptx
PDF
modelperfcheatsheet.pdf
PDF
3Assessing classification performance.pdf
PPTX
Roc auc curve
DOCX
BIometrics- plotting DET and EER curve using Matlab
PDF
Side 2019 #8
PPTX
Introduction to ROC Curve Analysis with Application in Functional Genomics
PPTX
Data mining model
PDF
Ways to evaluate a machine learning model’s performance
PPTX
Classification Evaluation Metrics (2).pptx
PPTX
r_concepts
PPTX
Insurance Fraud Claims Detection
PPTX
AUC: at what cost(s)?
PPTX
Model Evaluation Matrix: Confusion Matrix, F1 Score, ROC curve AUC
PPTX
ROC Curve 101
PDF
Noorbehbahani classification evaluation measure
PPTX
MLA_Confusion Matrix for Classification
PPTX
Classification Assessment Methods.pptx
PPT
classifier_evaluation_lecture_ai_101.ppt
Lecture11_ Evaluation Metrics for classification.ppt
Module 3_ Classification.pptx
modelperfcheatsheet.pdf
3Assessing classification performance.pdf
Roc auc curve
BIometrics- plotting DET and EER curve using Matlab
Side 2019 #8
Introduction to ROC Curve Analysis with Application in Functional Genomics
Data mining model
Ways to evaluate a machine learning model’s performance
Classification Evaluation Metrics (2).pptx
r_concepts
Insurance Fraud Claims Detection
AUC: at what cost(s)?
Model Evaluation Matrix: Confusion Matrix, F1 Score, ROC curve AUC
ROC Curve 101
Noorbehbahani classification evaluation measure
MLA_Confusion Matrix for Classification
Classification Assessment Methods.pptx
classifier_evaluation_lecture_ai_101.ppt

More from Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW (20)

PDF
Management Consultancy Saudi Telecom Digital Transformation Design Thinking
PPTX
Digital transformation journey Consulting
DOCX
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
DOCX
Risk management Consulting For Municipality
DOCX
GDPR And Privacy By design Consultancy
PPTX
Real implementation Blockchain Best Use Cases Examples
DOCX
Biztalk architecture for Configured SMS service
DOCX
DOCX
Cloud manager client provisioning guideline draft 1.0
PPTX
DOCX
Government Digital transformation trend draft 1.0
DOCX
Enterprise architecture maturity rating draft 1.0
DOCX
Organisation Structure For digital Transformation Team
Management Consultancy Saudi Telecom Digital Transformation Design Thinking
Digital transformation journey Consulting
Lnt and bbby Retail Houseare industry Case assignment sandeep sharma
Risk management Consulting For Municipality
GDPR And Privacy By design Consultancy
Real implementation Blockchain Best Use Cases Examples
Biztalk architecture for Configured SMS service
Cloud manager client provisioning guideline draft 1.0
Government Digital transformation trend draft 1.0
Enterprise architecture maturity rating draft 1.0
Organisation Structure For digital Transformation Team

Recently uploaded (20)

PDF
[EN] Industrial Machine Downtime Prediction
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
How to run a consulting project- client discovery
PDF
annual-report-2024-2025 original latest.
PPTX
A Complete Guide to Streamlining Business Processes
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPT
DATA COLLECTION METHODS-ppt for nursing research
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Managing Community Partner Relationships
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
[EN] Industrial Machine Downtime Prediction
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Acceptance and paychological effects of mandatory extra coach I classes.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
STERILIZATION AND DISINFECTION-1.ppthhhbx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
How to run a consulting project- client discovery
annual-report-2024-2025 original latest.
A Complete Guide to Streamlining Business Processes
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
DATA COLLECTION METHODS-ppt for nursing research
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
climate analysis of Dhaka ,Banglades.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Data_Analytics_and_PowerBI_Presentation.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Managing Community Partner Relationships
Pilar Kemerdekaan dan Identi Bangsa.pptx
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
SAP 2 completion done . PRESENTATION.pptx

Roc curve, analytics

  • 1. Sandeep Sharma ROC curve 2013 Procedure below: 1. convert number representation to number format using excel properties 2. copy data from data sheet into Genuine into column Genuine and i16.dat imposter in column imposter 3. Classify genuine and imposter into range in this case 0-0.1,0.1-0.2 etc As shown in excel sheet column D GenuineClass, ImposterClass Formulae used: For GenuineClass (range in column D in excel sheet, based on Genuine Score of Column B) =IF(AND(B:B>=0,B:B<=0.1),"0.1" ,IF(AND(B:B>0.1,B:B<=0.2),"0.2" ,IF(AND(B:B>0.2,B:B<=0.3),"0.3", IF(AND(B:B>0.3,B:B<=0.4),"0.4", IF(AND(B:B>0.4,B:B<=0.5),"0.5", IF(AND(B:B>0.5,B:B<=0.6),"0.6", IF(AND(B:B>0.6,B:B<=0.7),"0.7", IF(AND(B:B>0.7,B:B<=0.8),"0.8", IF(AND(B:B>0.8,B:B<=0.9),"0.9", "0.1"))))))))) For GenuineClass (range in column C in excel sheet, based on Imposter Score of Column C) =IF(AND(C:C>=0,C:C<=0.1),"0.1" ,IF(AND(C:C>0.1,C:C<=0.2),"0.2" ,IF(AND(C:C>0.2,C:C<=0.3),"0.3", IF(AND(C:C>0.3,C:C<=0.4),"0.4", IF(AND(C:C>0.4,C:C<=0.5),"0.5", IF(AND(C:C>0.5,C:C<=0.6),"0.6", IF(AND(C:C>0.6,C:C<=0.7),"0.7", IF(AND(C:C>0.7,C:C<=0.8),"0.8", IF(AND(C:C>0.8,C:C<=0.9),"0.9", "0.1"))))))))) 4. Count the classification for each Imposter and Genuine Imposter shown in Column F 0 200 400 600 800 1000 1200 0 0.2 0.4 0.6 0.8 1 1.2 Diagonal Genuine imposter
  • 2. Sandeep Sharma ROC curve 2013 For 0.0 =COUNTIF(E:E,"0.0") , For 0.1 =COUNTIF(E:E,"0.1")…… Genuine: For 0.1 =COUNTIF(D:D,"0.1"), for 0.2 =COUNTIF(D:D,"0.2") , and so on.. Draw Scatter curve : Series Diagonal, plot values (Scale X,ScaleY) as below scaleX scaleY 0 0 0.1 110 0.2 220 0.3 330 0.4 440 0.5 550 0.6 660 0.7 770 0.8 880 0.9 990 1 1100 Similary draw values of imposter and Genuine as counted : So Add 2 more series by name Imposter, Genuine X - Axis Y-axis imposter scaleX scaleY genuine 0 0 0 0 374 0.1 110 53 175 0.2 220 46 185 0.3 330 65 155 0.4 440 82 73 0.5 550 135 30 0.6 660 166 7 0.7 770 185 1 0.8 880 206 0 0.9 990 195 0 1 1100 1067 Sum:1000 5.5 6050 2200 We get following curve:
  • 3. Sandeep Sharma ROC curve 2013 Question 2: FP Rate = count(interval FP)/Total FP TP Rate= count(interval TP count)/Total TP We get this table Using this score we calculate accumulated FP, TP rate scaleX scaleY FPRate TPRate imposter genuine Acc FP Acc TP 0 0 0 0 0 0 0 0 0.1 0.1 0.374 0.02409 374 53 0.374 0.024091 0.2 0.2 0.175 0.02091 175 46 0.549 0.045 0.3 0.3 0.185 0.02955 185 65 0.734 0.074545 0.4 0.4 0.155 0.03727 155 82 0.889 0.111818 0.5 0.5 0.073 0.06136 73 135 0.962 0.173182 0.6 0.6 0.03 0.07545 30 166 0.992 0.248636 0.7 0.7 0.007 0.08409 7 185 0.999 0.332727 0.8 0.8 0.001 0.09364 1 206 1 0.426364 0.9 0.9 0 0.08864 0 195 1 0.515 1 1 0 0.485 0 1067 1 1 1000 2200 1 1 We get curve: 0 200 400 600 800 1000 1200 0 0.2 0.4 0.6 0.8 1 1.2 Diagonal Genuine imposter
  • 4. Sandeep Sharma ROC curve 2013 ROC Curve Question 2 Acc FP Acc TP 0 0 0.374 0.024091 0.549 0.045 0.734 0.074545 0.889 0.111818 0.962 0.173182 0.992 0.248636 0.999 0.332727 1 0.426364 1 0.515 1 1 1 1 0 0.2 0.4 0.6 0.8 1 1.2 0 0.2 0.4 0.6 0.8 1 1.2 Diagonal FP Rate TP Rate
  • 5. Sandeep Sharma ROC curve 2013 3. Determine the EER EER = nm/PN The point where the FMR curve and FNMR curve intersect is known as the equal errorrate (EER). It is called this because at this threshold, the FMR and FNMR are equal.With respect to the score distributions, the EER occurs at the threshold where thearea under (i.e. the integration of) the genuine distribution<t equals the area under theimpostor distribution≥t. Point (0.1,0.9) y scaleX scaleY FP Rate TP Rate 1 0 0 0 0 0 0.2 0.4 0.6 0.8 1 1.2 0 0.2 0.4 0.6 0.8 1 1.2 Acc TP diagonal 0 0.2 0.4 0.6 0.8 1 1.2 0 0.2 0.4 0.6 0.8 1 1.2 ROC curve diagonal rev
  • 6. Sandeep Sharma ROC curve 2013 0.9 0.1 0.1 0.374 0.0240909 0.8 0.2 0.2 0.549 0.045 0.7 0.3 0.3 0.734 0.0745455 0.6 0.4 0.4 0.889 0.1118182 0.5 0.5 0.5 0.962 0.1731818 0.4 0.6 0.6 0.992 0.2486364 0.3 0.7 0.7 0.999 0.3327273 0.2 0.8 0.8 1 0.4263636 0.1 0.9 0.9 1 0.515 0 1 1 1 1 4. If the cost of a false accept is 10 euro and a false reject is 30 euro, estimate a suitable operating point on the ROC curve that minimises the overall cost. Assume equal apriori probabilities. scaleX scaleY FPRate TPRate imposter genuine Acc FP Acc TP 0 0 0 0 0 0 0 0 0.1 0.1 0.374 0.02409 374 53 0.374 0.024091 0.2 0.2 0.175 0.02091 175 46 0.549 0.045 0.3 0.3 0.185 0.02955 185 65 0.734 0.074545 0.4 0.4 0.155 0.03727 155 82 0.889 0.111818 0.5 0.5 0.073 0.06136 73 135 0.962 0.173182 0.6 0.6 0.03 0.07545 30 166 0.992 0.248636 0.7 0.7 0.007 0.08409 7 185 0.999 0.332727 0.8 0.8 0.001 0.09364 1 206 1 0.426364 0.9 0.9 0 0.08864 0 195 1 0.515 1 1 0 0.485 0 1067 1 1 1000 2200 1 1
  • 7. Sandeep Sharma ROC curve 2013 = Consider a scenario in which negatives outnumber positives by 10 to 1, but false positives and false negatives have equal cost. By Eq. (1) m = 10, and the most northwest line of slope m = 10 is a, tangent to classifier A, which would be the best performing classifier for these conditions. Consider another scenario in which the positive and negative example populations are evenly balanced but a false negative is 10 times as expensive as a false positive. By Eq. (1) m = 1/10. The most northwest line of slope 1/ 10 would be line b, tangent to classifier C. C is the optimal classifier for these conditions. 0 0.2 0.4 0.6 0.8 1 1.2 0 0.2 0.4 0.6 0.8 1 1.2 FRR FAR