Logistic Regression
Legal Notices and Disclaimers
This presentation is for informational purposes only. INTEL MAKES NO WARRANTIES,
EXPRESS OR IMPLIED, IN THIS SUMMARY.
Intel technologies’ features and benefits depend on system configuration and may require
enabled hardware, software or service activation. Performance varies depending on system
configuration. Check with your system manufacturer or retailer or learn more at intel.com.
This sample source code is released under the Intel Sample Source Code License Agreement.
Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.
*Other names and brands may be claimed as the property of others.
Copyright © 2017, Intel Corporation. All rights reserved.
Introduction to Logistic Regression
[Figure: patient status after five years (survived vs. lost) plotted against the number of positive nodes]
Linear Regression for Classification?
[Figure: patient status (survived vs. lost) vs. number of positive nodes, with a fitted regression line]
$y_\beta(x) = \beta_0 + \beta_1 x + \varepsilon$
Linear Regression for Classification?
Encode the labels numerically: survived = 0.0, lost = 1.0.
$y_\beta(x) = \beta_0 + \beta_1 x + \varepsilon$
If the model result > 0.5: predict lost.
If the model result < 0.5: predict survived.
[Figure: the fitted line crosses 0.5 at the decision threshold; points to its left are predicted 0 (survived), points to its right are predicted 1 (lost)]
What is this Function?
[Figure: S-shaped curve rising from 0.0 to 1.0 as x runs from −10 to 10]
$y = \frac{1}{1 + e^{-x}}$
This is the logistic (sigmoid) function: it maps any real input to a value between 0 and 1.
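To make the curve concrete, here is a minimal sketch (NumPy assumed; the sample points are illustrative, not from the deck) that evaluates the logistic function at a few values from the plot's range:

import numpy as np

def sigmoid(x):
    # Logistic (sigmoid) function: maps any real number into (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

for x in [-10, -5, 0, 5, 10]:
    print(f"x = {x:>3}: y = {sigmoid(x):.4f}")
# x = -10 gives ~0.0000, x = 0 gives exactly 0.5, x = 10 gives ~1.0000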
The Decision Boundary
[Figure: logistic curve fitted to the survived (0.0) / lost (1.0) data; the decision boundary is where the curve crosses 0.5]
$y_\beta(x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x + \varepsilon)}}$
Logistic Regression
[Figure: the same logistic fit, now labeled as the logistic regression model]
$y_\beta(x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x + \varepsilon)}}$
Relationship of Logistic to Linear Regression
Logistic function:
$P(x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x + \varepsilon)}}$
Equivalently:
$P(x) = \frac{e^{\beta_0 + \beta_1 x}}{1 + e^{\beta_0 + \beta_1 x}}$
Relationship of Logistic to Linear Regression
Logistic function:
$P(x) = \frac{e^{\beta_0 + \beta_1 x}}{1 + e^{\beta_0 + \beta_1 x}}$
Odds ratio:
$\frac{P(x)}{1 - P(x)} = e^{\beta_0 + \beta_1 x}$
Log odds:
$\log\left(\frac{P(x)}{1 - P(x)}\right) = \beta_0 + \beta_1 x$
The log odds are a linear function of x: the right-hand side is exactly the linear regression model.
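The linearity of the log odds is easy to verify numerically; a minimal sketch (NumPy assumed, coefficients are illustrative):

import numpy as np

beta0, beta1 = -1.0, 0.5                           # illustrative coefficients
x = np.linspace(-5, 5, 11)
p = 1.0 / (1.0 + np.exp(-(beta0 + beta1 * x)))     # logistic function P(x)
log_odds = np.log(p / (1.0 - p))                   # log odds
print(np.allclose(log_odds, beta0 + beta1 * x))    # True: log odds are linear in x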
Classification with Logistic Regression
One feature (nodes); two labels (survived, lost)
[Figure: logistic curve over number of positive nodes with the 0.5 threshold marked]
Classification with Logistic Regression
Two features (nodes, age); two labels (survived, lost)
[Figure: age vs. number of malignant nodes; a linear decision boundary separates the two classes, and a new example is classified by which side of the boundary it falls on]
Multiclass Classification with Logistic Regression
Two features (nodes, age); three labels (survived, complications, lost)
[Figure: age vs. number of malignant nodes with three class regions]
One vs All
Fit one binary classifier per class:
• Survived vs. all
• Complications vs. all
• Lost vs. all
Multiclass Decision Boundary
Assign the most probable class to each region.
[Figure: the three one-vs-all boundaries partition the age/nodes plane into class regions]
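scikit-learn can fit this one-vs-all scheme directly. A minimal sketch, assuming illustrative feature rows of (number of malignant nodes, age) and made-up labels; OneVsRestClassifier is a real sklearn wrapper, but none of the data here comes from the deck itself:

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

# Illustrative data: columns are (number of malignant nodes, age)
X = np.array([[0, 35], [1, 42], [3, 50], [8, 61], [12, 55], [20, 70]])
y = np.array(['survived', 'survived', 'complications', 'complications', 'lost', 'lost'])

# One vs. all: one binary logistic regression per class
clf = OneVsRestClassifier(LogisticRegression()).fit(X, y)
print(clf.predict([[5, 45]]))        # most probable class for a new example
print(clf.predict_proba([[5, 45]]))  # one probability per class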
Logistic Regression: The Syntax
Import the class containing the classification method:
from sklearn.linear_model import LogisticRegression
Create an instance of the class (penalty and C are the regularization parameters; the keyword is a capital C):
LR = LogisticRegression(penalty='l2', C=10.0)
Fit the instance on the data and then predict the expected value:
LR = LR.fit(X_train, y_train)
y_predict = LR.predict(X_test)
Tune regularization parameters with cross-validation: LogisticRegressionCV.
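Putting the pieces together, here is a minimal end-to-end sketch; the synthetic dataset and split are illustrative assumptions, not part of the deck:

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression, LogisticRegressionCV

# Synthetic binary-classification data standing in for the patient example
X, y = make_classification(n_samples=500, n_features=4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

LR = LogisticRegression(penalty='l2', C=10.0)
LR = LR.fit(X_train, y_train)
y_predict = LR.predict(X_test)

# LogisticRegressionCV searches a grid of C values with cross-validation
LR_cv = LogisticRegressionCV(Cs=10, cv=5).fit(X_train, y_train)
print(LR_cv.C_)  # best regularization strength found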
Classification Error Metrics
Choosing the Right Error Measurement
• You are asked to build a classifier for leukemia
• Training data: 1% of patients have leukemia, 99% are healthy
• Measure accuracy: the total % of predictions that are correct
• Build a simple model that always predicts "healthy"
• Accuracy will be 99%... yet the model never identifies a single leukemia patient
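The failure is easy to reproduce; a minimal sketch with a simulated 1%/99% split:

import numpy as np

y_true = np.array([1] * 10 + [0] * 990)  # 1% leukemia (1), 99% healthy (0)
y_pred = np.zeros_like(y_true)           # a model that always predicts "healthy"
print(f"accuracy = {(y_pred == y_true).mean():.0%}")  # 99%, with zero cases detected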
Confusion Matrix

                   Predicted Positive     Predicted Negative
Actual Positive    True Positive (TP)     False Negative (FN)
Actual Negative    False Positive (FP)    True Negative (TN)

False positives are Type I errors; false negatives are Type II errors.
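scikit-learn builds this table directly. One caution worth stating: confusion_matrix orders rows and columns by sorted label value, so with 0/1 labels the layout is [[TN, FP], [FN, TP]] rather than the slide's positive-first arrangement:

from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 0, 0, 0, 0]  # illustrative labels
y_pred = [1, 1, 0, 0, 0, 1, 0]  # illustrative predictions
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tp, fn, fp, tn)  # 2 true positives, 1 false negative, 1 false positive, 3 true negatives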
Accuracy: Predicting Correctly
Accuracy = (TP + TN) / (TP + FN + FP + TN)
Recall: Identifying All Positive Instances
Recall (Sensitivity) = TP / (TP + FN)
Precision: Identifying Only Positive Instances
Precision = TP / (TP + FP)
Specificity: Avoiding False Alarms
Specificity = TN / (FP + TN)
Error Measurements
Accuracy = (TP + TN) / (TP + FN + FP + TN)
Precision = TP / (TP + FP)
Recall (Sensitivity) = TP / (TP + FN)
Specificity = TN / (FP + TN)
F1 = 2 * (Precision * Recall) / (Precision + Recall)
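A minimal sketch computing all five measures from raw counts (the counts are made up for illustration):

tp, fn, fp, tn = 40, 10, 20, 30  # illustrative counts

accuracy    = (tp + tn) / (tp + fn + fp + tn)
recall      = tp / (tp + fn)               # sensitivity
precision   = tp / (tp + fp)
specificity = tn / (fp + tn)
f1          = 2 * precision * recall / (precision + recall)
print(accuracy, recall, precision, specificity, f1)  # 0.7 0.8 0.666... 0.6 0.727...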
Receiver Operating Characteristic (ROC)
Evaluation of the model at all possible thresholds
[Figure: True Positive Rate (Sensitivity) vs. False Positive Rate (1 – Specificity); the diagonal is a random guess, curves above it are better and below it worse, and a perfect model reaches the top-left corner]
Area Under Curve (AUC)
Measures the total area under the ROC curve
[Figure: example ROC curves with AUC = 0.5 (random), 0.75, and 0.9]
Precision-Recall Curve (PR Curve)
Measures the trade-off between precision and recall
[Figure: precision vs. recall curves for Model 1 and Model 2]
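All three curves come from sweeping a threshold over the model's predicted probabilities. A minimal sketch, reusing the LR model and test split from the hypothetical pipeline sketched earlier:

from sklearn.metrics import roc_curve, roc_auc_score, precision_recall_curve

y_score = LR.predict_proba(X_test)[:, 1]  # probability of the positive class

fpr, tpr, roc_thresholds = roc_curve(y_test, y_score)  # points on the ROC curve
auc = roc_auc_score(y_test, y_score)                   # area under the ROC curve
prec, rec, pr_thresholds = precision_recall_curve(y_test, y_score)
print(f"AUC = {auc:.3f}")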
Multiple Class Error Metrics

                  Predicted Class 1   Predicted Class 2   Predicted Class 3
Actual Class 1    TP1                 incorrect           incorrect
Actual Class 2    incorrect           TP2                 incorrect
Actual Class 3    incorrect           incorrect           TP3

Off-diagonal cells are incorrect classifications.
Accuracy = (TP1 + TP2 + TP3) / Total
Most multiclass error metrics are similar to their binary versions: just expand the elements as a sum.
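The same expansion in code; a minimal sketch with illustrative three-class labels:

import numpy as np
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 1, 2, 2, 2, 2]  # illustrative labels
y_pred = [0, 1, 1, 1, 2, 2, 2, 0, 2]  # illustrative predictions
cm = confusion_matrix(y_true, y_pred)
print(cm)
print(np.trace(cm) / cm.sum())  # (TP1 + TP2 + TP3) / Total = 6/9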
Classification Error Metrics: The Syntax
Import the desired error function:
from sklearn.metrics import accuracy_score
Calculate the error on the test and predicted data sets:
accuracy_value = accuracy_score(y_test, y_pred)
Lots of other error metrics and diagnostic tools:
from sklearn.metrics import precision_score, recall_score, f1_score, roc_auc_score, confusion_matrix, roc_curve, precision_recall_curve
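For the per-class metrics, multiclass use needs an averaging rule; a brief sketch reusing y_true and y_pred from the three-class example above (the average options shown are standard scikit-learn choices):

from sklearn.metrics import precision_score, recall_score, f1_score

# 'macro' = unweighted mean over classes; 'micro' = pooled global counts;
# 'weighted' = class mean weighted by support
print(precision_score(y_true, y_pred, average='macro'))
print(recall_score(y_true, y_pred, average='micro'))
print(f1_score(y_true, y_pred, average='weighted'))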