SlideShare a Scribd company logo
ECWAY TECHNOLOGIES
IEEE PROJECTS & SOFTWARE DEVELOPMENTS
OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE
CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111
VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com

RELATIONSHIPS BETWEEN DIVERSITY OF CLASSIFICATION ENSEMBLES AND
SINGLE-CLASS PERFORMANCE MEASURES
ABSTRACT:
In class imbalance learning problems, how to better recognize examples from the minority class
is the key focus, since it is usually more important and expensive than the majority class. Quite a
few ensemble solutions have been proposed in the literature with varying degrees of success. It is
generally believed that diversity in an ensemble could help to improve the performance of class
imbalance learning. However, no study has actually investigated diversity in depth in terms of its
definitions and effects in the context of class imbalance learning. It is unclear whether diversity
will have a similar or different impact on the performance of minority and majority classes.

In this paper, we aim to gain a deeper understanding of if and when ensemble diversity has a
positive impact on the classification of imbalanced data sets. First, we explain when and why
diversity measured by Q-statistic can bring improved overall accuracy based on two
classification patterns proposed by Kuncheva et al. We define and give insights into good and
bad patterns in imbalanced scenarios. Then, the pattern analysis is extended to single-class
performance measures, including recall, precision, and Fmeasure, which are widely used in class
imbalance learning. Six different situations of diversity’s impact on these measures are obtained
through theoretical analysis.

Finally, to further understand how diversity affects the single class performance and overall
performance in class imbalance problems, we carry out extensive experimental studies on both
artificial data sets and real-world benchmarks with highly skewed class distributions. We find
strong correlations between diversity and discussed performance measures. Diversity shows a
positive impact on the minority class in general. It is also beneficial to the overall performance in
terms of AUC and G-mean.

More Related Content

PDF
Dotnet relationships between diversity of classification ensembles and singl...
DOCX
Relationships between diversity of classification ensembles and single class
PPTX
Lambert tamuc symposium_3_apr2014
PDF
Linq 2013 session_red_3_grammatikopoulos_gregoriadis_natsi_klapsinou
PPTX
Comprasion
PDF
Confirming the Mediation Effect of a Structural Model by Using Bootstrap Appr...
PPTX
Sweety_AU
PPT
Using the Ados Severity Metric to Evaluate a Behavioral Intervention in a Lar...
Dotnet relationships between diversity of classification ensembles and singl...
Relationships between diversity of classification ensembles and single class
Lambert tamuc symposium_3_apr2014
Linq 2013 session_red_3_grammatikopoulos_gregoriadis_natsi_klapsinou
Comprasion
Confirming the Mediation Effect of a Structural Model by Using Bootstrap Appr...
Sweety_AU
Using the Ados Severity Metric to Evaluate a Behavioral Intervention in a Lar...

What's hot (10)

PPTX
Education and Methodology
PPTX
Action research
DOCX
Abstract english
PPTX
Systems approach
PPTX
Sumdog powerpoint
PPT
How Do Coping Strategies Correlate With Job Satisfaction Revised
PDF
Effect of Utilizing Geometer’s Sketchpad Software on Students’ Academic Achie...
PPTX
The relationship between classroom behavior and reading performance
DOCX
Essay on formative assessment
Education and Methodology
Action research
Abstract english
Systems approach
Sumdog powerpoint
How Do Coping Strategies Correlate With Job Satisfaction Revised
Effect of Utilizing Geometer’s Sketchpad Software on Students’ Academic Achie...
The relationship between classroom behavior and reading performance
Essay on formative assessment
Ad

Viewers also liked (19)

PDF
Dotnet scalable and secure sharing of personal health records in cloud compu...
DOCX
2 historia de los juegos olímpicos modernos
PDF
Dotnet secure communication based on ambient audio
PDF
Dotnet security analysis of a single sign-on mechanism for distributed compu...
PDF
Dotnet region-based foldings in process discovery
PPS
Meses Faltantes 2008
PPS
Fotos Incriveis
DOC
DecisionMakingChart
PPT
Etorki Zun
ODP
DOCX
Lengua tema 2 Victor
PPT
Voz Pasiva
PPTX
Apresentação1
ODT
Cristo, nosso Sacerdote_Mapa_Mental_842013
DOCX
Concepto salud
PDF
Pdf demo
DOCX
EL TECLADO
DOCX
Nucleo1
PPS
Pintures Rupestres
Dotnet scalable and secure sharing of personal health records in cloud compu...
2 historia de los juegos olímpicos modernos
Dotnet secure communication based on ambient audio
Dotnet security analysis of a single sign-on mechanism for distributed compu...
Dotnet region-based foldings in process discovery
Meses Faltantes 2008
Fotos Incriveis
DecisionMakingChart
Etorki Zun
Lengua tema 2 Victor
Voz Pasiva
Apresentação1
Cristo, nosso Sacerdote_Mapa_Mental_842013
Concepto salud
Pdf demo
EL TECLADO
Nucleo1
Pintures Rupestres
Ad

Similar to Dotnet relationships between diversity of classification ensembles and single-class performance measures (20)

PDF
Java relationships between diversity of classification ensembles and single-...
PDF
Relationships between diversity of classification ensembles and single class ...
PDF
Java relationships between diversity of classification ensembles and single-...
PDF
Relationships between diversity of classification ensembles and single class ...
PDF
The Impact of Class Rebalancing Techniques on the Performance and Interpretat...
PDF
Buddi health class imbalance based deep learning
PDF
An overview on data mining designed for imbalanced datasets
PDF
An overview on data mining designed for imbalanced datasets
PDF
A three-step combination strategy for addressing outliers and class imbalance...
PPTX
COMP_GroupA2.pptx
PDF
Multi-Cluster Based Approach for skewed Data in Data Mining
PDF
Predicting instructor performance using data mining techniques in higher educ...
PDF
Analysis of Imbalanced Classification Algorithms A Perspective View
PDF
Evaluation measures for models assessment over imbalanced data sets
PDF
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
PDF
Student Performance Prediction via Data Mining & Machine Learning
PDF
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...
Java relationships between diversity of classification ensembles and single-...
Relationships between diversity of classification ensembles and single class ...
Java relationships between diversity of classification ensembles and single-...
Relationships between diversity of classification ensembles and single class ...
The Impact of Class Rebalancing Techniques on the Performance and Interpretat...
Buddi health class imbalance based deep learning
An overview on data mining designed for imbalanced datasets
An overview on data mining designed for imbalanced datasets
A three-step combination strategy for addressing outliers and class imbalance...
COMP_GroupA2.pptx
Multi-Cluster Based Approach for skewed Data in Data Mining
Predicting instructor performance using data mining techniques in higher educ...
Analysis of Imbalanced Classification Algorithms A Perspective View
Evaluation measures for models assessment over imbalanced data sets
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
Student Performance Prediction via Data Mining & Machine Learning
Learning from a Class Imbalanced Public Health Dataset: a Cost-based Comparis...

Dotnet relationships between diversity of classification ensembles and single-class performance measures

  • 1. ECWAY TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111 VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com RELATIONSHIPS BETWEEN DIVERSITY OF CLASSIFICATION ENSEMBLES AND SINGLE-CLASS PERFORMANCE MEASURES ABSTRACT: In class imbalance learning problems, how to better recognize examples from the minority class is the key focus, since it is usually more important and expensive than the majority class. Quite a few ensemble solutions have been proposed in the literature with varying degrees of success. It is generally believed that diversity in an ensemble could help to improve the performance of class imbalance learning. However, no study has actually investigated diversity in depth in terms of its definitions and effects in the context of class imbalance learning. It is unclear whether diversity will have a similar or different impact on the performance of minority and majority classes. In this paper, we aim to gain a deeper understanding of if and when ensemble diversity has a positive impact on the classification of imbalanced data sets. First, we explain when and why diversity measured by Q-statistic can bring improved overall accuracy based on two classification patterns proposed by Kuncheva et al. We define and give insights into good and bad patterns in imbalanced scenarios. Then, the pattern analysis is extended to single-class performance measures, including recall, precision, and Fmeasure, which are widely used in class imbalance learning. Six different situations of diversity’s impact on these measures are obtained through theoretical analysis. Finally, to further understand how diversity affects the single class performance and overall performance in class imbalance problems, we carry out extensive experimental studies on both artificial data sets and real-world benchmarks with highly skewed class distributions. We find strong correlations between diversity and discussed performance measures. Diversity shows a positive impact on the minority class in general. It is also beneficial to the overall performance in terms of AUC and G-mean.