RELATIONSHIPS BETWEEN DIVERSITY OF CLASSIFICATION ENSEMBLES AND
SINGLE-CLASS PERFORMANCE MEASURES
ABSTRACT:
In class imbalance learning problems, the key focus is how to better recognize examples from the
minority class, since it is usually more important, and more costly to misclassify, than the majority class. Quite a
few ensemble solutions have been proposed in the literature with varying degrees of success. It is
generally believed that diversity in an ensemble could help to improve the performance of class
imbalance learning. However, no study has actually investigated diversity in depth in terms of its
definitions and effects in the context of class imbalance learning. It is unclear whether diversity
will have a similar or different impact on the performance of minority and majority classes.
In this paper, we aim to gain a deeper understanding of if and when ensemble diversity has a
positive impact on the classification of imbalanced data sets. First, we explain when and why
diversity measured by the Q-statistic can improve overall accuracy, based on two
classification patterns proposed by Kuncheva et al. We define and give insights into good and
bad patterns in imbalanced scenarios. Then, the pattern analysis is extended to single-class
performance measures, including recall, precision, and F-measure, which are widely used in class
imbalance learning. Six different situations of diversity’s impact on these measures are obtained
through theoretical analysis.
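
As a rough illustration of the diversity measure named above, the following sketch computes the pairwise Q-statistic (Yule's Q) from the per-example correctness of two classifiers and averages it over all pairs in an ensemble. The function names `q_statistic` and `mean_pairwise_q` are hypothetical, and returning 0.0 when the denominator is zero is a convention assumed here, not something the abstract specifies. Lower Q values indicate a more diverse pair.

```python
from itertools import combinations


def q_statistic(correct_a, correct_b):
    """Pairwise Q-statistic from two equal-length sequences of booleans.

    Each entry is True when that classifier labelled the example correctly.
    Q = (N11*N00 - N01*N10) / (N11*N00 + N01*N10)
    """
    n11 = n00 = n10 = n01 = 0
    for a, b in zip(correct_a, correct_b):
        if a and b:
            n11 += 1          # both correct
        elif not a and not b:
            n00 += 1          # both wrong
        elif a:
            n10 += 1          # only the first classifier is correct
        else:
            n01 += 1          # only the second classifier is correct
    denominator = n11 * n00 + n01 * n10
    if denominator == 0:
        return 0.0            # assumed convention for the undefined case
    return (n11 * n00 - n01 * n10) / denominator


def mean_pairwise_q(correctness):
    """Average Q over all classifier pairs; `correctness` is a list of sequences."""
    pairs = list(combinations(correctness, 2))
    return sum(q_statistic(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    same = [True, True, False, False]
    opposite = [False, False, True, True]
    print(q_statistic(same, same))       # classifiers that err on the same examples
    print(q_statistic(same, opposite))   # classifiers that err on different examples
```

Two classifiers that fail on exactly the same examples give Q = 1 (no diversity), while two that never fail together give Q = -1.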
Finally, to further understand how diversity affects single-class performance and overall
performance in class imbalance problems, we carry out extensive experimental studies on both
artificial data sets and real-world benchmarks with highly skewed class distributions. We find
strong correlations between diversity and the performance measures discussed. Diversity shows a
positive impact on the minority class in general. It is also beneficial to the overall performance in
terms of AUC and G-mean.
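
For readers unfamiliar with the measures mentioned in the abstract, here is a minimal sketch of how recall, precision, F-measure, and G-mean can be computed for a two-class problem from true and predicted labels. The function name `single_class_measures`, the choice of label 1 for the minority class, and returning 0.0 for undefined ratios are all assumptions made for illustration. AUC is left out because it needs ranked scores rather than hard labels.

```python
import math


def single_class_measures(y_true, y_pred, positive=1):
    """Recall, precision and F-measure of the `positive` class, plus G-mean."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1
        elif truth == positive:
            fn += 1
        elif pred == positive:
            fp += 1
        else:
            tn += 1

    # Undefined ratios fall back to 0.0 (assumed convention).
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # Recall of the other class, often called specificity.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    # G-mean: geometric mean of the recalls of the two classes.
    g_mean = math.sqrt(recall * specificity)

    return {
        "recall": recall,
        "precision": precision,
        "f_measure": f_measure,
        "g_mean": g_mean,
    }


if __name__ == "__main__":
    # Toy imbalanced set: 2 minority examples (label 1), 8 majority (label 0).
    y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
    y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
    print(single_class_measures(y_true, y_pred))
```

On this toy set overall accuracy is 80 percent even though only half of the minority examples are found, which is why class imbalance studies report single-class measures and G-mean rather than accuracy alone.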
ECWAY TECHNOLOGIES
IEEE PROJECTS & SOFTWARE DEVELOPMENTS
OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE
CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111
VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com

