On the Diffusion of Test Smells in Automatically Generated Test Code: An Empirical Study

On the Diffusion of Test Smells in Automatically
Generated Test Code: An Empirical Study
Fabio Palomba*, Dario Di Nucci*, Annibale Panichella+,
Rocco Oliveto°, Andrea De Lucia*
* University of Salerno, Italy
+ Delft University of Technology, The Netherlands
° University of Molise, Italy
SBST 2016
May 16th, 2016
Austin, TX, USA

Automatic Generation of Test Code
method1()
method2()
…
methodN()
Production Class
test_method1()
test_method2()
…
test_methodN()
Test Suite

[Arcuri and Fraser - SSBSE’14]
Effectiveness of Whole Suite
Test Case Generation
[Panichella et al. - ICSE’16]
Impact of Test Case Summaries
on Bug Fixing Performance
[Shamshiri et al. - ASE’15]
Effectiveness of Generated
Test Cases at Finding Faults

[Rojas et al. - ISSTA’15]
Usability of Automatic
Generation Tools in Practice

What about the characteristics of test code produced by such tools?

Test Smells in Test Code
[Van Deursen et al. - XP 2001]
“Test Smells represent a set of poor design solutions for writing tests”
11 test smells related to the way developers write test fixtures and test cases

public void test12() throws Throwable {
    JSTerm jSTerm0 = new JSTerm();
    jSTerm0.makeVariable();
    jSTerm0.add((Object) "");
    jSTerm0.matches(jSTerm0);
    assertEquals(false, jSTerm0.isGround());
    assertEquals(true, jSTerm0.isVariable());
}

The test method checks the production method isGround(),
but also the production method isVariable().

This is an Eager Test, namely a test that checks more than
one method of the class under test, which makes the actual
test target difficult to understand.
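
A minimal sketch of how the eager test above could be split so that each test targets a single production method. `TermStub` is a hypothetical stand-in for JSTerm (whose real behavior is not shown on the slides), and plain boolean checks replace JUnit assertions only to keep the sketch self-contained.

```java
// Hypothetical stand-in for the JSTerm class used on the slide;
// only the methods needed by the sketch are modeled.
class TermStub {
    private boolean variable = false;

    void makeVariable() { variable = true; }

    boolean isVariable() { return variable; }

    // Assumption made for this sketch: a variable term is never ground.
    boolean isGround() { return !variable; }
}

class FocusedTermTests {
    // Targets isGround() only.
    static boolean testIsGroundAfterMakeVariable() {
        TermStub term = new TermStub();
        term.makeVariable();
        return !term.isGround();
    }

    // Targets isVariable() only.
    static boolean testIsVariableAfterMakeVariable() {
        TermStub term = new TermStub();
        term.makeVariable();
        return term.isVariable();
    }

    public static void main(String[] args) {
        System.out.println("isGround test passed: " + testIsGroundAfterMakeVariable());
        System.out.println("isVariable test passed: " + testIsVariableAfterMakeVariable());
    }
}
```

With one production method per test, a failure points directly at the method whose behavior changed.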

A test case is affected by Resource Optimism when
it makes assumptions about the state or the existence
of external resources, producing a non-deterministic
result that depends on the state of those resources.

Assertion Roulette comes from having several
assertions in a test method that have no explanation.
If an assertion fails, identifying which one
failed can be difficult.
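
As a rough illustration of how Assertion Roulette might be spotted automatically, the sketch below counts JUnit-style assertion calls in a test method body and flags bodies with more than one. This is a simplified heuristic written for this example, not the detector used in the study: it ignores whether an assertion carries an explanatory message, so it over-approximates the smell. The class and method names are hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class AssertionRouletteHeuristic {
    // Matches JUnit-style assertion calls such as assertEquals( or assertNotNull (.
    private static final Pattern ASSERT_CALL = Pattern.compile("\\bassert\\w*\\s*\\(");

    // Counts assertion calls in the source text of one test method body.
    static int countAssertions(String methodBody) {
        Matcher matcher = ASSERT_CALL.matcher(methodBody);
        int count = 0;
        while (matcher.find()) {
            count++;
        }
        return count;
    }

    // Flags a body with more than one assertion as a candidate;
    // assertion messages are deliberately not inspected here.
    static boolean looksLikeAssertionRoulette(String methodBody) {
        return countAssertions(methodBody) > 1;
    }

    public static void main(String[] args) {
        String body = "assertEquals (false, t.isGround());\n"
                    + "assertEquals(true, t.isVariable());";
        System.out.println(looksLikeAssertionRoulette(body));
    }
}
```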

Who cares about Test Smells?
Test Cases can be re-generated!

True, BUT
Developers modify and remove test code
Developers add tests when automatic tools leave
uncovered branches
Developers combine generated with manually written tests
[Rojas et al. - ISSTA’15]
Usability of Automatic Generation Tools in Practice

Empirical Study Design

110 software projects
[Fraser and Arcuri - TOSEM 2014]
“A Large Scale Evaluation of Automated Unit
Test Generation using EvoSuite”

8 test smell types
“Refactoring Test Code”

16,603 JUnit classes

Data Extraction
test_method1()
test_method2()
…
test_methodN()
Test Suite
[Bavota et al. - EMSE 2015]
“Are Test Smells Harmful? An Empirical Study”
Test Smell Detector

75% precision
100% recall
Sample size: 378 JUnit classes
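
For readers unfamiliar with the two accuracy figures, the sketch below shows how precision and recall are computed from true positives, false positives, and false negatives. The counts in `main` are hypothetical and chosen only to reproduce the reported ratios; the slides do not give the underlying numbers.

```java
class DetectorAccuracy {
    // precision = TP / (TP + FP): share of reported smells that are real.
    static double precision(int truePositives, int falsePositives) {
        return (double) truePositives / (truePositives + falsePositives);
    }

    // recall = TP / (TP + FN): share of real smells that are reported.
    static double recall(int truePositives, int falseNegatives) {
        return (double) truePositives / (truePositives + falseNegatives);
    }

    public static void main(String[] args) {
        // Hypothetical counts, not taken from the study.
        int tp = 75, fp = 25, fn = 0;
        System.out.println("precision = " + precision(tp, fp));
        System.out.println("recall = " + recall(tp, fn));
    }
}
```

A recall of 100% means no smelly class in the sample was missed, while 75% precision means one in four reported instances was a false alarm.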

Research Questions
RQ1: To What Extent Are Test Smells Spread in
Automatically Generated Test Classes?
RQ2: Which Test Smells Occur More Frequently
in Automatically Generated Test Classes?
RQ3: Which Test Smells Co-Occur?
RQ4: Is There a Relationship Between the Presence
of Test Smells and the Project Characteristics?

Analysis of the Results

Results of the Study
RQ1
13,791 smelly JUnit classes
83% of the JUnit classes analyzed
Test Smells are highly diffused in the
automatically generated test suites
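
The 83% figure is consistent with the two counts given in the talk, assuming the 13,791 smelly classes are measured against the 16,603 JUnit classes of the study design:

```latex
\[
\frac{13{,}791}{16{,}603} \approx 0.8306 \approx 83\%
\]
```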

Assertion Roulette 54%

Test Code Duplication
33%

Eager Test
33%
29%

public void test8() throws Throwable {
    Document document0 = new Document();
    assertNotNull(document0);
    document0.procText.add((Character) 's');
    String string0 = document0.stringify();
    assertEquals("s", document0.stringify());
    assertNotNull(string0);
    assertEquals("s", string0);
}
Assertion Roulette

Assertion Roulette
What is the behavior under test?
Are the generated assertions valid?

Assertion Roulette
[Panichella et al. - ICSE’16]
The Impact of Test Case Summaries on Bug Fixing Performance:
An Empirical Investigation
These problems have a huge impact on
developers’ ability to find faults
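
One common way to reduce Assertion Roulette is to give each assertion an explanatory message, as JUnit 4 allows with `assertEquals(String message, Object expected, Object actual)`. The sketch below illustrates the idea without depending on JUnit: `DocumentStub` is a hypothetical stand-in for the Document class from the test8 example, and `checkEquals` is a hypothetical helper that only mimics the message-first shape of the JUnit call.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the Document class from the slide.
class DocumentStub {
    final List<Character> procText = new ArrayList<>();

    // Concatenates the stored characters into a string.
    String stringify() {
        StringBuilder sb = new StringBuilder();
        for (char c : procText) {
            sb.append(c);
        }
        return sb.toString();
    }
}

class ExplainedAssertions {
    // Hypothetical helper mimicking a message-first equality assertion.
    static void checkEquals(String message, Object expected, Object actual) {
        if (!expected.equals(actual)) {
            throw new AssertionError(
                message + " expected:<" + expected + "> but was:<" + actual + ">");
        }
    }

    // A single, explained assertion: the message states the behavior under test.
    static void testStringifySingleCharacter() {
        DocumentStub document = new DocumentStub();
        document.procText.add('s');
        checkEquals("stringify() should return the added character",
                    "s", document.stringify());
    }

    public static void main(String[] args) {
        testStringifySingleCharacter();
        System.out.println("test passed");
    }
}
```

When such an assertion fails, the message names the intended behavior, so the reader does not have to guess which check broke or why it was there.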

public void test8() throws Throwable {
    GenericProperties generic0 = new GenericProperties();
    boolean boolean0 = generic0.isValidClassname();
    …
}
    GenericProperties generic0 = new GenericProperties();
    boolean boolean0 = generic0.isValidClassname();
    …
}

This problem can be avoided by
generating test fixtures!
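
A minimal sketch of what a generated fixture could look like: the object creation repeated in every test moves into one setup method. `GenericPropertiesStub`, its setter, and its validity rule are hypothetical stand-ins for the GenericProperties class on the slide, and `setUp()` plays the role that a JUnit 4 `@Before` method would play in a real test class.

```java
// Hypothetical stand-in for the GenericProperties class on the slide;
// the setter and the validity rule are assumptions made for this sketch.
class GenericPropertiesStub {
    private String classname;

    void setClassname(String classname) { this.classname = classname; }

    boolean isValidClassname() {
        return classname != null && !classname.isEmpty();
    }
}

class FixtureSketch {
    private GenericPropertiesStub generic;

    // Shared fixture: the object creation is written once
    // instead of being duplicated in every test method.
    void setUp() {
        generic = new GenericPropertiesStub();
    }

    boolean testDefaultClassnameIsInvalid() {
        return !generic.isValidClassname();
    }

    boolean testNonEmptyClassnameIsValid() {
        generic.setClassname("Example");
        return generic.isValidClassname();
    }

    public static void main(String[] args) {
        FixtureSketch suite = new FixtureSketch();
        suite.setUp(); // re-run before each test, as a test runner would
        System.out.println(suite.testDefaultClassnameIsInvalid());
        suite.setUp();
        System.out.println(suite.testNonEmptyClassnameIsValid());
    }
}
```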

Assertion Roulette and Eager Test
Assertion Roulette and Sensitive Equality
Resource Optimism and Mystery Guest
The main goal of automatic tools is to maximize
coverage, without considering test code quality

Yes! The higher the LOC to be tested, the higher the
probability of producing a smelly test case!

The higher the LOC of the JUnit class, the higher the …

Summarizing
Lesson 1: Current implementations of search-based algorithms
for automatic test case generation do not consider code quality,
which increases the probability of introducing smells!

NB: Considering test code quality is important not only to
avoid the introduction of smells, but also because the
coverage can be increased!
[F. Palomba, A. Panichella, A. Zaidman, R. Oliveto, A. De Lucia - ISSTA’16]
Automatic Test Case Generation: What If Test Code Quality Matters?

Lesson 2: Automatic test case generation tools do not produce
test fixtures during their computation, and this implies the
introduction of several code clones in the resulting JUnit classes.
Future research should invest effort in the automatic
generation of test fixtures!

From now on…
Challenge 1: Evaluating Test Smells in Test Cases
automatically generated by other tools

Challenge 2: Defining new algorithms able to solve
the design problems analyzed (e.g., test fixtures).
