Clegg, Benjamin Simon ORCID: https://orcid.org/0000-0002-1323-7133 (2021) The application of mutation testing to enhance the automated assessment of introductory programming assignments. PhD thesis, University of Sheffield.
Abstract
Growing cohorts of students in introductory programming courses pose a challenge for manual assessment: it is impractical for a tutor to manually evaluate hundreds or even thousands of students’ programs in a timely manner. Furthermore, manual assessment is not always fair; tutors can make mistakes in their assessment. Automated assessment provides a solution to these problems: a computer can evaluate the correctness and style of students’ programs, and generate feedback accordingly, in much less time and with a high degree of consistency. A particularly widespread approach is test-based automated assessment, in which a tutor writes a test suite to evaluate the correctness of students’ programs; the computer executes this suite automatically and generates a grade and applicable feedback according to the test results.
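To make this workflow concrete, the following is a minimal, self-contained sketch in Python. The thesis does not prescribe an implementation; the exercise, the test names, and the proportional grading scheme here are all illustrative assumptions.

import unittest

# A hypothetical student submission, inlined here to keep the sketch
# self-contained; a real grader would load it from the student's file.
def student_add(a, b):
    return a + b  # a faulty submission might instead compute a - b

class TutorTests(unittest.TestCase):
    # Tutor-written assessment tests for the 'add' exercise.
    def test_small_numbers(self):
        self.assertEqual(student_add(2, 3), 5)

    def test_negative_numbers(self):
        self.assertEqual(student_add(-1, 1), 0)

def grade(test_case_class):
    # Grade as the fraction of passing tests: a simple proportional
    # scheme assumed for illustration; real grading policies vary.
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(test_case_class)
    result = unittest.TestResult()
    suite.run(result)
    failed = len(result.failures) + len(result.errors)
    return (result.testsRun - failed) / result.testsRun if result.testsRun else 0.0

print(grade(TutorTests))  # 1.0 for this correct submission

Feedback can be derived from the same results object, for example by reporting the names of the failed tests back to the student.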
Such assessment test suites are not necessarily flawless, however. For example, a test suite may fail to detect some faults present in students’ programs, so students may receive inaccurate grades and feedback where their mistakes are missed. In the software engineering industry, adequacy metrics and test goals are often employed to ensure that test suites can detect faults; by achieving such test goals and high adequacy scores, a test suite should detect faults more reliably. One approach is to measure coverage: which elements of a program are executed by a test suite, and which are not. Naturally, a test suite which exercises more of a program should be more capable of detecting faults. However, executing a program element does not guarantee that a fault within it is detected; for example, some faults only manifest for particular states of the program. Mutation testing offers a different approach to evaluating the adequacy of a test suite: it involves generating artificial faulty variants of the program, called mutants, and executing the test suite on each of them. A test suite which detects (“kills”) more of these mutants should be more capable of detecting faults. Furthermore, the undetected mutants can inform the creation of new tests to improve adequacy.
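The mutation testing loop itself can be sketched just as briefly. What follows is again an illustrative assumption rather than the thesis’s tooling: a single hand-written mutation operator over Python source (real mutation tools apply many operators), with the tutor’s tests re-run against each mutant.

import ast
import unittest

# A hypothetical student submission, held as source text so that it
# can be re-parsed and mutated.
STUDENT_SOURCE = """
def add(a, b):
    return a + b
"""

class AddToSubMutator(ast.NodeTransformer):
    # Single illustrative mutation operator: replace the target-th
    # '+' in the source with '-'.
    def __init__(self, target):
        self.target = target
        self.count = 0

    def visit_BinOp(self, node):
        self.generic_visit(node)
        if isinstance(node.op, ast.Add):
            if self.count == self.target:
                node.op = ast.Sub()
            self.count += 1
        return node

def make_tests(ns):
    # Tutor tests, bound to whichever (possibly mutated) namespace is
    # currently under test.
    class Tests(unittest.TestCase):
        def test_add(self):
            self.assertEqual(ns["add"](2, 3), 5)
    return unittest.defaultTestLoader.loadTestsFromTestCase(Tests)

def mutation_score(source):
    # Generate one mutant per '+' operator; the score is the fraction
    # of mutants that the test suite detects.
    tree = ast.parse(source)
    n_mutants = sum(isinstance(n, ast.BinOp) and isinstance(n.op, ast.Add)
                    for n in ast.walk(tree))
    killed = 0
    for i in range(n_mutants):
        mutant = ast.fix_missing_locations(AddToSubMutator(i).visit(ast.parse(source)))
        ns = {}
        exec(compile(mutant, "<mutant>", "exec"), ns)
        result = unittest.TestResult()
        make_tests(ns).run(result)
        if not result.wasSuccessful():  # a failing suite kills the mutant
            killed += 1
    return killed / n_mutants if n_mutants else 1.0

print(mutation_score(STUDENT_SOURCE))  # 1.0: the suite kills the only mutant

A score below 1.0 would indicate surviving mutants, each pointing at behaviour the suite never checks, and hence at a candidate new test.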
Accordingly, in this thesis I investigate how mutation testing can be used to improve grading test suites. First, I consider how different test suites can assign varying grades to students’ solution programs: is there a risk of inadequate test suites generating unfair grades? I also investigate how different observable properties, including coverage and the detection of mutants, influence such differences in grades. Finally, I evaluate how applicable mutation testing is to improving grading test suites: do the fundamental assumptions of mutation testing hold for students’ programs, and does improving a test suite’s ability to detect artificial faults also improve its ability to detect students’ faults?
Download
Final eThesis - complete (pdf)
Filename: ben-clegg_thesis-corrected_2022-03-22.pdf
Licence: This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.