Use of Machine Learning Methods in the Assessment of Programming Assignments

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Programming has become an important skill in today’s world and is taught widely both in traditional and online settings. Educators need to grade increasing numbers of student submissions. Unit testing can contribute to the automation of the grading process; however, it cannot assess the structure, or partial correctness, which are needed for finely differentiated grading. This paper builds on previous research that investigated several machine learning models for determining the correctness of source code. It was found that some such models can be successful, provided that the code samples used for fitting and prediction fulfil the same sets of requirements (corresponding to coding assignments). The hypothesis investigated in this paper is that code samples can be grouped by similarity of the requirements that they fulfil and that models built with samples of code from such a group can be used for determining the quality of new samples that belong to the same group, even if they do not correspond to the same coding assignment, which would make for a much more useful predictive model in practice. The investigation involved ten different machine learning algorithms used on over four hundred thousand student code submissions and it confirmed the hypothesis.

Original languageEnglish
Title of host publicationText, Speech, and Dialogue - 25th International Conference, TSD 2022, Proceedings
EditorsPetr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala
PublisherSpringer Science and Business Media Deutschland GmbH
Pages151-159
Number of pages9
ISBN (Print)9783031162695
DOIs
Publication statusPublished - 2022
Event25th International Conference on Text, Speech, and Dialogue, TSD 2022 - Brno, Czech Republic
Duration: 6 Sep 20229 Sep 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13502 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Text, Speech, and Dialogue, TSD 2022
Country/TerritoryCzech Republic
CityBrno
Period6/09/229/09/22

Keywords

  • Applied Machine Learning for Code Assessment
  • Automated Grading
  • Student Programming Code Grading

Fingerprint

Dive into the research topics of 'Use of Machine Learning Methods in the Assessment of Programming Assignments'. Together they form a unique fingerprint.

Cite this