Predicting Success in CS1 - An Open Access Data Project

Keith Quille, Keith Nolan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

PreSS# is an online Machine Learning prediction model that aims to identify students at risk of failing or dropping out in an introductory programming course (typically called CS1). PreSS# has been developed over the past 16 years, where the model is capable of predicting at-risk students with an accuracy of 71%. There is, however, a need to re-validate the model using a larger international multi-jurisdictional multi-university data set, as up until now the data sets have been predominantly from a single jurisdiction. The goal of this study is to not only re-validate the model using a multi-jurisdictional data set, but, in line with a 2015 ITiCSE working group report's Grand Challenges, to openly publish the data set itself. This work timely to the CSEd community as other researchers can use this data to further their research, re-validate PreSS# and will be able to then contribute, by submitting their local PreSS# data sets to this global online repository.

Original languageEnglish
Title of host publicationSIGCSE 2022 - Proceedings of the 53rd ACM Technical Symposium on Computer Science Education V.2
PublisherAssociation for Computing Machinery, Inc
Pages1126
Number of pages1
ISBN (Electronic)9781450390712
DOIs
Publication statusPublished - 3 Mar 2022
Event53rd Annual ACM Technical Symposium on Computer Science Education, SIGCSE 2022 - Virtual, Online, United States
Duration: 3 Mar 20225 Mar 2022

Publication series

NameSIGCSE 2022 - Proceedings of the 53rd ACM Technical Symposium on Computer Science Education V.2

Conference

Conference53rd Annual ACM Technical Symposium on Computer Science Education, SIGCSE 2022
Country/TerritoryUnited States
CityVirtual, Online
Period3/03/225/03/22

Keywords

  • edm
  • educational data mining
  • machine learning
  • metrics
  • re-validation

Fingerprint

Dive into the research topics of 'Predicting Success in CS1 - An Open Access Data Project'. Together they form a unique fingerprint.

Cite this