TY - GEN
T1 - Predicting Success in CS1 - An Open Access Data Project
AU - Quille, Keith
AU - Nolan, Keith
N1 - Publisher Copyright:
© 2022 Owner/Author.
PY - 2022/3/3
Y1 - 2022/3/3
N2 - PreSS# is an online Machine Learning prediction model that aims to identify students at risk of failing or dropping out in an introductory programming course (typically called CS1). PreSS# has been developed over the past 16 years, where the model is capable of predicting at-risk students with an accuracy of 71%. There is, however, a need to re-validate the model using a larger international multi-jurisdictional multi-university data set, as up until now the data sets have been predominantly from a single jurisdiction. The goal of this study is to not only re-validate the model using a multi-jurisdictional data set, but, in line with a 2015 ITiCSE working group report's Grand Challenges, to openly publish the data set itself. This work timely to the CSEd community as other researchers can use this data to further their research, re-validate PreSS# and will be able to then contribute, by submitting their local PreSS# data sets to this global online repository.
AB - PreSS# is an online Machine Learning prediction model that aims to identify students at risk of failing or dropping out in an introductory programming course (typically called CS1). PreSS# has been developed over the past 16 years, where the model is capable of predicting at-risk students with an accuracy of 71%. There is, however, a need to re-validate the model using a larger international multi-jurisdictional multi-university data set, as up until now the data sets have been predominantly from a single jurisdiction. The goal of this study is to not only re-validate the model using a multi-jurisdictional data set, but, in line with a 2015 ITiCSE working group report's Grand Challenges, to openly publish the data set itself. This work timely to the CSEd community as other researchers can use this data to further their research, re-validate PreSS# and will be able to then contribute, by submitting their local PreSS# data sets to this global online repository.
KW - edm
KW - educational data mining
KW - machine learning
KW - metrics
KW - re-validation
UR - http://www.scopus.com/inward/record.url?scp=85127373343&partnerID=8YFLogxK
U2 - 10.1145/3478432.3499092
DO - 10.1145/3478432.3499092
M3 - Conference contribution
AN - SCOPUS:85127373343
T3 - SIGCSE 2022 - Proceedings of the 53rd ACM Technical Symposium on Computer Science Education V.2
SP - 1126
BT - SIGCSE 2022 - Proceedings of the 53rd ACM Technical Symposium on Computer Science Education V.2
PB - Association for Computing Machinery, Inc
T2 - 53rd Annual ACM Technical Symposium on Computer Science Education, SIGCSE 2022
Y2 - 3 March 2022 through 5 March 2022
ER -