dc.contributor.author
Sieg, Miriam
dc.contributor.author
Roselló Atanet, Iván
dc.contributor.author
Tomova, Mihaela Todorova
dc.contributor.author
Schoeneberg, Uwe
dc.contributor.author
Sehy, Victoria
dc.contributor.author
Mäder, Patrick
dc.contributor.author
März, Maren
dc.date.accessioned
2025-08-12T11:44:26Z
dc.date.available
2025-08-12T11:44:26Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/48674
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-48398
dc.description.abstract
BackgroundThe Progress Test Medizin (PTM) is a 200-question formative test that is administered to approximately 11,000 students at medical universities (Germany, Austria, Switzerland) each term. Students receive feedback on their knowledge (development) mostly in comparison to their own cohort. In this study, we use the data of the PTM to find groups with similar response patterns.MethodsWe performed k-means clustering with a dataset of 5,444 students, selected cluster number k = 5, and answers as features. Subsequently, the data was passed to XGBoost with the cluster assignment as target enabling the identification of cluster-relevant questions for each cluster with SHAP. Clusters were examined by total scores, response patterns, and confidence level. Relevant questions were evaluated for difficulty index, discriminatory index, and competence levels.ResultsThree of the five clusters can be seen as "performance" clusters: cluster 0 (n = 761) consisted predominantly of students close to graduation. Relevant questions tend to be difficult, but students answered confidently and correctly. Students in cluster 1 (n = 1,357) were advanced, cluster 3 (n = 1,453) consisted mainly of beginners. Relevant questions for these clusters were rather easy. The number of guessed answers increased. There were two "drop-out" clusters: students in cluster 2 (n = 384) dropped out of the test about halfway through after initially performing well; cluster 4 (n = 1,489) included students from the first semesters as well as "non-serious" students both with mostly incorrect guesses or no answers.ConclusionClusters placed performance in the context of participating universities. Relevant questions served as good cluster separators and further supported our "performance" cluster groupings.
en
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
Progress test
en
dc.subject
Unsupervised machine learning
en
dc.subject
Supervised machine learning
en
dc.subject
Student groups
en
dc.subject
Classification
en
dc.subject
Ensemble learning
en
dc.subject
Boosting algorithm
en
dc.subject.ddc
600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit::610 Medizin und Gesundheit
dc.title
Discovering unknown response patterns in progress test data to improve the estimation of student performance
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.articlenumber
193
dcterms.bibliographicCitation.doi
10.1186/s12909-023-04172-w
dcterms.bibliographicCitation.journaltitle
BMC Medical Education
dcterms.bibliographicCitation.number
1
dcterms.bibliographicCitation.originalpublishername
Springer Nature
dcterms.bibliographicCitation.volume
23
refubium.affiliation
Charité - Universitätsmedizin Berlin
refubium.funding
Springer Nature DEAL
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.bibliographicCitation.pmid
36978145
dcterms.isPartOf.eissn
1472-6920