Using Process Mining for Git Log Analysis of Projects in a Software Development Course

Investor logo

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

MACÁK Martin KRÚŽELOVÁ Daniela CHREN Stanislav BÜHNOVÁ Barbora

Year of publication 2021
Type Article in Periodical
Magazine / Source Education and Information Technologies
MU Faculty or unit

Faculty of Informatics

Citation
Web https://link.springer.com/article/10.1007%2Fs10639-021-10564-6
Doi http://dx.doi.org/10.1007/s10639-021-10564-6
Keywords Learning analytics; Mining software repositories; Software development; Process mining; Educational data mining; Git
Description Understanding the processes in education, such as the student learning behavior within a specific course, is a key to continuous course improvement. In online learning systems, students’ learning can be tracked and examined based on data collected by the systems themselves. However, it is non-trivial to decide how to extract the desired students’ behavior from the limited data in traditional classroom courses. Software development courses are a domain where student behavior analysis would be especially useful, as continuous teaching improvement in this fast progressing domain is necessary. In this paper, we propose to use process mining for improvement-motivated process analysis of a software development course (web development in particular). To this end, we analyze Git logs of students’ projects to understand their development processes. Process mining has been chosen as it can help us to find a descriptive model of this process. The main contribution of this paper is the detailed methodology of process mining usage for students’ project development analysis, considering various commit characteristics, which are crucial in understanding student coding-behavior patterns. The process mining analysis proved to be very useful, indicating multiple directions for the course improvement, which we also include in this work as a secondary contribution. The third contribution of this work is the summary and discussion of the process mining advantages and current gaps in process mining research for this task. The data we used are made publicly available to other researchers.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.