给戴夫还是给博士?利用LMS活动和学术档案预测高校IT学生在软件团队中的突出作用

2022 26th International Computer Science and Engineering Conference (ICSEC) Pub Date : 2022-12-21 DOI:10.1109/ICSEC56337.2022.10049348

Varit Rungbanapan, Tipajin Thaipisutikul, Siripen Pongpaichet, A. Supratak, Chih-Yang Lin, Suppawong Tuarob

{"title":"给戴夫还是给博士?利用LMS活动和学术档案预测高校IT学生在软件团队中的突出作用","authors":"Varit Rungbanapan, Tipajin Thaipisutikul, Siripen Pongpaichet, A. Supratak, Chih-Yang Lin, Suppawong Tuarob","doi":"10.1109/ICSEC56337.2022.10049348","DOIUrl":null,"url":null,"abstract":"A software team comprises software practitioners with diverse backgrounds and responsibilities, such as programmers, reviewers, testers, and documentation experts. Whether developing the architecture, implementing new features, creating test cases, or providing documentation for users and the development team, each of these jobs is essential to the accomplishment of software tasks. Current methods for determining a student’s software development skill include sending questionnaires and monitoring students while they work. Not only are these techniques restricted in coverage, but they also rely on intervention strategies, which may result in social desirability bias and student exhaustion. In this research, we offer a multivariate time-series classification strategy for automatically identifying students’ expertise in software development based on information passively accessible via LMS logs and course grades. Several machine learning and deep learning models, including XGBoost, Random Forest, SVM, Stochastic Gradient Descent, Multi-layer Perceptron, Gaussian Naive Baye, Complement Naive Bayes, Long Short-Term Memory (LSTM), and XceptionTime, are examined for their ability to model students’ LMS activities and academic performance at various degrees of granularity, namely semester and daily levels. A case study of 33 IT-majoring college students is utilized to validate the effectiveness of the proposed strategy. The experimental findings demonstrate that our best models yield F1 values of 79.52% and 75.68% for the developer and documenter identification tasks, utilizing Multilayer Perceptron with daily features and LSTM with semester features, respectively. We are the first to attempt to determine the roles of students in software development using passively accessible data. The findings not only shed light on the ability to create personalized education tailored to each student’s needs but also pave the way for numerous intelligent education technology applications that aim to automatically evaluate certain student characteristics in order to optimize student learning outcomes.","PeriodicalId":430850,"journal":{"name":"2022 26th International Computer Science and Engineering Conference (ICSEC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"To Dev or to Doc?: Predicting College IT Students’ Prominent Functions in Software Teams Using LMS Activities and Academic Profiles\",\"authors\":\"Varit Rungbanapan, Tipajin Thaipisutikul, Siripen Pongpaichet, A. Supratak, Chih-Yang Lin, Suppawong Tuarob\",\"doi\":\"10.1109/ICSEC56337.2022.10049348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A software team comprises software practitioners with diverse backgrounds and responsibilities, such as programmers, reviewers, testers, and documentation experts. Whether developing the architecture, implementing new features, creating test cases, or providing documentation for users and the development team, each of these jobs is essential to the accomplishment of software tasks. Current methods for determining a student’s software development skill include sending questionnaires and monitoring students while they work. Not only are these techniques restricted in coverage, but they also rely on intervention strategies, which may result in social desirability bias and student exhaustion. In this research, we offer a multivariate time-series classification strategy for automatically identifying students’ expertise in software development based on information passively accessible via LMS logs and course grades. Several machine learning and deep learning models, including XGBoost, Random Forest, SVM, Stochastic Gradient Descent, Multi-layer Perceptron, Gaussian Naive Baye, Complement Naive Bayes, Long Short-Term Memory (LSTM), and XceptionTime, are examined for their ability to model students’ LMS activities and academic performance at various degrees of granularity, namely semester and daily levels. A case study of 33 IT-majoring college students is utilized to validate the effectiveness of the proposed strategy. The experimental findings demonstrate that our best models yield F1 values of 79.52% and 75.68% for the developer and documenter identification tasks, utilizing Multilayer Perceptron with daily features and LSTM with semester features, respectively. We are the first to attempt to determine the roles of students in software development using passively accessible data. The findings not only shed light on the ability to create personalized education tailored to each student’s needs but also pave the way for numerous intelligent education technology applications that aim to automatically evaluate certain student characteristics in order to optimize student learning outcomes.\",\"PeriodicalId\":430850,\"journal\":{\"name\":\"2022 26th International Computer Science and Engineering Conference (ICSEC)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 26th International Computer Science and Engineering Conference (ICSEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSEC56337.2022.10049348\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Computer Science and Engineering Conference (ICSEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEC56337.2022.10049348","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

软件团队由具有不同背景和职责的软件从业者组成，例如程序员、审阅者、测试人员和文档专家。无论是开发体系结构、实现新特性、创建测试用例，还是为用户和开发团队提供文档，这些工作中的每一个都是完成软件任务所必需的。当前确定学生软件开发技能的方法包括发送调查问卷和在学生工作时监视他们。这些技术不仅在覆盖范围上受到限制，而且还依赖于干预策略，这可能导致社会可取性偏见和学生疲惫。在这项研究中，我们提供了一个多变量时间序列分类策略，用于自动识别学生在软件开发方面的专业知识，该策略基于通过LMS日志和课程成绩被动访问的信息。几个机器学习和深度学习模型，包括XGBoost，随机森林，支持向量机，随机梯度下降，多层感知器，高斯朴素贝叶斯，补充朴素贝叶斯，长短期记忆(LSTM)和XceptionTime，测试了它们在不同粒度程度(即学期和日常级别)上模拟学生LMS活动和学习成绩的能力。通过对33名信息技术专业大学生的个案研究，验证了该策略的有效性。实验结果表明，对于开发者和文档人员的识别任务，我们的最佳模型分别使用具有日常特征的多层感知器和具有学期特征的LSTM, F1值分别为79.52%和75.68%。我们是第一个尝试使用被动可访问数据来确定学生在软件开发中的角色的人。这些发现不仅阐明了根据每个学生的需求量身定制个性化教育的能力，而且为许多智能教育技术应用铺平了道路，这些应用旨在自动评估学生的某些特征，以优化学生的学习成果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

To Dev or to Doc?: Predicting College IT Students’ Prominent Functions in Software Teams Using LMS Activities and Academic Profiles

A software team comprises software practitioners with diverse backgrounds and responsibilities, such as programmers, reviewers, testers, and documentation experts. Whether developing the architecture, implementing new features, creating test cases, or providing documentation for users and the development team, each of these jobs is essential to the accomplishment of software tasks. Current methods for determining a student’s software development skill include sending questionnaires and monitoring students while they work. Not only are these techniques restricted in coverage, but they also rely on intervention strategies, which may result in social desirability bias and student exhaustion. In this research, we offer a multivariate time-series classification strategy for automatically identifying students’ expertise in software development based on information passively accessible via LMS logs and course grades. Several machine learning and deep learning models, including XGBoost, Random Forest, SVM, Stochastic Gradient Descent, Multi-layer Perceptron, Gaussian Naive Baye, Complement Naive Bayes, Long Short-Term Memory (LSTM), and XceptionTime, are examined for their ability to model students’ LMS activities and academic performance at various degrees of granularity, namely semester and daily levels. A case study of 33 IT-majoring college students is utilized to validate the effectiveness of the proposed strategy. The experimental findings demonstrate that our best models yield F1 values of 79.52% and 75.68% for the developer and documenter identification tasks, utilizing Multilayer Perceptron with daily features and LSTM with semester features, respectively. We are the first to attempt to determine the roles of students in software development using passively accessible data. The findings not only shed light on the ability to create personalized education tailored to each student’s needs but also pave the way for numerous intelligent education technology applications that aim to automatically evaluate certain student characteristics in order to optimize student learning outcomes.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 26th International Computer Science and Engineering Conference (ICSEC)

自引率

0.00%

发文量