Haihua Ling, Peng Han, Jian-jua Qiu, Li Peng, Dongmei Liu, Kaiqing Luo
{"title":"基于说话人化的智能课堂师生语音分离方法","authors":"Haihua Ling, Peng Han, Jian-jua Qiu, Li Peng, Dongmei Liu, Kaiqing Luo","doi":"10.1145/3498765.3498774","DOIUrl":null,"url":null,"abstract":"The analysis of classroom teaching behavior is of great significance to improve the teaching quality of teachers, but there are few researches on teaching behavior and teaching model analysis based on speech processing. This paper proposes a method of speech separation between teachers and students based on speaker diarization, which can help educators distinguish the teaching model objectively by identifying and analyzing the speech acts of teachers and students in the classroom. Firstly, the speaker segmentation based on Bayesian distance is used to detect and segment the classroom audio. Then, based on the pre-trained speaker model, speaker detection and clustering are performed on multiple segments of speech processed by speaker segmentation. Finally, the speech acts distribution in the classroom is visualized, and the teaching model is distinguished based on the Student-Teacher analysis method. This paper uses classroom audio collected in the classrooms of two elementary schools as the experimental data. The experimental results show that the method proposed in this paper achieves 74.0% and 98.28% accuracy in speaker segmentation and speech recognition, respectively. And this experiment takes a Chinese open class as an example to analyze and discuss its teaching model in detail. The research can help teachers summarize classroom teaching effect in time, and improve the quality of classroom teaching.","PeriodicalId":273698,"journal":{"name":"Proceedings of the 13th International Conference on Education Technology and Computers","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Method of Speech Separation between Teachers and Students in Smart Classrooms Based on Speaker Diarization\",\"authors\":\"Haihua Ling, Peng Han, Jian-jua Qiu, Li Peng, Dongmei Liu, Kaiqing Luo\",\"doi\":\"10.1145/3498765.3498774\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The analysis of classroom teaching behavior is of great significance to improve the teaching quality of teachers, but there are few researches on teaching behavior and teaching model analysis based on speech processing. This paper proposes a method of speech separation between teachers and students based on speaker diarization, which can help educators distinguish the teaching model objectively by identifying and analyzing the speech acts of teachers and students in the classroom. Firstly, the speaker segmentation based on Bayesian distance is used to detect and segment the classroom audio. Then, based on the pre-trained speaker model, speaker detection and clustering are performed on multiple segments of speech processed by speaker segmentation. Finally, the speech acts distribution in the classroom is visualized, and the teaching model is distinguished based on the Student-Teacher analysis method. This paper uses classroom audio collected in the classrooms of two elementary schools as the experimental data. The experimental results show that the method proposed in this paper achieves 74.0% and 98.28% accuracy in speaker segmentation and speech recognition, respectively. And this experiment takes a Chinese open class as an example to analyze and discuss its teaching model in detail. The research can help teachers summarize classroom teaching effect in time, and improve the quality of classroom teaching.\",\"PeriodicalId\":273698,\"journal\":{\"name\":\"Proceedings of the 13th International Conference on Education Technology and Computers\",\"volume\":\"130 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 13th International Conference on Education Technology and Computers\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3498765.3498774\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th International Conference on Education Technology and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3498765.3498774","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Method of Speech Separation between Teachers and Students in Smart Classrooms Based on Speaker Diarization
The analysis of classroom teaching behavior is of great significance to improve the teaching quality of teachers, but there are few researches on teaching behavior and teaching model analysis based on speech processing. This paper proposes a method of speech separation between teachers and students based on speaker diarization, which can help educators distinguish the teaching model objectively by identifying and analyzing the speech acts of teachers and students in the classroom. Firstly, the speaker segmentation based on Bayesian distance is used to detect and segment the classroom audio. Then, based on the pre-trained speaker model, speaker detection and clustering are performed on multiple segments of speech processed by speaker segmentation. Finally, the speech acts distribution in the classroom is visualized, and the teaching model is distinguished based on the Student-Teacher analysis method. This paper uses classroom audio collected in the classrooms of two elementary schools as the experimental data. The experimental results show that the method proposed in this paper achieves 74.0% and 98.28% accuracy in speaker segmentation and speech recognition, respectively. And this experiment takes a Chinese open class as an example to analyze and discuss its teaching model in detail. The research can help teachers summarize classroom teaching effect in time, and improve the quality of classroom teaching.