{"title":"Research on the CRF-based Sequence Labeling Algorithm Used in Reference Resolution of Mathematical Word Problem Understanding","authors":"Qingtang Liu, Xinqian Ma, Peng Zhou, Linjing Wu, Shuang Yu, Xueyan Yang","doi":"10.1145/3498765.3498769","DOIUrl":null,"url":null,"abstract":"Resolving the reference phenomenon in application problems is a key step to realize the understanding of mathematical problems. Compared with the Chinese corpus in the general field, this study analyzed the characteristics of reference resolution in mathematical stratified sampling word problems, and explored key factors by combining the C4.5 decision tree algorithm. On this basis, the study proposed a CRF-based sequence labeling algorithm, which was exploited to identify the to-be-solved items of the stratified sampling problem and resolved them. The experimental data is the stratified sampling word problems collected from the math problems in the Chinese college entrance examination and the test problems in the textbook from 2012 to 2020. The results show that the F-values for the antecedents and anaphors of the mathematical stratified sampling word problems based on CRF sequence labeling can reach 81.29% and 90.69%, respectively, and the F-values for reference resolution can reach 84.84%, which is higher than the traditional Bayesian method.","PeriodicalId":273698,"journal":{"name":"Proceedings of the 13th International Conference on Education Technology and Computers","volume":"161 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th International Conference on Education Technology and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3498765.3498769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Resolving the reference phenomenon in application problems is a key step to realize the understanding of mathematical problems. Compared with the Chinese corpus in the general field, this study analyzed the characteristics of reference resolution in mathematical stratified sampling word problems, and explored key factors by combining the C4.5 decision tree algorithm. On this basis, the study proposed a CRF-based sequence labeling algorithm, which was exploited to identify the to-be-solved items of the stratified sampling problem and resolved them. The experimental data is the stratified sampling word problems collected from the math problems in the Chinese college entrance examination and the test problems in the textbook from 2012 to 2020. The results show that the F-values for the antecedents and anaphors of the mathematical stratified sampling word problems based on CRF sequence labeling can reach 81.29% and 90.69%, respectively, and the F-values for reference resolution can reach 84.84%, which is higher than the traditional Bayesian method.