Jingjing Cao, Wang Ye, Wang Ying, Xiaoxia Zhang, Zhang Xue
{"title":"Research on talent data analysis method based on Text Mining","authors":"Jingjing Cao, Wang Ye, Wang Ying, Xiaoxia Zhang, Zhang Xue","doi":"10.25236/AJCIS.2021.040410","DOIUrl":null,"url":null,"abstract":"In order to solve the problem of low clustering contour coefficient caused by inaccurate keyword extraction of talent data information, a talent data analysis method based on text mining is proposed. Through the preprocessing of word segmentation and stop words, the text set of talent data is established, and the keyword graph is constructed by text mining, and the information keywords are obtained according to the weight iteration results. The keywords in this paper are transformed into the form of multi-dimensional vector, and the similarity is calculated to get the results of text analysis. The experimental results show that the contour coefficient of the proposed method is 0.736, which is 0.267 and 0.221 higher than that of the K-means and single pass methods, respectively. The design method of this paper has a reasonable clustering performance, which is suitable for talent big data analysis.","PeriodicalId":387664,"journal":{"name":"Academic Journal of Computing & Information Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Academic Journal of Computing & Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25236/AJCIS.2021.040410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In order to solve the problem of low clustering contour coefficient caused by inaccurate keyword extraction of talent data information, a talent data analysis method based on text mining is proposed. Through the preprocessing of word segmentation and stop words, the text set of talent data is established, and the keyword graph is constructed by text mining, and the information keywords are obtained according to the weight iteration results. The keywords in this paper are transformed into the form of multi-dimensional vector, and the similarity is calculated to get the results of text analysis. The experimental results show that the contour coefficient of the proposed method is 0.736, which is 0.267 and 0.221 higher than that of the K-means and single pass methods, respectively. The design method of this paper has a reasonable clustering performance, which is suitable for talent big data analysis.