{"title":"文本特征对信息有用性的影响分析——以旅游文本为例","authors":"Wenhua Jiang, Ruo-yu Song","doi":"10.22457/jmhr.v08a052253","DOIUrl":null,"url":null,"abstract":"This paper takes 9260 domestic free tourism strategy data collected from the Mafengwo website as samples, quantifies the travel guide texts by using data mining, and performs word frequency statistics and keyword extraction on the data through Python code and NLPIR platform, then divides the high-weight words into three categories that affect the usefulness of information through hierarchical clustering. Ten hypotheses are proposed in the paper based on previous research. And a negative binomial regression model is built to conduct analysis. The results show that when the number of reads is regarded as a control variable, all ten text features, such as the rate of containing pictures, have a significant positive relationship on information usefulness. Therefore, suggestions are provided to develop influential and high-quality tourism strategies in terms of text features.","PeriodicalId":206239,"journal":{"name":"Journal of Management and Humanity Research","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of the Influence of Text Features on the Usefulness of Information: A Case of Tourism Text\",\"authors\":\"Wenhua Jiang, Ruo-yu Song\",\"doi\":\"10.22457/jmhr.v08a052253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper takes 9260 domestic free tourism strategy data collected from the Mafengwo website as samples, quantifies the travel guide texts by using data mining, and performs word frequency statistics and keyword extraction on the data through Python code and NLPIR platform, then divides the high-weight words into three categories that affect the usefulness of information through hierarchical clustering. Ten hypotheses are proposed in the paper based on previous research. And a negative binomial regression model is built to conduct analysis. The results show that when the number of reads is regarded as a control variable, all ten text features, such as the rate of containing pictures, have a significant positive relationship on information usefulness. Therefore, suggestions are provided to develop influential and high-quality tourism strategies in terms of text features.\",\"PeriodicalId\":206239,\"journal\":{\"name\":\"Journal of Management and Humanity Research\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Management and Humanity Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22457/jmhr.v08a052253\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Management and Humanity Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22457/jmhr.v08a052253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of the Influence of Text Features on the Usefulness of Information: A Case of Tourism Text
This paper takes 9260 domestic free tourism strategy data collected from the Mafengwo website as samples, quantifies the travel guide texts by using data mining, and performs word frequency statistics and keyword extraction on the data through Python code and NLPIR platform, then divides the high-weight words into three categories that affect the usefulness of information through hierarchical clustering. Ten hypotheses are proposed in the paper based on previous research. And a negative binomial regression model is built to conduct analysis. The results show that when the number of reads is regarded as a control variable, all ten text features, such as the rate of containing pictures, have a significant positive relationship on information usefulness. Therefore, suggestions are provided to develop influential and high-quality tourism strategies in terms of text features.