{"title":"一种基于Web文本中提及到相关实体的实体性能评估方法","authors":"V. Sampaio, Renato Fileto, D. D. J. D. Macedo","doi":"10.1145/3366030.3366079","DOIUrl":null,"url":null,"abstract":"Publications on the Web can influence the public opinion about certain entities (e.g., politicians, institutions). At the same time, a variety of indicators can be extracted from these publications and used to estimate entity performance (e.g., popularity, votes share). This work proposes an automatic method that employs state-of-the-art natural language processing tools to extract indicators about entities mentioned in texts, for estimating the performance of these entities or semantically related ones. Our method calculates performance metrics from performance indicators consolidated for semantically related entities, assess correlations of these consolidated metrics with ground true performance, and uses these metrics to predict certain fluctuations in entity performance. Experimental results in a case study on politics show that consolidated metrics for several interrelated entities are better correlated to observed real performance measures of some target entities and lead to better predictions, than metrics for just one entity.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Method to Estimate Entity Performance from Mentions to Related Entities in Texts on the Web\",\"authors\":\"V. Sampaio, Renato Fileto, D. D. J. D. Macedo\",\"doi\":\"10.1145/3366030.3366079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Publications on the Web can influence the public opinion about certain entities (e.g., politicians, institutions). At the same time, a variety of indicators can be extracted from these publications and used to estimate entity performance (e.g., popularity, votes share). This work proposes an automatic method that employs state-of-the-art natural language processing tools to extract indicators about entities mentioned in texts, for estimating the performance of these entities or semantically related ones. Our method calculates performance metrics from performance indicators consolidated for semantically related entities, assess correlations of these consolidated metrics with ground true performance, and uses these metrics to predict certain fluctuations in entity performance. Experimental results in a case study on politics show that consolidated metrics for several interrelated entities are better correlated to observed real performance measures of some target entities and lead to better predictions, than metrics for just one entity.\",\"PeriodicalId\":446280,\"journal\":{\"name\":\"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3366030.3366079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3366030.3366079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Method to Estimate Entity Performance from Mentions to Related Entities in Texts on the Web
Publications on the Web can influence the public opinion about certain entities (e.g., politicians, institutions). At the same time, a variety of indicators can be extracted from these publications and used to estimate entity performance (e.g., popularity, votes share). This work proposes an automatic method that employs state-of-the-art natural language processing tools to extract indicators about entities mentioned in texts, for estimating the performance of these entities or semantically related ones. Our method calculates performance metrics from performance indicators consolidated for semantically related entities, assess correlations of these consolidated metrics with ground true performance, and uses these metrics to predict certain fluctuations in entity performance. Experimental results in a case study on politics show that consolidated metrics for several interrelated entities are better correlated to observed real performance measures of some target entities and lead to better predictions, than metrics for just one entity.