{"title":"利用最优引文频次进行重要引文识别","authors":"Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad","doi":"10.1109/ICEET48479.2020.9048224","DOIUrl":null,"url":null,"abstract":"Research is always based on previously done work. To acknowledge the worthy work of the predecessors of the field, researchers do citations. Citations are factors that are used for measuring the impact factor of journals, to rank the researchers, to find out latest research topics, for allocating research grants etc. In current epoch the research community has turned their focus towards citations and is of the view that all citations are not equally important. To find out important citations, researchers used different approaches such as context based, cue word based, metadata based, frequency based, textual based etc. Among proposed methodologies, frequency based approach was extensively used. The citation with high frequency was considered as important, but it is yet unclear that what should be the frequency cut off value of citation for considering it important. This research explored the significance of applying Threshold value over Frequency count for binary classification. We identified optimal threshold value of frequency count and further applied this to classify the citations into important and non-important ones. To evaluate the proposed approach a benchmark data set annotated by two domain experts was used that consisted of 465 citation pairs. The results were compared with state of the art precision value of 0.72. While the experiment showed increased value of 0.75 in terms of precision","PeriodicalId":144846,"journal":{"name":"2020 International Conference on Engineering and Emerging Technologies (ICEET)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Important Citation Identification by Exploiting the Optimal In-text Citation Frequency\",\"authors\":\"Shahzad Nazir, Muhammad Asif, Shahbaz Ahmad\",\"doi\":\"10.1109/ICEET48479.2020.9048224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Research is always based on previously done work. To acknowledge the worthy work of the predecessors of the field, researchers do citations. Citations are factors that are used for measuring the impact factor of journals, to rank the researchers, to find out latest research topics, for allocating research grants etc. In current epoch the research community has turned their focus towards citations and is of the view that all citations are not equally important. To find out important citations, researchers used different approaches such as context based, cue word based, metadata based, frequency based, textual based etc. Among proposed methodologies, frequency based approach was extensively used. The citation with high frequency was considered as important, but it is yet unclear that what should be the frequency cut off value of citation for considering it important. This research explored the significance of applying Threshold value over Frequency count for binary classification. We identified optimal threshold value of frequency count and further applied this to classify the citations into important and non-important ones. To evaluate the proposed approach a benchmark data set annotated by two domain experts was used that consisted of 465 citation pairs. The results were compared with state of the art precision value of 0.72. While the experiment showed increased value of 0.75 in terms of precision\",\"PeriodicalId\":144846,\"journal\":{\"name\":\"2020 International Conference on Engineering and Emerging Technologies (ICEET)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Engineering and Emerging Technologies (ICEET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEET48479.2020.9048224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Engineering and Emerging Technologies (ICEET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEET48479.2020.9048224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Important Citation Identification by Exploiting the Optimal In-text Citation Frequency
Research is always based on previously done work. To acknowledge the worthy work of the predecessors of the field, researchers do citations. Citations are factors that are used for measuring the impact factor of journals, to rank the researchers, to find out latest research topics, for allocating research grants etc. In current epoch the research community has turned their focus towards citations and is of the view that all citations are not equally important. To find out important citations, researchers used different approaches such as context based, cue word based, metadata based, frequency based, textual based etc. Among proposed methodologies, frequency based approach was extensively used. The citation with high frequency was considered as important, but it is yet unclear that what should be the frequency cut off value of citation for considering it important. This research explored the significance of applying Threshold value over Frequency count for binary classification. We identified optimal threshold value of frequency count and further applied this to classify the citations into important and non-important ones. To evaluate the proposed approach a benchmark data set annotated by two domain experts was used that consisted of 465 citation pairs. The results were compared with state of the art precision value of 0.72. While the experiment showed increased value of 0.75 in terms of precision