{"title":"基于关键词相关性的文章检索与图像标注","authors":"Shu-Chen Cheng, Chun Lu","doi":"10.1109/ICMLC48188.2019.8949205","DOIUrl":null,"url":null,"abstract":"When users input keywords into the search engine, a massive search results will be retrieved. However, it becomes difficult for the users to learn as it is unreadable with the excessive amount of results. This study establishes an information retrieval system for computer science related articles. It firstly collects articles by running a web crawler, and uses TF-IDF (Term Frequency-Inverse Document Frequency) method to extract keywords to acquire the focus of the article. And with the use of association rules and cosine similarity, the articles are classified by their relevance. Finally, according to users' feedbacks, the system provides appropriate resources to improve the motivation and willingness to learn. In addition, the pictures in the articles are also a basis for analyzing the articles. This study uses image semantic analysis to label the pictures so as to improve the accuracy in analyzing the articles.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Retrieving Articles and Image Labeling Based on Relevance of Keywords\",\"authors\":\"Shu-Chen Cheng, Chun Lu\",\"doi\":\"10.1109/ICMLC48188.2019.8949205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When users input keywords into the search engine, a massive search results will be retrieved. However, it becomes difficult for the users to learn as it is unreadable with the excessive amount of results. This study establishes an information retrieval system for computer science related articles. It firstly collects articles by running a web crawler, and uses TF-IDF (Term Frequency-Inverse Document Frequency) method to extract keywords to acquire the focus of the article. And with the use of association rules and cosine similarity, the articles are classified by their relevance. Finally, according to users' feedbacks, the system provides appropriate resources to improve the motivation and willingness to learn. In addition, the pictures in the articles are also a basis for analyzing the articles. This study uses image semantic analysis to label the pictures so as to improve the accuracy in analyzing the articles.\",\"PeriodicalId\":221349,\"journal\":{\"name\":\"2019 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC48188.2019.8949205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC48188.2019.8949205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Retrieving Articles and Image Labeling Based on Relevance of Keywords
When users input keywords into the search engine, a massive search results will be retrieved. However, it becomes difficult for the users to learn as it is unreadable with the excessive amount of results. This study establishes an information retrieval system for computer science related articles. It firstly collects articles by running a web crawler, and uses TF-IDF (Term Frequency-Inverse Document Frequency) method to extract keywords to acquire the focus of the article. And with the use of association rules and cosine similarity, the articles are classified by their relevance. Finally, according to users' feedbacks, the system provides appropriate resources to improve the motivation and willingness to learn. In addition, the pictures in the articles are also a basis for analyzing the articles. This study uses image semantic analysis to label the pictures so as to improve the accuracy in analyzing the articles.