{"title":"Retrieving Articles and Image Labeling Based on Relevance of Keywords","authors":"Shu-Chen Cheng, Chun Lu","doi":"10.1109/ICMLC48188.2019.8949205","DOIUrl":null,"url":null,"abstract":"When users input keywords into the search engine, a massive search results will be retrieved. However, it becomes difficult for the users to learn as it is unreadable with the excessive amount of results. This study establishes an information retrieval system for computer science related articles. It firstly collects articles by running a web crawler, and uses TF-IDF (Term Frequency-Inverse Document Frequency) method to extract keywords to acquire the focus of the article. And with the use of association rules and cosine similarity, the articles are classified by their relevance. Finally, according to users' feedbacks, the system provides appropriate resources to improve the motivation and willingness to learn. In addition, the pictures in the articles are also a basis for analyzing the articles. This study uses image semantic analysis to label the pictures so as to improve the accuracy in analyzing the articles.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC48188.2019.8949205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
When users input keywords into the search engine, a massive search results will be retrieved. However, it becomes difficult for the users to learn as it is unreadable with the excessive amount of results. This study establishes an information retrieval system for computer science related articles. It firstly collects articles by running a web crawler, and uses TF-IDF (Term Frequency-Inverse Document Frequency) method to extract keywords to acquire the focus of the article. And with the use of association rules and cosine similarity, the articles are classified by their relevance. Finally, according to users' feedbacks, the system provides appropriate resources to improve the motivation and willingness to learn. In addition, the pictures in the articles are also a basis for analyzing the articles. This study uses image semantic analysis to label the pictures so as to improve the accuracy in analyzing the articles.