Hima Bindu Ankem Venkata, Andrea Calazacon, Taha M. Mahmoud, T. Hanne
{"title":"A Technology Recommender System Based on Web Crawling and Natural Language Processing","authors":"Hima Bindu Ankem Venkata, Andrea Calazacon, Taha M. Mahmoud, T. Hanne","doi":"10.1109/AIC55036.2022.9848970","DOIUrl":null,"url":null,"abstract":"The goal of this study is to develop a prototype for a recommendation system that could assist individuals and organizations without in-depth knowledge of technology products and services to choose an appropriate tool/service that best suits their needs. Recommendation systems, as they are popularly known, are personalized information filtering systems which are usually integrated into various consumer and commercial applications. These personalized systems play a vital role, especially when the user is unsure of what to search for. Our work focuses in particular on web-based recommendation systems for end-users in academia and industries who need recommendations for their software tools and services. This paper focuses on extracting information from the web using Apache Nutch, an open-source web crawler which extracts data from websites widely used for software recommendations. The information extracted is indexed in Elasticsearch, whose Natural Language Processing (NLP) and text mining capabilities are applied to provide appropriate recommendations to the end-users. Kibana dashboards and visualizations are used to visualize the recommendations in a format that is conducive for the end-users.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The goal of this study is to develop a prototype for a recommendation system that could assist individuals and organizations without in-depth knowledge of technology products and services to choose an appropriate tool/service that best suits their needs. Recommendation systems, as they are popularly known, are personalized information filtering systems which are usually integrated into various consumer and commercial applications. These personalized systems play a vital role, especially when the user is unsure of what to search for. Our work focuses in particular on web-based recommendation systems for end-users in academia and industries who need recommendations for their software tools and services. This paper focuses on extracting information from the web using Apache Nutch, an open-source web crawler which extracts data from websites widely used for software recommendations. The information extracted is indexed in Elasticsearch, whose Natural Language Processing (NLP) and text mining capabilities are applied to provide appropriate recommendations to the end-users. Kibana dashboards and visualizations are used to visualize the recommendations in a format that is conducive for the end-users.