Hima Bindu Ankem Venkata, Andrea Calazacon, Taha M. Mahmoud, T. Hanne
{"title":"基于网络抓取和自然语言处理的技术推荐系统","authors":"Hima Bindu Ankem Venkata, Andrea Calazacon, Taha M. Mahmoud, T. Hanne","doi":"10.1109/AIC55036.2022.9848970","DOIUrl":null,"url":null,"abstract":"The goal of this study is to develop a prototype for a recommendation system that could assist individuals and organizations without in-depth knowledge of technology products and services to choose an appropriate tool/service that best suits their needs. Recommendation systems, as they are popularly known, are personalized information filtering systems which are usually integrated into various consumer and commercial applications. These personalized systems play a vital role, especially when the user is unsure of what to search for. Our work focuses in particular on web-based recommendation systems for end-users in academia and industries who need recommendations for their software tools and services. This paper focuses on extracting information from the web using Apache Nutch, an open-source web crawler which extracts data from websites widely used for software recommendations. The information extracted is indexed in Elasticsearch, whose Natural Language Processing (NLP) and text mining capabilities are applied to provide appropriate recommendations to the end-users. Kibana dashboards and visualizations are used to visualize the recommendations in a format that is conducive for the end-users.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Technology Recommender System Based on Web Crawling and Natural Language Processing\",\"authors\":\"Hima Bindu Ankem Venkata, Andrea Calazacon, Taha M. Mahmoud, T. Hanne\",\"doi\":\"10.1109/AIC55036.2022.9848970\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The goal of this study is to develop a prototype for a recommendation system that could assist individuals and organizations without in-depth knowledge of technology products and services to choose an appropriate tool/service that best suits their needs. Recommendation systems, as they are popularly known, are personalized information filtering systems which are usually integrated into various consumer and commercial applications. These personalized systems play a vital role, especially when the user is unsure of what to search for. Our work focuses in particular on web-based recommendation systems for end-users in academia and industries who need recommendations for their software tools and services. This paper focuses on extracting information from the web using Apache Nutch, an open-source web crawler which extracts data from websites widely used for software recommendations. The information extracted is indexed in Elasticsearch, whose Natural Language Processing (NLP) and text mining capabilities are applied to provide appropriate recommendations to the end-users. Kibana dashboards and visualizations are used to visualize the recommendations in a format that is conducive for the end-users.\",\"PeriodicalId\":433590,\"journal\":{\"name\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIC55036.2022.9848970\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Technology Recommender System Based on Web Crawling and Natural Language Processing
The goal of this study is to develop a prototype for a recommendation system that could assist individuals and organizations without in-depth knowledge of technology products and services to choose an appropriate tool/service that best suits their needs. Recommendation systems, as they are popularly known, are personalized information filtering systems which are usually integrated into various consumer and commercial applications. These personalized systems play a vital role, especially when the user is unsure of what to search for. Our work focuses in particular on web-based recommendation systems for end-users in academia and industries who need recommendations for their software tools and services. This paper focuses on extracting information from the web using Apache Nutch, an open-source web crawler which extracts data from websites widely used for software recommendations. The information extracted is indexed in Elasticsearch, whose Natural Language Processing (NLP) and text mining capabilities are applied to provide appropriate recommendations to the end-users. Kibana dashboards and visualizations are used to visualize the recommendations in a format that is conducive for the end-users.