{"title":"Web Information Retrieval using Vector Space Model and Docu-Tally Metric","authors":"Nitish Chaturvedi, Eshanika Ray, K. Meenakshi","doi":"10.1109/ICCCI56745.2023.10128258","DOIUrl":null,"url":null,"abstract":"Regardless of the type of data set, it is frequently challenging to sift through the vast amount of data that is available on the Internet as a result of technological improvements. To deal with challenges mentioned above, we have come up with a ranking approach which is computed using NLP and vector space model. The approaches used for information retrieval start with a basic machine learning model and progress to multi-stage architectures and frameworks like language modelling and term matching. The main goal of this work is to use a standard retrieval process to glean insights from large amounts of data, which is the problem we are aiming to solve. The method utilised in the study is latent semantic analysis, which takes advantage of the semantic aspects at play and can be used to glean insights from lengthy texts.","PeriodicalId":205683,"journal":{"name":"2023 International Conference on Computer Communication and Informatics (ICCCI)","volume":"148 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Communication and Informatics (ICCCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCI56745.2023.10128258","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Regardless of the type of data set, it is frequently challenging to sift through the vast amount of data that is available on the Internet as a result of technological improvements. To deal with challenges mentioned above, we have come up with a ranking approach which is computed using NLP and vector space model. The approaches used for information retrieval start with a basic machine learning model and progress to multi-stage architectures and frameworks like language modelling and term matching. The main goal of this work is to use a standard retrieval process to glean insights from large amounts of data, which is the problem we are aiming to solve. The method utilised in the study is latent semantic analysis, which takes advantage of the semantic aspects at play and can be used to glean insights from lengthy texts.