{"title":"Machine Learning and Neural Networks Tools to Address Noisy Data Issues","authors":"Maria Teresa Artese, I. Gagliardi","doi":"10.55630/dipp.2021.11.8","DOIUrl":null,"url":null,"abstract":"In this paper, we present tools for addressing noisy keyword issues in digital libraries. Two tasks, language detection and misspelling detection and correction, are addressed using both machine learning and deep learning techniques.\nTo train and validate the models, different datasets were used/created/scraped.\nEncouraging preliminary results are presented and discussed.","PeriodicalId":268414,"journal":{"name":"Digital Presentation and Preservation of Cultural and Scientific Heritage","volume":"395 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Presentation and Preservation of Cultural and Scientific Heritage","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55630/dipp.2021.11.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we present tools for addressing noisy keyword issues in digital libraries. Two tasks, language detection and misspelling detection and correction, are addressed using both machine learning and deep learning techniques.
To train and validate the models, different datasets were used/created/scraped.
Encouraging preliminary results are presented and discussed.