Ivan L. Acosta-Guzmán, Eleanor Varela-Tapia, Alexandra E. Piza-Guale, Nory X. Acosta-Guzmán, Christopher I. Acosta Varela
{"title":"为萨斯-Cov-2 病毒感染者文本对话内容评估确定适当 NLP 技术的初步进展","authors":"Ivan L. Acosta-Guzmán, Eleanor Varela-Tapia, Alexandra E. Piza-Guale, Nory X. Acosta-Guzmán, Christopher I. Acosta Varela","doi":"10.53591/easi.v2i3.2488","DOIUrl":null,"url":null,"abstract":"When Covid-19 became a pandemic on March 2020, an urgent need arose for reliable info and advice, so Virtual Assistants were created to help teach the public how to avoid the Alpha variant. But when new variants like Beta, Delta, and Omicron appeared with different symptoms, they caused new waves of infections and deaths. To tackle this, a Natural Language Processing prototype was created to analyze experiences of 4422 people, who had been infected in Ecuador, and to detect which symptoms were most common in their conversations. For this purpose, Python language was used, Google Collab platform, and several combinations of text processing techniques with various classifiers were tested. Finally, the results were measured using quality metrics, accuracy, precision, Recall, F1, to identify the most appropriate model, finding that the combination of Stop Word, Tokenization, stemming techniques together with the LSTM classifier reached high effectiveness among the options tested for a classifier model with multi-label output.","PeriodicalId":191327,"journal":{"name":"EASI: Ingeniería y Ciencias Aplicadas en la Industria","volume":"17 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Initial Progress of Identification of the Appropriate NLP Technique for Content Evaluation in Textual Conversations of People Infected by Sars-Cov-2\",\"authors\":\"Ivan L. Acosta-Guzmán, Eleanor Varela-Tapia, Alexandra E. Piza-Guale, Nory X. Acosta-Guzmán, Christopher I. Acosta Varela\",\"doi\":\"10.53591/easi.v2i3.2488\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When Covid-19 became a pandemic on March 2020, an urgent need arose for reliable info and advice, so Virtual Assistants were created to help teach the public how to avoid the Alpha variant. But when new variants like Beta, Delta, and Omicron appeared with different symptoms, they caused new waves of infections and deaths. To tackle this, a Natural Language Processing prototype was created to analyze experiences of 4422 people, who had been infected in Ecuador, and to detect which symptoms were most common in their conversations. For this purpose, Python language was used, Google Collab platform, and several combinations of text processing techniques with various classifiers were tested. Finally, the results were measured using quality metrics, accuracy, precision, Recall, F1, to identify the most appropriate model, finding that the combination of Stop Word, Tokenization, stemming techniques together with the LSTM classifier reached high effectiveness among the options tested for a classifier model with multi-label output.\",\"PeriodicalId\":191327,\"journal\":{\"name\":\"EASI: Ingeniería y Ciencias Aplicadas en la Industria\",\"volume\":\"17 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EASI: Ingeniería y Ciencias Aplicadas en la Industria\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.53591/easi.v2i3.2488\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EASI: Ingeniería y Ciencias Aplicadas en la Industria","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53591/easi.v2i3.2488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Initial Progress of Identification of the Appropriate NLP Technique for Content Evaluation in Textual Conversations of People Infected by Sars-Cov-2
When Covid-19 became a pandemic on March 2020, an urgent need arose for reliable info and advice, so Virtual Assistants were created to help teach the public how to avoid the Alpha variant. But when new variants like Beta, Delta, and Omicron appeared with different symptoms, they caused new waves of infections and deaths. To tackle this, a Natural Language Processing prototype was created to analyze experiences of 4422 people, who had been infected in Ecuador, and to detect which symptoms were most common in their conversations. For this purpose, Python language was used, Google Collab platform, and several combinations of text processing techniques with various classifiers were tested. Finally, the results were measured using quality metrics, accuracy, precision, Recall, F1, to identify the most appropriate model, finding that the combination of Stop Word, Tokenization, stemming techniques together with the LSTM classifier reached high effectiveness among the options tested for a classifier model with multi-label output.