Julio Cesar Cavalcanti, Ronaldo Rodrigues da Silva, Anders Eriksson, P. Barbosa
{"title":"Exploring the performance of automatic speaker recognition using twin speech and deep learning-based artificial neural networks","authors":"Julio Cesar Cavalcanti, Ronaldo Rodrigues da Silva, Anders Eriksson, P. Barbosa","doi":"10.3389/frai.2024.1287877","DOIUrl":"https://doi.org/10.3389/frai.2024.1287877","url":null,"abstract":"This study assessed the influence of speaker similarity and sample length on the performance of an automatic speaker recognition (ASR) system utilizing the SpeechBrain toolkit. The dataset comprised recordings from 20 male identical twin speakers engaged in spontaneous dialogues and interviews. Performance evaluations involved comparing identical twins, all speakers in the dataset (including twin pairs), and all speakers excluding twin pairs. Speech samples, ranging from 5 to 30 s, underwent assessment based on equal error rates (EER) and Log cost-likelihood ratios (Cllr). Results highlight the substantial challenge posed by identical twins to the ASR system, leading to a decrease in overall speaker recognition accuracy. Furthermore, analyses based on longer speech samples outperformed those using shorter samples. As sample size increased, standard deviation values for both intra and inter-speaker similarity scores decreased, indicating reduced variability in estimating speaker similarity/dissimilarity levels in longer speech stretches compared to shorter ones. The study also uncovered varying degrees of likeness among identical twins, with certain pairs presenting a greater challenge for ASR systems. These outcomes align with prior research and are discussed within the context of relevant literature.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"4 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139851960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sabine Wehnert, Praneeth Chedella, Jonas Asche, Ernesto William De Luca
{"title":"A dynamic approach for visualizing and exploring concept hierarchies from textbooks","authors":"Sabine Wehnert, Praneeth Chedella, Jonas Asche, Ernesto William De Luca","doi":"10.3389/frai.2024.1285026","DOIUrl":"https://doi.org/10.3389/frai.2024.1285026","url":null,"abstract":"In this study, we propose a visualization technique to explore and visualize concept hierarchies generated from a textbook in the legal domain. Through a human-centered design process, we developed a tool that allows users to effectively navigate through and explore complex hierarchical concepts in three kinds of traversal techniques: top-down, middle-out, and bottom-up. Our concept hierarchies offer an overview over a given domain, with increasing level of detail toward the bottom of the hierarchy which is consisting of entities. In the legal use case we considered, the concepts were adapted from section headings in a legal textbook, whereas references to law or legal cases inside the textbook became entities. The design of this tool is refined following various steps such as gathering user needs, pain points of an existing visualization, prototyping, testing, and refining. The resulting interface offers users several key features such as dynamic search and filter, explorable concept nodes, and a preview of leaf nodes at every stage. A high-fidelity prototype was created to test our theory and design. To test our concept, we used the System Usability Scale as a way to measure the prototype's usability, a task-based survey to asses the tool's ability in assisting users in gathering information and interacting with the prototype, and finally mouse tracking to understand user interaction patterns. Along with this, we gathered audio and video footage of users when participating in the study. This footage also helped us in getting feedback when the survey responses required further information. The data collected provided valuable insights to set the directions for extending this study. As a result, we have accounted for varying hierarchy depths, longer text spans than only one to two words in the elements of the hierarchy, searchability, and exploration of the hierarchies. At the same time, we aimed for minimizing visual clutter and cognitive overload. We show that existing approaches are not suitable to visualize the type of data which we support with our visualization.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"49 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139852080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Hammoud, Melaku N. Getahun, Anna Baldycheva, Andrey Somov
{"title":"Machine learning-based infant crying interpretation","authors":"M. Hammoud, Melaku N. Getahun, Anna Baldycheva, Andrey Somov","doi":"10.3389/frai.2024.1337356","DOIUrl":"https://doi.org/10.3389/frai.2024.1337356","url":null,"abstract":"Crying is an inevitable character trait that occurs throughout the growth of infants, under conditions where the caregiver may have difficulty interpreting the underlying cause of the cry. Crying can be treated as an audio signal that carries a message about the infant's state, such as discomfort, hunger, and sickness. The primary infant caregiver requires traditional ways of understanding these feelings. Failing to understand them correctly can cause severe problems. Several methods attempt to solve this problem; however, proper audio feature representation and classifiers are necessary for better results. This study uses time-, frequency-, and time-frequency-domain feature representations to gain in-depth information from the data. The time-domain features include zero-crossing rate (ZCR) and root mean square (RMS), the frequency-domain feature includes the Mel-spectrogram, and the time-frequency-domain feature includes Mel-frequency cepstral coefficients (MFCCs). Moreover, time-series imaging algorithms are applied to transform 20 MFCC features into images using different algorithms: Gramian angular difference fields, Gramian angular summation fields, Markov transition fields, recurrence plots, and RGB GAF. Then, these features are provided to different machine learning classifiers, such as decision tree, random forest, K nearest neighbors, and bagging. The use of MFCCs, ZCR, and RMS as features achieved high performance, outperforming state of the art (SOTA). Optimal parameters are found via the grid search method using 10-fold cross-validation. Our MFCC-based random forest (RF) classifier approach achieved an accuracy of 96.39%, outperforming SOTA, the scalogram-based shuffleNet classifier, which had an accuracy of 95.17%.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"53 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139853145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert
{"title":"A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm","authors":"Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert","doi":"10.3389/frai.2024.1301997","DOIUrl":"https://doi.org/10.3389/frai.2024.1301997","url":null,"abstract":"Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"68 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139856108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert
{"title":"A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm","authors":"Raissa Souza, Emma A. M. Stanley, Milton Camacho, Richard Camicioli, O. Monchi, Zahinoor Ismail, M. Wilms, Nils D. Forkert","doi":"10.3389/frai.2024.1301997","DOIUrl":"https://doi.org/10.3389/frai.2024.1301997","url":null,"abstract":"Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"358 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139796428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nataliya Tkachenko, Simon Frieder, Ryan-Rhys Griffiths, Christoph Nedopil
{"title":"Analyzing global utilization and missed opportunities in debt-for-nature swaps with generative AI","authors":"Nataliya Tkachenko, Simon Frieder, Ryan-Rhys Griffiths, Christoph Nedopil","doi":"10.3389/frai.2024.1167137","DOIUrl":"https://doi.org/10.3389/frai.2024.1167137","url":null,"abstract":"We deploy a prompt-augmented GPT-4 model to distill comprehensive datasets on the global application of debt-for-nature swaps (DNS), a pivotal financial tool for environmental conservation. Our analysis includes 195 nations and identifies 21 countries that have not yet used DNS before as prime candidates for DNS. A significant proportion demonstrates consistent commitments to conservation finance (0.86 accuracy as compared to historical swaps records). Conversely, 35 countries previously active in DNS before 2010 have since been identified as unsuitable. Notably, Argentina, grappling with soaring inflation and a substantial sovereign debt crisis, and Poland, which has achieved economic stability and gained access to alternative EU conservation funds, exemplify the shifting suitability landscape. The study's outcomes illuminate the fragility of DNS as a conservation strategy amid economic and political volatility.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"5 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139805663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nataliya Tkachenko, Simon Frieder, Ryan-Rhys Griffiths, Christoph Nedopil
{"title":"Analyzing global utilization and missed opportunities in debt-for-nature swaps with generative AI","authors":"Nataliya Tkachenko, Simon Frieder, Ryan-Rhys Griffiths, Christoph Nedopil","doi":"10.3389/frai.2024.1167137","DOIUrl":"https://doi.org/10.3389/frai.2024.1167137","url":null,"abstract":"We deploy a prompt-augmented GPT-4 model to distill comprehensive datasets on the global application of debt-for-nature swaps (DNS), a pivotal financial tool for environmental conservation. Our analysis includes 195 nations and identifies 21 countries that have not yet used DNS before as prime candidates for DNS. A significant proportion demonstrates consistent commitments to conservation finance (0.86 accuracy as compared to historical swaps records). Conversely, 35 countries previously active in DNS before 2010 have since been identified as unsuitable. Notably, Argentina, grappling with soaring inflation and a substantial sovereign debt crisis, and Poland, which has achieved economic stability and gained access to alternative EU conservation funds, exemplify the shifting suitability landscape. The study's outcomes illuminate the fragility of DNS as a conservation strategy amid economic and political volatility.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"51 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139865708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Editorial: Financial intermediation versus disintermediation: opportunities and challenges in the FinTech era, volume II","authors":"Anna Omarini","doi":"10.3389/frai.2024.1326358","DOIUrl":"https://doi.org/10.3389/frai.2024.1326358","url":null,"abstract":"","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"22 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139525266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Artificial intelligence in biology and medicine, and radioprotection research: perspectives from Jerusalem","authors":"Y. Socol, Ariella Richardson, Imene Garali-Zineddine, Stephane Grison, Guillaume Vares, Dmitry Klokov","doi":"10.3389/frai.2023.1291136","DOIUrl":"https://doi.org/10.3389/frai.2023.1291136","url":null,"abstract":"While AI is widely used in biomedical research and medical practice, its use is constrained to few specific practical areas, e.g., radiomics. Participants of the workshop on “Artificial Intelligence in Biology and Medicine” (Jerusalem, Feb 14–15, 2023), both researchers and practitioners, aimed to build a holistic picture by exploring AI advancements, challenges and perspectives, as well as to suggest new fields for AI applications. Presentations showcased the potential of large language models (LLMs) in generating molecular structures, predicting protein-ligand interactions, and promoting democratization of AI development. Ethical concerns in medical decision making were also addressed. In biological applications, AI integration of multi-omics and clinical data elucidated the health relevant effects of low doses of ionizing radiation. Bayesian latent modeling identified statistical associations between unobserved variables. Medical applications highlighted liquid biopsy methods for non-invasive diagnostics, routine laboratory tests to identify overlooked illnesses, and AI's role in oral and maxillofacial imaging. Explainable AI and diverse image processing tools improved diagnostics, while text classification detected anorexic behavior in blog posts. The workshop fostered knowledge sharing, discussions, and emphasized the need for further AI development in radioprotection research in support of emerging public health issues. The organizers plan to continue the initiative as an annual event, promoting collaboration and addressing issues and perspectives in AI applications with a focus on low-dose radioprotection research. Researchers involved in radioprotection research and experts in relevant public policy domains are invited to explore the utility of AI in low-dose radiation research at the next workshop.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"46 22","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139534067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ijaz Ahmad, Alessia Amelio, A. Merla, Francesca Scozzari
{"title":"A survey on the role of artificial intelligence in managing Long COVID","authors":"Ijaz Ahmad, Alessia Amelio, A. Merla, Francesca Scozzari","doi":"10.3389/frai.2023.1292466","DOIUrl":"https://doi.org/10.3389/frai.2023.1292466","url":null,"abstract":"In the last years, several techniques of artificial intelligence have been applied to data from COVID-19. In addition to the symptoms related to COVID-19, many individuals with SARS-CoV-2 infection have described various long-lasting symptoms, now termed Long COVID. In this context, artificial intelligence techniques have been utilized to analyze data from Long COVID patients in order to assist doctors and alleviate the considerable strain on care and rehabilitation facilities. In this paper, we explore the impact of the machine learning methodologies that have been applied to analyze the many aspects of Long COVID syndrome, from clinical presentation through diagnosis. We also include the text mining techniques used to extract insights and trends from large amounts of text data related to Long COVID. Finally, we critically compare the various approaches and outline the work that has to be done to create a robust artificial intelligence approach for efficient diagnosis and treatment of Long COVID.","PeriodicalId":508738,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"43 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139533389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}