Burcu Arslan, Blair Lehman, Caitlin Tenison, Jesse R Sparks, Alexis A López, Lin Gu, Diego Zapata-Rivera
{"title":"Opportunities and challenges of using generative AI to personalize educational assessment.","authors":"Burcu Arslan, Blair Lehman, Caitlin Tenison, Jesse R Sparks, Alexis A López, Lin Gu, Diego Zapata-Rivera","doi":"10.3389/frai.2024.1460651","DOIUrl":"10.3389/frai.2024.1460651","url":null,"abstract":"<p><p>In line with the positive effects of personalized learning, personalized assessments are expected to maximize learner motivation and engagement, allowing learners to show what they truly know and can do. Considering the advances in Generative Artificial Intelligence (GenAI), in this perspective article, we elaborate on the opportunities of integrating GenAI into personalized educational assessments to maximize learner engagement, performance, and access. We also draw attention to the challenges of integrating GenAI into personalized educational assessments regarding its potential risks to the assessment's core values of validity, reliability, and fairness. Finally, we discuss possible solutions and future directions.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1460651"},"PeriodicalIF":3.0,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11491426/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jithin K Sreedharan, Fred Saleh, Abdullah Alqahtani, Ibrahim Ahmed Albalawi, Gokul Krishna Gopalakrishnan, Hadi Abdullah Alahmed, Basem Ahmed Alsultan, Dhafer Mana Alalharith, Musallam Alnasser, Ayedh Dafer Alahmari, Manjush Karthika
{"title":"Applications of artificial intelligence in emergency and critical care diagnostics: a systematic review and meta-analysis.","authors":"Jithin K Sreedharan, Fred Saleh, Abdullah Alqahtani, Ibrahim Ahmed Albalawi, Gokul Krishna Gopalakrishnan, Hadi Abdullah Alahmed, Basem Ahmed Alsultan, Dhafer Mana Alalharith, Musallam Alnasser, Ayedh Dafer Alahmari, Manjush Karthika","doi":"10.3389/frai.2024.1422551","DOIUrl":"10.3389/frai.2024.1422551","url":null,"abstract":"<p><strong>Introduction: </strong>Artificial intelligence has come to be the highlight in almost all fields of science. It uses various models and algorithms to detect patterns and specific findings to diagnose a disease with utmost accuracy. With the increasing need for accurate and precise diagnosis of disease, employing artificial intelligence models and concepts in healthcare setup can be beneficial.</p><p><strong>Methodology: </strong>The search engines and databases employed in this study are PubMed, ScienceDirect and Medline. Studies published between 1st January 2013 to 1st February 2023 were included in this analysis. The selected articles were screened preliminarily using the Rayyan web tool, after which investigators screened the selected articles individually. The risk of bias for the selected studies was assessed using QUADAS-2 tool specially designed to test bias among studies related to diagnostic test reviews.</p><p><strong>Results: </strong>In this review, 17 studies were included from a total of 12,173 studies. These studies were analysed for their sensitivity, accuracy, positive predictive value, specificity and negative predictive value in diagnosing barrette's neoplasia, cardiac arrest, esophageal adenocarcinoma, sepsis and gastrointestinal stromal tumors. All the studies reported heterogeneity with <i>p</i>-value <0.05 at confidence interval 95%.</p><p><strong>Conclusion: </strong>The existing evidential data suggests that artificial intelligence can be highly helpful in the field of diagnosis providing maximum precision and early detection. This helps to prevent disease progression and also helps to provide treatment at the earliest. Employing artificial intelligence in diagnosis will define the advancement of health care environment and also be beneficial in every aspect concerned with treatment to illnesses.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1422551"},"PeriodicalIF":3.0,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11487586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abinand Nallathambi, Christin David Bose, Wilfried Haensch, Anand Raghunathan
{"title":"LRMP: Layer Replication with Mixed Precision for spatial in-memory DNN accelerators.","authors":"Abinand Nallathambi, Christin David Bose, Wilfried Haensch, Anand Raghunathan","doi":"10.3389/frai.2024.1268317","DOIUrl":"10.3389/frai.2024.1268317","url":null,"abstract":"<p><p>In-memory computing (IMC) with non-volatile memories (NVMs) has emerged as a promising approach to address the rapidly growing computational demands of Deep Neural Networks (DNNs). Mapping DNN layers spatially onto NVM-based IMC accelerators achieves high degrees of parallelism. However, two challenges that arise in this approach are the highly non-uniform distribution of layer processing times and high area requirements. We propose LRMP, a method to jointly apply layer replication and mixed precision quantization to improve the performance of DNNs when mapped to area-constrained IMC accelerators. LRMP uses a combination of reinforcement learning and mixed integer linear programming to search the replication-quantization design space using a model that is closely informed by the target hardware architecture. Across five DNN benchmarks, LRMP achieves 2.6-9.3× latency and 8-18× throughput improvement at minimal (<1%) degradation in accuracy.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1268317"},"PeriodicalIF":3.0,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11486753/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generative AI with WGAN-GP for boosting seizure detection accuracy.","authors":"Lina Abou-Abbas, Khadidja Henni, Imene Jemal, Neila Mezghani","doi":"10.3389/frai.2024.1437315","DOIUrl":"https://doi.org/10.3389/frai.2024.1437315","url":null,"abstract":"<p><strong>Background: </strong>Imbalanced datasets pose challenges for developing accurate seizure detection systems based on electroencephalogram (EEG) data. Generative AI techniques may help augment minority class data to facilitate automatic epileptic seizure detection.</p><p><strong>New method: </strong>This study investigates the impact of various data augmentation (DA) approaches, including Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP), Vanilla GAN, Conditional GAN (CGAN), and Cramer GAN, on classification performance with Random Forest models. The best-performing GAN variant, WGAN-GP, was then integrated with a bidirectional Long Short-Term Memory (LSTM) architecture and compared against traditional and synthetic oversampling methods.</p><p><strong>Results: </strong>The evaluation of different GAN variants for data augmentation with Random Forest classifiers identified WGAN-GP as the most effective approach. The integration of WGAN-GP with bidirectional LSTM yielded substantial performance improvements, outperforming traditional oversampling methods and achieving an accuracy of 91.73% on the augmented data, compared to 86% accuracy on real data without augmentation.</p><p><strong>Comparison with existing methods: </strong>The proposed generative AI approach combining WGAN-GP and recurrent neural network models outperforms comparative synthetic oversampling methods on metrics relevant for reliable seizure detection from imbalanced EEG datasets.</p><p><strong>Conclusions: </strong>Incorporating the WGAN-GP generative AI technique for data augmentation and integrating it with bidirectional LSTM elevates seizure detection accuracy for imbalanced EEG datasets, surpassing the performance of traditional oversampling and class weight adjustment methods. This approach shows promise for improving epilepsy monitoring and management through enhanced automated detection system effectiveness.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1437315"},"PeriodicalIF":3.0,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480023/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling disagreement in automatic data labeling for semi-supervised learning in Clinical Natural Language Processing.","authors":"Hongshu Liu, Nabeel Seedat, Julia Ive","doi":"10.3389/frai.2024.1374162","DOIUrl":"https://doi.org/10.3389/frai.2024.1374162","url":null,"abstract":"<p><strong>Introduction: </strong>Computational models providing accurate estimates of their uncertainty are crucial for risk management associated with decision-making in healthcare contexts. This is especially true since many state-of-the-art systems are trained using the data which have been labeled automatically (self-supervised mode) and tend to overfit.</p><p><strong>Methods: </strong>In this study, we investigate the quality of uncertainty estimates from a range of current state-of-the-art predictive models applied to the problem of observation detection in radiology reports. This problem remains understudied for Natural Language Processing in the healthcare domain.</p><p><strong>Results: </strong>We demonstrate that Gaussian Processes (GPs) provide superior performance in quantifying the risks of three uncertainty labels based on the negative log predictive probability (NLPP) evaluation metric and mean maximum predicted confidence levels (MMPCL), whilst retaining strong predictive performance.</p><p><strong>Discussion: </strong>Our conclusions highlight the utility of probabilistic models applied to \"noisy\" labels and that similar methods could provide utility for Natural Language Processing (NLP) based automated labeling tasks.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1374162"},"PeriodicalIF":3.0,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11480042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leila Aissaoui Ferhi, Manel Ben Amar, Fethi Choubani, Ridha Bouallegue
{"title":"Enhancing diagnostic accuracy in symptom-based health checkers: a comprehensive machine learning approach with clinical vignettes and benchmarking.","authors":"Leila Aissaoui Ferhi, Manel Ben Amar, Fethi Choubani, Ridha Bouallegue","doi":"10.3389/frai.2024.1397388","DOIUrl":"https://doi.org/10.3389/frai.2024.1397388","url":null,"abstract":"<p><strong>Introduction: </strong>The development of machine learning models for symptom-based health checkers is a rapidly evolving area with significant implications for healthcare. Accurate and efficient diagnostic tools can enhance patient outcomes and optimize healthcare resources. This study focuses on evaluating and optimizing machine learning models using a dataset of 10 diseases and 9,572 samples.</p><p><strong>Methods: </strong>The dataset was divided into training and testing sets to facilitate model training and evaluation. The following models were selected and optimized: Decision Tree, Random Forest, Naive Bayes, Logistic Regression and K-Nearest Neighbors. Evaluation metrics included accuracy, F1 scores, and 10-fold cross-validation. ROC-AUC and precision-recall curves were also utilized to assess model performance, particularly in scenarios with imbalanced datasets. Clinical vignettes were employed to gauge the real-world applicability of the models.</p><p><strong>Results: </strong>The performance of the models was evaluated using accuracy, F1 scores, and 10-fold cross-validation. The use of ROC-AUC curves revealed that model performance improved with increasing complexity. Precision-recall curves were particularly useful in evaluating model sensitivity in imbalanced dataset scenarios. Clinical vignettes demonstrated the robustness of the models in providing accurate diagnoses.</p><p><strong>Discussion: </strong>The study underscores the importance of comprehensive model evaluation techniques. The use of clinical vignette testing and analysis of ROC-AUC and precision-recall curves are crucial in ensuring the reliability and sensitivity of symptom-based health checkers. These techniques provide a more nuanced understanding of model performance and highlight areas for further improvement.</p><p><strong>Conclusion: </strong>This study highlights the significance of employing diverse evaluation metrics and methods to ensure the robustness and accuracy of machine learning models in symptom-based health checkers. The integration of clinical vignettes and the analysis of ROC-AUC and precision-recall curves are essential steps in developing reliable and sensitive diagnostic tools.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1397388"},"PeriodicalIF":3.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11483353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anastasia S Panfilova, Ekaterina A Valueva, Ivan Y Ilyin
{"title":"The application of explainable artificial intelligence methods to models for automatic creativity assessment.","authors":"Anastasia S Panfilova, Ekaterina A Valueva, Ivan Y Ilyin","doi":"10.3389/frai.2024.1310518","DOIUrl":"https://doi.org/10.3389/frai.2024.1310518","url":null,"abstract":"<p><strong>Objective: </strong>The study is devoted to comparing various models based on Artificial Intelligence to determine the level of creativity based on drawings performed using the Urban test, as well as analyzing the results of applying explainable artificial intelligence methods to a trained model to identify the most relevant features in drawings that influence the model's prediction.</p><p><strong>Methods: </strong>The dataset is represented by a set of 1,823 scanned forms of drawings of participants performed according to the Urban test. The test results of each participant were assessed by an expert. Preprocessed images were used for fine-tuning pre-trained models such as MobileNet, ResNet18, AlexNet, DenseNet, ResNext, EfficientNet, ViT with additional linear layers to predict the participant's score. Visualization of the areas that are of greatest importance from the point of view of the model was carried out using the Gradient-weighted Class Activation Mapping (Grad-CAM) method.</p><p><strong>Results: </strong>Trained models based on MobileNet showed the highest prediction accuracy rate of 76%. The results of the application of explainable artificial intelligence demonstrated areas of interest that correlated with the criteria for expert assessment according to the Urban test. Analysis of erroneous predictions of the model in terms of interpretation of areas of interest made it possible to clarify the features of the drawing on which the model relies, contrary to the expert.</p><p><strong>Conclusion: </strong>The study demonstrated the possibility of using neural network methods for automated diagnosis of the level of creativity according to the Urban test based on the respondents' drawings. The application of explainable artificial intelligence methods to the trained model demonstrated the compliance of the identified activation zones with the rules of expert assessment according to the Urban test.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1310518"},"PeriodicalIF":3.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11473496/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient incremental training using a novel NMT-SMT hybrid framework for translation of low-resource languages.","authors":"Kumar Bhuvaneswari, Murugesan Varalakshmi","doi":"10.3389/frai.2024.1381290","DOIUrl":"https://doi.org/10.3389/frai.2024.1381290","url":null,"abstract":"<p><p>The data-hungry statistical machine translation (SMT) and neural machine translation (NMT) models offer state-of-the-art results for languages with abundant data resources. However, extensive research is imperative to make these models perform equally well for low-resource languages. This paper proposes a novel approach to integrate the best features of the NMT and SMT systems for improved translation performance of low-resource English-Tamil language pair. The suboptimal NMT model trained with the small parallel corpus translates the monolingual corpus and selects only the best translations, to retrain itself in the next iteration. The proposed method employs the SMT phrase-pair table to determine the best translations, based on the maximum match between the words of the phrase-pair dictionary and each of the individual translations. This repeating cycle of translation and retraining generates a large quasi-parallel corpus, thus making the NMT model more powerful. SMT-integrated incremental training demonstrates a substantial difference in translation performance as compared to the existing approaches for incremental training. The model is strengthened further by adopting a beam search decoding strategy to produce <i>k</i> best possible translations for each input sentence. Empirical findings prove that the proposed model with BLEU scores of 19.56 and 23.49 outperforms the baseline NMT with scores 11.06 and 17.06 for Eng-to-Tam and Tam-to-Eng translations, respectively. METEOR score evaluation further corroborates these results, proving the supremacy of the proposed model.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1381290"},"PeriodicalIF":3.0,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11461459/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142393858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Carsten Röttgen, Britta Herbig, Tobias Weinmann, Andreas Müller
{"title":"Algorithmic management and human-centered task design: a conceptual synthesis from the perspective of action regulation and sociomaterial systems theory.","authors":"Carsten Röttgen, Britta Herbig, Tobias Weinmann, Andreas Müller","doi":"10.3389/frai.2024.1441497","DOIUrl":"10.3389/frai.2024.1441497","url":null,"abstract":"<p><p>This paper aims to explain potential psychological effects of algorithmic management (AM) on human-centered task design and with that also workers' mental well-being. For this, we link research on algorithmic management (AM) with Sociomaterial System Theory and Action Regulation Theory (ART). Our main assumption is that psychological effects of sociomaterial systems, such as AM, can be explained by their impact on human action. From the synthesis of the theories, mixed effects on human-centered task design can be derived: It can be expected that AM contributes to fewer action regulation opportunities (i.e., job resources like job autonomy, transparency, predictability), and to lower intellectual demands (i.e., challenge demands like task complexity, problem solving). Moreover, it can be concluded that AM is related with more regulation problems (i.e., hindrance demands like overtaxing regulations) but also fewer regulation problems (like regulation obstacles, uncertainty). Based on these considerations and in line with the majority of current research, it can be assumed that the use of AM is indirectly associated with higher risks to workers' mental well-being. However, we also identify potential positive effects of AM as some stressful and demotivating obstacles at work are often mitigated. Based on these considerations, the main question of future research is not whether AM is good or bad for workers, but rather <i>how</i> work under AM can be designed to be humane. Our proposed model can guide and support researchers and practitioners in improving the understanding of the next generation of AM systems.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1441497"},"PeriodicalIF":3.0,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11461486/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142393856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ahsan Mahmood, Junier Oliva, Martin Andreas Styner
{"title":"Anomaly detection via Gumbel Noise Score Matching.","authors":"Ahsan Mahmood, Junier Oliva, Martin Andreas Styner","doi":"10.3389/frai.2024.1441205","DOIUrl":"10.3389/frai.2024.1441205","url":null,"abstract":"<p><p>We propose Gumbel Noise Score Matching (GNSM), a novel unsupervised method to detect anomalies in categorical data. GNSM accomplishes this by estimating the scores, i.e., the gradients of log likelihoods w.r.t. inputs, of continuously relaxed categorical distributions. We test our method on a suite of anomaly detection tabular datasets. GNSM achieves a consistently high performance across all experiments. We further demonstrate the flexibility of GNSM by applying it to image data where the model is tasked to detect poor segmentation predictions. Images ranked anomalous by GNSM show clear segmentation failures, with the anomaly scores strongly correlating with segmentation metrics computed on ground-truth. We outline the score matching training objective utilized by GNSM and provide an open-source implementation of our work.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1441205"},"PeriodicalIF":3.0,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11488619/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}