A brief reference to AI-driven audible reality (AuRa) in open world: potential, applications, and evaluation.

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Frontiers in Artificial Intelligence Pub Date : 2024-10-25 eCollection Date: 2024-01-01 DOI:10.3389/frai.2024.1424371
Ömer Ates, Garima Pandey, Athanasios Gousiopoulos, Theodoros G Soldatos
{"title":"A brief reference to AI-driven audible reality (AuRa) in open world: potential, applications, and evaluation.","authors":"Ömer Ates, Garima Pandey, Athanasios Gousiopoulos, Theodoros G Soldatos","doi":"10.3389/frai.2024.1424371","DOIUrl":null,"url":null,"abstract":"<p><p>Recent developments on artificial intelligence (AI) and machine learning (ML) techniques are expected to have significant impact on public health in several ways. Indeed, modern AI/ML methods have been applied on multiple occasions on topics ranging from drug discovery and disease diagnostics to personalized medicine, medical imaging, and healthcare operations. While such developments may improve several quality-of-life aspects (such as access to health services and education), it is important considering that some individuals may face more challenges, particularly in extreme or emergency situations. In this work, we focus on utilizing AI/ML components to support scenarios when visual impairment or other limitations hinder the ability to interpret the world in this way. Specifically, we discuss the potential and the feasibility of automatically transferring key visual information into audio communication, in different languages and in real-time-a setting which we name '<i>au</i>dible <i>r</i>e<i>a</i>lity' (AuRa). We provide a short guide to practical options currently available for implementing similar solutions and summarize key aspects for evaluating their scope. Finally, we discuss diverse settings and functionalities that AuRA applications could have in terms of broader impact, from a social and public health context, and invite the community to further such digital solutions and perspectives soon.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1424371"},"PeriodicalIF":3.0000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11543578/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2024.1424371","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Recent developments on artificial intelligence (AI) and machine learning (ML) techniques are expected to have significant impact on public health in several ways. Indeed, modern AI/ML methods have been applied on multiple occasions on topics ranging from drug discovery and disease diagnostics to personalized medicine, medical imaging, and healthcare operations. While such developments may improve several quality-of-life aspects (such as access to health services and education), it is important considering that some individuals may face more challenges, particularly in extreme or emergency situations. In this work, we focus on utilizing AI/ML components to support scenarios when visual impairment or other limitations hinder the ability to interpret the world in this way. Specifically, we discuss the potential and the feasibility of automatically transferring key visual information into audio communication, in different languages and in real-time-a setting which we name 'audible reality' (AuRa). We provide a short guide to practical options currently available for implementing similar solutions and summarize key aspects for evaluating their scope. Finally, we discuss diverse settings and functionalities that AuRA applications could have in terms of broader impact, from a social and public health context, and invite the community to further such digital solutions and perspectives soon.

开放世界中人工智能驱动的可听现实(AuRa)简述:潜力、应用和评估。
人工智能(AI)和机器学习(ML)技术的最新发展有望在多个方面对公共卫生产生重大影响。事实上,现代人工智能/ML 方法已多次应用于从药物发现和疾病诊断到个性化医疗、医学成像和医疗保健运营等多个领域。虽然这些发展可能会改善生活质量的多个方面(如获得医疗服务和教育),但考虑到有些人可能会面临更多挑战,尤其是在极端或紧急情况下,这一点非常重要。在这项工作中,我们将重点放在利用人工智能/ML 组件,在视觉障碍或其他限制阻碍了以这种方式解读世界的能力时,为各种场景提供支持。具体来说,我们讨论了自动将关键视觉信息转化为不同语言的实时音频交流的潜力和可行性--我们将其命名为 "可听现实"(AuRa)。我们提供了一份简短的指南,介绍了目前可用于实施类似解决方案的实用选项,并总结了评估其范围的关键方面。最后,我们从社会和公共卫生的角度,讨论了 AuRA 应用在更广泛的影响方面可能具有的各种环境和功能,并邀请社会各界尽快进一步探讨此类数字解决方案和观点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.10
自引率
2.50%
发文量
272
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信