International Journal of Multimedia Information Retrieval最新文献

筛选
英文 中文
Deep learning for video-text retrieval: a review 视频文本检索的深度学习:综述
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2023-02-23 DOI: 10.1007/s13735-023-00267-8
Cunjuan Zhu, Qi Jia, Wei Chen, Yanming Guo, Yu Liu
{"title":"Deep learning for video-text retrieval: a review","authors":"Cunjuan Zhu, Qi Jia, Wei Chen, Yanming Guo, Yu Liu","doi":"10.1007/s13735-023-00267-8","DOIUrl":"https://doi.org/10.1007/s13735-023-00267-8","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"104 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75907213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
End-to-end residual learning-based deep neural network model deployment for human activity recognition 基于端到端残差学习的人类活动识别深度神经网络模型部署
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2023-02-19 DOI: 10.1007/s13735-023-00269-6
Alok Negi, Krishan Kumar
{"title":"End-to-end residual learning-based deep neural network model deployment for human activity recognition","authors":"Alok Negi, Krishan Kumar","doi":"10.1007/s13735-023-00269-6","DOIUrl":"https://doi.org/10.1007/s13735-023-00269-6","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"102 1","pages":"1-15"},"PeriodicalIF":5.6,"publicationDate":"2023-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80596794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation. 情绪感知音乐塔块(EmoMTB):用于音乐发现和推荐的智能视听界面。
IF 3.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2023-01-01 Epub Date: 2023-06-02 DOI: 10.1007/s13735-023-00275-8
Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl
{"title":"Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation.","authors":"Alessandro B Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Fritzl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl","doi":"10.1007/s13735-023-00275-8","DOIUrl":"10.1007/s13735-023-00275-8","url":null,"abstract":"<p><p>Music listening has experienced a sharp increase during the last decade thanks to music streaming and recommendation services. While they offer text-based search functionality and provide recommendation lists of remarkable utility, their typical mode of interaction is unidimensional, i.e., they provide lists of consecutive tracks, which are commonly inspected in sequential order by the user. The user experience with such systems is heavily affected by cognition biases (e.g., position bias, human tendency to pay more attention to first positions of ordered lists) as well as algorithmic biases (e.g., popularity bias, the tendency of recommender systems to overrepresent popular items). This may cause dissatisfaction among the users by disabling them to find novel music to enjoy. In light of such systems and biases, we propose an intelligent audiovisual music exploration system named EmoMTB . It allows the user to browse the entirety of a given collection in a free nonlinear fashion. The navigation is assisted by a set of personalized emotion-aware recommendations, which serve as starting points for the exploration experience. EmoMTB  adopts the metaphor of a city, in which each track (visualized as a colored cube) represents one floor of a building. Highly similar tracks are located in the same building; moderately similar ones form neighborhoods that mostly correspond to genres. Tracks situated between distinct neighborhoods create a gradual transition between genres. Users can navigate this music city using their smartphones as control devices. They can explore districts of well-known music or decide to leave their comfort zone. In addition, EmoMTB   integrates an emotion-aware music recommendation system that re-ranks the list of suggested starting points for exploration according to the user's self-identified emotion or the collective emotion expressed in EmoMTB 's Twitter channel. Evaluation of EmoMTB   has been carried out in a threefold way: by quantifying the homogeneity of the clustering underlying the construction of the city, by measuring the accuracy of the emotion predictor, and by carrying out a web-based survey composed of open questions to obtain qualitative feedback from users.</p>","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"12 1","pages":"13"},"PeriodicalIF":3.6,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10238318/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9587418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tri-RAT: optimizing the attention scores for image captioning Tri-RAT:优化图像字幕的注意分数
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-10-06 DOI: 10.1007/s13735-022-00260-7
You Yang, Yongzhi An, Juntao Hu, Longyue Pan
{"title":"Tri-RAT: optimizing the attention scores for image captioning","authors":"You Yang, Yongzhi An, Juntao Hu, Longyue Pan","doi":"10.1007/s13735-022-00260-7","DOIUrl":"https://doi.org/10.1007/s13735-022-00260-7","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"18 1","pages":"705-715"},"PeriodicalIF":5.6,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82612596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gender classification from face images using central difference convolutional networks 基于中心差分卷积网络的人脸性别分类
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-09-22 DOI: 10.1007/s13735-022-00259-0
Mohammadreza Sheikh Fathollahi, Rezvan Heidari
{"title":"Gender classification from face images using central difference convolutional networks","authors":"Mohammadreza Sheikh Fathollahi, Rezvan Heidari","doi":"10.1007/s13735-022-00259-0","DOIUrl":"https://doi.org/10.1007/s13735-022-00259-0","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"os-53 1","pages":"695-703"},"PeriodicalIF":5.6,"publicationDate":"2022-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87367954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning MHA-WoML:多头注意,Wasserstein-OT:少镜头学习
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-09-21 DOI: 10.1007/s13735-022-00254-5
Junyan Yang, Jie Jiang, Yanming Guo
{"title":"MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning","authors":"Junyan Yang, Jie Jiang, Yanming Guo","doi":"10.1007/s13735-022-00254-5","DOIUrl":"https://doi.org/10.1007/s13735-022-00254-5","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"24 1","pages":"681-694"},"PeriodicalIF":5.6,"publicationDate":"2022-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82995441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Similar interior coordination image retrieval with multi-view features 基于多视点特征的相似内部协调图像检索
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-08-26 DOI: 10.1007/s13735-022-00247-4
Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, M. Haseyama
{"title":"Similar interior coordination image retrieval with multi-view features","authors":"Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, M. Haseyama","doi":"10.1007/s13735-022-00247-4","DOIUrl":"https://doi.org/10.1007/s13735-022-00247-4","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"46 1","pages":"731-740"},"PeriodicalIF":5.6,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77435089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A literature review and perspectives in deepfakes: generation, detection, and applications 深度伪造的文献综述与展望:生成、检测和应用
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-07-23 DOI: 10.1007/s13735-022-00241-w
Deepak Dagar, D. Vishwakarma
{"title":"A literature review and perspectives in deepfakes: generation, detection, and applications","authors":"Deepak Dagar, D. Vishwakarma","doi":"10.1007/s13735-022-00241-w","DOIUrl":"https://doi.org/10.1007/s13735-022-00241-w","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"1 1","pages":"219 - 289"},"PeriodicalIF":5.6,"publicationDate":"2022-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76211854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Cross-domain image retrieval: methods and applications 跨域图像检索:方法和应用
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-07-23 DOI: 10.1007/s13735-022-00244-7
Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang
{"title":"Cross-domain image retrieval: methods and applications","authors":"Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang","doi":"10.1007/s13735-022-00244-7","DOIUrl":"https://doi.org/10.1007/s13735-022-00244-7","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"53 1","pages":"199 - 218"},"PeriodicalIF":5.6,"publicationDate":"2022-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81120217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review 使用三维卷积神经网络从计算机断层扫描图像中分割器官:系统回顾
IF 5.6 3区 计算机科学
International Journal of Multimedia Information Retrieval Pub Date : 2022-07-16 DOI: 10.1007/s13735-022-00242-9
A. Ilesanmi, Taiwo Ilesanmi, O. P. Idowu, D. Torigian, J. Udupa
{"title":"Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review","authors":"A. Ilesanmi, Taiwo Ilesanmi, O. P. Idowu, D. Torigian, J. Udupa","doi":"10.1007/s13735-022-00242-9","DOIUrl":"https://doi.org/10.1007/s13735-022-00242-9","url":null,"abstract":"","PeriodicalId":48501,"journal":{"name":"International Journal of Multimedia Information Retrieval","volume":"8 1","pages":"315 - 331"},"PeriodicalIF":5.6,"publicationDate":"2022-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85311134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信