IEEE Transactions on Multimedia最新文献

筛选
英文 中文
Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling 通过视觉语言辅助伪标记生成弱监督三维场景图
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-16 DOI: 10.1109/tmm.2024.3443670
Xu Wang, Yifan Li, Qiudan Zhang, Wenhui Wu, Mark Junjie Li, Lin Ma, Jianmin Jiang
{"title":"Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling","authors":"Xu Wang, Yifan Li, Qiudan Zhang, Wenhui Wu, Mark Junjie Li, Lin Ma, Jianmin Jiang","doi":"10.1109/tmm.2024.3443670","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443670","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"27 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controllable Syllable-Level Lyrics Generation from Melody with Prior Attention 根据事先注意的旋律生成可控音节级歌词
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-15 DOI: 10.1109/tmm.2024.3443664
Zhe Zhang, Yi Yu, Atsuhiro Takasu
{"title":"Controllable Syllable-Level Lyrics Generation from Melody with Prior Attention","authors":"Zhe Zhang, Yi Yu, Atsuhiro Takasu","doi":"10.1109/tmm.2024.3443664","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443664","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"6 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anti-Collapse Loss for Deep Metric Learning 深度度量学习的防坍塌损失
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-15 DOI: 10.1109/tmm.2024.3443616
Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Liqiang Nie, Heng-Tao Shen
{"title":"Anti-Collapse Loss for Deep Metric Learning","authors":"Xiruo Jiang, Yazhou Yao, Xili Dai, Fumin Shen, Liqiang Nie, Heng-Tao Shen","doi":"10.1109/tmm.2024.3443616","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443616","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"4 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gist,Content,Target-Oriented:A 3-Level Human-Like Framework for Video Moment Retrieval 要点、内容、目标导向:用于视频瞬间检索的三层类人框架
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443672
Di Wang, Xiantao Lu, Quan Wang, Yumin Tian, Bo Wan, Lihuo He
{"title":"Gist,Content,Target-Oriented:A 3-Level Human-Like Framework for Video Moment Retrieval","authors":"Di Wang, Xiantao Lu, Quan Wang, Yumin Tian, Bo Wan, Lihuo He","doi":"10.1109/tmm.2024.3443672","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443672","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"46 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse Pedestrian Character Learning for Trajectory Prediction 稀疏行人特征学习用于轨迹预测
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443591
Yonghao Dong, Le Wang, Sanping Zhou, Gang Hua, Changyin Sun
{"title":"Sparse Pedestrian Character Learning for Trajectory Prediction","authors":"Yonghao Dong, Le Wang, Sanping Zhou, Gang Hua, Changyin Sun","doi":"10.1109/tmm.2024.3443591","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443591","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"39 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HSSHG: Heuristic Semantics-constrained Spatio-temporal Heterogeneous Graph for VideoQA HSSHG:用于视频质量检测的启发式语义约束时空异构图
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443661
Ruomei Wang, Yuanmao Luo, Fuwei Zhang, Mingyang Liu, Xiaonan Luo
{"title":"HSSHG: Heuristic Semantics-constrained Spatio-temporal Heterogeneous Graph for VideoQA","authors":"Ruomei Wang, Yuanmao Luo, Fuwei Zhang, Mingyang Liu, Xiaonan Luo","doi":"10.1109/tmm.2024.3443661","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443661","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"81 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MMVS: Enabling Robust Adaptive Video Streaming for Wildly Fluctuating and Heterogeneous Networks MMVS:为剧烈波动的异构网络提供稳健的自适应视频流服务
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443609
Shuoyao Wang, Jiawei Lin, Yu Dai
{"title":"MMVS: Enabling Robust Adaptive Video Streaming for Wildly Fluctuating and Heterogeneous Networks","authors":"Shuoyao Wang, Jiawei Lin, Yu Dai","doi":"10.1109/tmm.2024.3443609","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443609","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"81 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BI-AVAN: A Brain-Inspired Adversarial Visual Attention Network for Characterizing Human Visual Attention from Neural Activity BI-AVAN:从神经活动描述人类视觉注意力的脑启发对抗性视觉注意力网络
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443623
Heng Huang, Lin Zhao, Haixing Dai, Lu Zhang, Xintao Hu, Dajiang Zhu, Tianming Liu
{"title":"BI-AVAN: A Brain-Inspired Adversarial Visual Attention Network for Characterizing Human Visual Attention from Neural Activity","authors":"Heng Huang, Lin Zhao, Haixing Dai, Lu Zhang, Xintao Hu, Dajiang Zhu, Tianming Liu","doi":"10.1109/tmm.2024.3443623","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443623","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"386 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GS-SFS: Joint Gaussian Splatting and Shape-from-Silhouette for Multiple Human Reconstruction in Large-Scale Sports Scenes GS-SFS:联合高斯拼接和轮廓塑形技术,用于大规模运动场景中的多人重构
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443637
Yuqi Jiang, Jing Li, Haidong Qin, Yanran Dai, Jing Liu, Guodong Zhang, Canbin Zhang, Tao Yang
{"title":"GS-SFS: Joint Gaussian Splatting and Shape-from-Silhouette for Multiple Human Reconstruction in Large-Scale Sports Scenes","authors":"Yuqi Jiang, Jing Li, Haidong Qin, Yanran Dai, Jing Liu, Guodong Zhang, Canbin Zhang, Tao Yang","doi":"10.1109/tmm.2024.3443637","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443637","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"137 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RCVS: A Unified Registration and Fusion Framework for Video Streams RCVS:视频流统一注册与融合框架
IF 7.3 1区 计算机科学
IEEE Transactions on Multimedia Pub Date : 2024-08-14 DOI: 10.1109/tmm.2024.3443673
Housheng Xie, Meng Sang, Yukuan Zhang, Yang Yang, Shan Zhao, Jianbo Zhong
{"title":"RCVS: A Unified Registration and Fusion Framework for Video Streams","authors":"Housheng Xie, Meng Sang, Yukuan Zhang, Yang Yang, Shan Zhao, Jianbo Zhong","doi":"10.1109/tmm.2024.3443673","DOIUrl":"https://doi.org/10.1109/tmm.2024.3443673","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"81 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信