从计算机视觉到短文理解:将相似的方法应用于不同的学科

Jiayin Lin;Geng Sun;Jun Shen;David E. Pritchard;Ping Yu;Tingru Cui;Dongming Xu;Li Li;Ghassan Beydoun
{"title":"从计算机视觉到短文理解:将相似的方法应用于不同的学科","authors":"Jiayin Lin;Geng Sun;Jun Shen;David E. Pritchard;Ping Yu;Tingru Cui;Dongming Xu;Li Li;Ghassan Beydoun","doi":"10.23919/ICN.2022.0010","DOIUrl":null,"url":null,"abstract":"With the development of IoT and 5G technologies, more and more online resources are presented in trendy multimodal data forms over the Internet. Hence, effectively processing multimodal information is significant to the development of various online applications, including e-Iearning and digital health, to just name a few. However, most AI-driven systems or models can only handle limited forms of information. In this study, we investigate the correlation between natural language processing (NLP) and pattern recognition, trying to apply the mainstream approaches and models used in the computer vision (CV) to the task of NLP. Based on two different Twitter datasets, we propose a convolutional neural network based model to interpret the content of short text with different goals and application backgrounds. The experiments have demonstrated that our proposed model shows fairly competitive performance compared to the mainstream recurrent neural network based NLP models such as bidirectional long short-term memory (Bi-LSTM) and bidirectional gate recurrent unit (Bi-GRU). Moreover, the experimental results also demonstrate that the proposed model can precisely locate the key information in the given text.","PeriodicalId":100681,"journal":{"name":"Intelligent and Converged Networks","volume":"3 2","pages":"161-172"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9195266/9878033/09878056.pdf","citationCount":"1","resultStr":"{\"title\":\"From computer vision to short text understanding: Applying similar approaches into different disciplines\",\"authors\":\"Jiayin Lin;Geng Sun;Jun Shen;David E. Pritchard;Ping Yu;Tingru Cui;Dongming Xu;Li Li;Ghassan Beydoun\",\"doi\":\"10.23919/ICN.2022.0010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of IoT and 5G technologies, more and more online resources are presented in trendy multimodal data forms over the Internet. Hence, effectively processing multimodal information is significant to the development of various online applications, including e-Iearning and digital health, to just name a few. However, most AI-driven systems or models can only handle limited forms of information. In this study, we investigate the correlation between natural language processing (NLP) and pattern recognition, trying to apply the mainstream approaches and models used in the computer vision (CV) to the task of NLP. Based on two different Twitter datasets, we propose a convolutional neural network based model to interpret the content of short text with different goals and application backgrounds. The experiments have demonstrated that our proposed model shows fairly competitive performance compared to the mainstream recurrent neural network based NLP models such as bidirectional long short-term memory (Bi-LSTM) and bidirectional gate recurrent unit (Bi-GRU). Moreover, the experimental results also demonstrate that the proposed model can precisely locate the key information in the given text.\",\"PeriodicalId\":100681,\"journal\":{\"name\":\"Intelligent and Converged Networks\",\"volume\":\"3 2\",\"pages\":\"161-172\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/iel7/9195266/9878033/09878056.pdf\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligent and Converged Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/9878056/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent and Converged Networks","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/9878056/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

随着物联网和5G技术的发展,越来越多的在线资源在互联网上以时尚的多模态数据形式呈现。因此,有效地处理多模式信息对于各种在线应用程序的开发非常重要,包括电子学习和数字健康,仅举几例。然而,大多数人工智能驱动的系统或模型只能处理有限形式的信息。在这项研究中,我们探讨了自然语言处理(NLP)和模式识别之间的相关性,试图将计算机视觉(CV)中使用的主流方法和模型应用于NLP任务。基于两个不同的Twitter数据集,我们提出了一个基于卷积神经网络的模型来解释具有不同目标和应用背景的短文本内容。实验表明,与主流的基于循环神经网络的NLP模型(如双向长短期记忆(Bi-LSTM)和双向门循环单元(Bi-GRU))相比,我们提出的模型表现出相当有竞争力的性能。此外,实验结果还表明,该模型可以准确地定位给定文本中的关键信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
From computer vision to short text understanding: Applying similar approaches into different disciplines
With the development of IoT and 5G technologies, more and more online resources are presented in trendy multimodal data forms over the Internet. Hence, effectively processing multimodal information is significant to the development of various online applications, including e-Iearning and digital health, to just name a few. However, most AI-driven systems or models can only handle limited forms of information. In this study, we investigate the correlation between natural language processing (NLP) and pattern recognition, trying to apply the mainstream approaches and models used in the computer vision (CV) to the task of NLP. Based on two different Twitter datasets, we propose a convolutional neural network based model to interpret the content of short text with different goals and application backgrounds. The experiments have demonstrated that our proposed model shows fairly competitive performance compared to the mainstream recurrent neural network based NLP models such as bidirectional long short-term memory (Bi-LSTM) and bidirectional gate recurrent unit (Bi-GRU). Moreover, the experimental results also demonstrate that the proposed model can precisely locate the key information in the given text.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信