基于图像帧代数和视觉语义代数的机器视觉基础研究

Guoyin Wang
{"title":"基于图像帧代数和视觉语义代数的机器视觉基础研究","authors":"Guoyin Wang","doi":"10.1109/CMVIT57620.2023.00009","DOIUrl":null,"url":null,"abstract":"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].","PeriodicalId":191655,"journal":{"name":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)\",\"authors\":\"Guoyin Wang\",\"doi\":\"10.1109/CMVIT57620.2023.00009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].\",\"PeriodicalId\":191655,\"journal\":{\"name\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CMVIT57620.2023.00009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMVIT57620.2023.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

计算机视觉[1],[2],[3],[4],[5]研究机器视觉的特性,语义理解和智能数学(IM)的一般操作[6],[7],[8],[9],[10],[11],[12],[13],[14],[15],[16],[17]。计算机视觉已经从算法方法、分析方法、模式识别和神经网络回归(AI)技术等各个方面进行了研究[2],[3]。然而,目前还缺乏使机器能够自主识别和处理图像的基本理论。当代IM的基础研究表明,智能机器对视觉对象的正式操作可以通过前端的图像帧代数(IFA)[8]、[18]和后端的视觉语义代数(VSA)[19]严格实现。IFA通过建模、分析、合成、特征提取和模式识别等一系列代数运算符将视觉图像作为一般的二维矩阵进行正式操作[4]、[5]、[18]。然后,它的对应物VSA通过代数分析和组合将视觉对象的地理关系转换为它们的语义解释。IFA和VSA的一致理论为机器支持的图像处理和理解提供了一种正式的方法。本主题提出了一个以IFA和VSA为基础的机器视觉理论框架,用于视觉对象的结构表示和视觉机制的功能操作[3],[8],[9]。它展示了如何通过IFA/VSA方法严格有效地解决机器视觉方面的持续挑战。将展示在现实世界中应用IFA/VSA进行严格的视觉模式检测、识别、分析和合成的案例研究[5],[18],[20]。IFA和VSA作为IM的两个连贯范例,除其他外[21],[22],[23],[24],[25],[26],[27],[28],[29],[30],IFA和VSA不仅应用于机器人的视觉和空间推理,还应用于计算智能和人工智能,通过机器识别和认知对视觉对象和模式进行严格的表示和操作[31],[32],[33],[34],[35],[36],[37],[38],[39],[40],[41],[42],[43],[44],[45],[46],[47],[48],[49],[50],[51],[52],[53],[55],[56],[57]、[58]、[59],[60],[61],[62],[63],[64],[65][66],[67],[68],[69],[70],[71],[72],[73],[74],[75][76]。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)
Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信