基于图像帧代数和视觉语义代数的机器视觉基础研究

2023 7th International Conference on Machine Vision and Information Technology (CMVIT) Pub Date : 2023-03-01 DOI:10.1109/CMVIT57620.2023.00009

Guoyin Wang

{"title":"基于图像帧代数和视觉语义代数的机器视觉基础研究","authors":"Guoyin Wang","doi":"10.1109/CMVIT57620.2023.00009","DOIUrl":null,"url":null,"abstract":"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].","PeriodicalId":191655,"journal":{"name":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)\",\"authors\":\"Guoyin Wang\",\"doi\":\"10.1109/CMVIT57620.2023.00009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].\",\"PeriodicalId\":191655,\"journal\":{\"name\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CMVIT57620.2023.00009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMVIT57620.2023.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

计算机视觉[1]，[2]，[3]，[4]，[5]研究机器视觉的特性，语义理解和智能数学(IM)的一般操作[6]，[7]，[8]，[9]，[10]，[11]，[12]，[13]，[14]，[15]，[16]，[17]。计算机视觉已经从算法方法、分析方法、模式识别和神经网络回归(AI)技术等各个方面进行了研究[2]，[3]。然而，目前还缺乏使机器能够自主识别和处理图像的基本理论。当代IM的基础研究表明，智能机器对视觉对象的正式操作可以通过前端的图像帧代数(IFA)[8]、[18]和后端的视觉语义代数(VSA)[19]严格实现。IFA通过建模、分析、合成、特征提取和模式识别等一系列代数运算符将视觉图像作为一般的二维矩阵进行正式操作[4]、[5]、[18]。然后，它的对应物VSA通过代数分析和组合将视觉对象的地理关系转换为它们的语义解释。IFA和VSA的一致理论为机器支持的图像处理和理解提供了一种正式的方法。本主题提出了一个以IFA和VSA为基础的机器视觉理论框架，用于视觉对象的结构表示和视觉机制的功能操作[3]，[8]，[9]。它展示了如何通过IFA/VSA方法严格有效地解决机器视觉方面的持续挑战。将展示在现实世界中应用IFA/VSA进行严格的视觉模式检测、识别、分析和合成的案例研究[5]，[18]，[20]。IFA和VSA作为IM的两个连贯范例，除其他外[21]，[22]，[23]，[24]，[25]，[26]，[27]，[28]，[29]，[30]，IFA和VSA不仅应用于机器人的视觉和空间推理，还应用于计算智能和人工智能，通过机器识别和认知对视觉对象和模式进行严格的表示和操作[31]，[32]，[33]，[34]，[35]，[36]，[37]，[38]，[39]，[40]，[41]，[42]，[43]，[44]，[45]，[46]，[47]，[48]，[49]，[50]，[51]，[52]，[53]，[55]，[56]，[57]、[58]、[59],[60],[61],[62],[63],[64],[65][66],[67],[68],[69],[70],[71],[72],[73],[74],[75][76]。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)

Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 7th International Conference on Machine Vision and Information Technology (CMVIT)

自引率

0.00%

发文量