{"title":"基于图像帧代数和视觉语义代数的机器视觉基础研究","authors":"Guoyin Wang","doi":"10.1109/CMVIT57620.2023.00009","DOIUrl":null,"url":null,"abstract":"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].","PeriodicalId":191655,"journal":{"name":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)\",\"authors\":\"Guoyin Wang\",\"doi\":\"10.1109/CMVIT57620.2023.00009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].\",\"PeriodicalId\":191655,\"journal\":{\"name\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CMVIT57620.2023.00009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Machine Vision and Information Technology (CMVIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMVIT57620.2023.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Basic Research on Machine Vision Underpinned by Image Frame Algebra (VFA) and Visual Semantic Algebra (VSA)
Computer vison [1], [2], [3], [4], [5] studies properties of machine vision, its semantic understanding, and general manipulations by Intelligent Mathematics (IM) [6], [7], [8], [9], [10] [11], [12], [13], [14], [15] [16], [17]. Computer vison has been studies from various aspects such as algorithmic methods, analysis methods, pattern recognitions, and neural-network-regression (AI) technologies [2], [3]. However, there is a lack of fundamental theories for enabling autonomous image recognition and processing by machines. Basic research on contemporary IM has revealed that formal manipulations of visual objects by intelligent machines may be rigorously implemented by Image Frame Algebra (IFA) [8], [18] in the front-end and Visual Semantic Algebra (VSA) [19] in the backend. IFA formally manipulates visual images as general 2D matrixes by a set of algebraic operators such as modeling, analyses, syntheses, feature elicitation, and pattern recognition [4], [5], [18]. Then, its counterpart, VSA, transforms the geographic relations of visual objects to their semantic interpretations by algebraic analyses and compositions. The coherent theory of IFA and VSA provides a formal methodology for machine-enabled image processing and comprehension. This keynote presents a theoretical framework of machine vision underpinned by IFA and VSA for the structural denotations of visual objects and functional manipulations of visual mechanisms [3], [8], [9]. It demonstrates how the persistent challenges to machine vision may be rigorously and efficiently solved by the IFA/VSA methodology. Case studies on applying IFA/VSA for rigorous visual pattern detection, recognition, analysis, and composition in real world will be demonstrated [5], [18], [20]. As two coherent paradigms of IM, among others [21], [22], [23], [24], [25] [26], [27], [28], [29], [30], IFA and VSA have been applied not only in robot visual and spatial reasoning, but also in computational intelligence and AI for rigorously representing and manipulating of visual objects and patterns by machine recognition and cognition [31], [32], [33], [34], [35] [36], [37], [38], [39], [40], [41], [42], [43], [44], [45] [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65] [66], [67], [68], [69], [70], [71], [72], [73], [74], [75] [76].