Ky范数在视频分割中的应用

Herald of Advanced Information Technology Pub Date : 2020-04-10 DOI:10.15276/hait01.2020.1

Myroslava Koliada

{"title":"Ky范数在视频分割中的应用","authors":"Myroslava Koliada","doi":"10.15276/hait01.2020.1","DOIUrl":null,"url":null,"abstract":"This article presents results of applying the KyFan norm in the context of solving the problem of video segmentation. Since the task of video analysis can be considered as analysis of the sequence of images, it was decided to find a way to formalize the description of the video frame using the mathematical apparatus of non-square matrices. When choosing a method, particular attention was paid precisely to universality with respect to the dimension of the initial data due to the technical characteristics and nature of the video data -video frames are matrices of arbitrary dimension. The ability to skip the step of matrix transformation to square dimension, or vectorization using some descriptor allows you to reduce computational costsrequired for this transformation. It was decided to use the value of the Ky Fan norm as an image descriptor, since it is built on top of matrix singular values. As it is known, singular values are calculated during the singular decomposition of the matrix and can be used, among other features, to reduce the dimension of the source data. A singular decomposition does not impose restrictions on either the dimension or the character of the elements of the original matrix. In addition, it can be used to derive other matrix decompositions with required characteristics. A comparative analysis of the effectiveness of the obtained descriptor was carried out in the case of using the k-norm and 1-norm, which showed that the 1-norm allows us to identify the most significant changes in the scene, while k -norm is able to detect minor. In other words, depending on the context of the source video data and the scope of the developed application, it is possible to configure the sensitivity of the application to a change in the scene by varying thenumber of singular values involved. The decision about the presence of changes in the context of video scene is made based on a comparison of descriptors of two consecutive images, that is, the values of the Ky Fan norm.","PeriodicalId":375628,"journal":{"name":"Herald of Advanced Information Technology","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"KY FAN NORM APPLICATION FOR VIDEO SEGMENTATION\",\"authors\":\"Myroslava Koliada\",\"doi\":\"10.15276/hait01.2020.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents results of applying the KyFan norm in the context of solving the problem of video segmentation. Since the task of video analysis can be considered as analysis of the sequence of images, it was decided to find a way to formalize the description of the video frame using the mathematical apparatus of non-square matrices. When choosing a method, particular attention was paid precisely to universality with respect to the dimension of the initial data due to the technical characteristics and nature of the video data -video frames are matrices of arbitrary dimension. The ability to skip the step of matrix transformation to square dimension, or vectorization using some descriptor allows you to reduce computational costsrequired for this transformation. It was decided to use the value of the Ky Fan norm as an image descriptor, since it is built on top of matrix singular values. As it is known, singular values are calculated during the singular decomposition of the matrix and can be used, among other features, to reduce the dimension of the source data. A singular decomposition does not impose restrictions on either the dimension or the character of the elements of the original matrix. In addition, it can be used to derive other matrix decompositions with required characteristics. A comparative analysis of the effectiveness of the obtained descriptor was carried out in the case of using the k-norm and 1-norm, which showed that the 1-norm allows us to identify the most significant changes in the scene, while k -norm is able to detect minor. In other words, depending on the context of the source video data and the scope of the developed application, it is possible to configure the sensitivity of the application to a change in the scene by varying thenumber of singular values involved. The decision about the presence of changes in the context of video scene is made based on a comparison of descriptors of two consecutive images, that is, the values of the Ky Fan norm.\",\"PeriodicalId\":375628,\"journal\":{\"name\":\"Herald of Advanced Information Technology\",\"volume\":\"103 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Herald of Advanced Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15276/hait01.2020.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Herald of Advanced Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15276/hait01.2020.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文给出了KyFan范数在解决视频分割问题中的应用结果。由于视频分析的任务可以被认为是对图像序列的分析，因此决定找到一种方法来使用非方阵的数学装置形式化视频帧的描述。在选择一种方法时，由于视频数据的技术特点和性质——视频帧是任意维数的矩阵，因此特别注意的是初始数据维数的普遍性。可以跳过矩阵变换到平方维的步骤，或者使用一些描述符进行矢量化，这样可以减少这种变换所需的计算成本。决定使用Ky范范范数的值作为图像描述符，因为它是建立在矩阵奇异值之上的。众所周知，奇异值是在矩阵的奇异分解过程中计算出来的，除了其他特征外，它还可以用来降低源数据的维数。奇异分解不会对原始矩阵的维数或元素的特征施加限制。此外，它还可以用于导出具有所需特征的其他矩阵分解。在使用k-范数和1-范数的情况下，对所获得的描述符的有效性进行了比较分析，结果表明，1-范数允许我们识别场景中最显著的变化，而k-范数能够检测次要的变化。换句话说，根据源视频数据的上下文和开发的应用程序的范围，可以通过改变所涉及的奇异值的数量来配置应用程序对场景变化的敏感性。对视频场景情境中是否存在变化的判断是基于对两个连续图像的描述符的比较，即Ky Fan范数的值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

KY FAN NORM APPLICATION FOR VIDEO SEGMENTATION

This article presents results of applying the KyFan norm in the context of solving the problem of video segmentation. Since the task of video analysis can be considered as analysis of the sequence of images, it was decided to find a way to formalize the description of the video frame using the mathematical apparatus of non-square matrices. When choosing a method, particular attention was paid precisely to universality with respect to the dimension of the initial data due to the technical characteristics and nature of the video data -video frames are matrices of arbitrary dimension. The ability to skip the step of matrix transformation to square dimension, or vectorization using some descriptor allows you to reduce computational costsrequired for this transformation. It was decided to use the value of the Ky Fan norm as an image descriptor, since it is built on top of matrix singular values. As it is known, singular values are calculated during the singular decomposition of the matrix and can be used, among other features, to reduce the dimension of the source data. A singular decomposition does not impose restrictions on either the dimension or the character of the elements of the original matrix. In addition, it can be used to derive other matrix decompositions with required characteristics. A comparative analysis of the effectiveness of the obtained descriptor was carried out in the case of using the k-norm and 1-norm, which showed that the 1-norm allows us to identify the most significant changes in the scene, while k -norm is able to detect minor. In other words, depending on the context of the source video data and the scope of the developed application, it is possible to configure the sensitivity of the application to a change in the scene by varying thenumber of singular values involved. The decision about the presence of changes in the context of video scene is made based on a comparison of descriptors of two consecutive images, that is, the values of the Ky Fan norm.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Herald of Advanced Information Technology

自引率

0.00%

发文量