Sequential Data Classification under Dynamic Emission

IF 0.7 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

PATTERN RECOGNITION AND IMAGE ANALYSIS Pub Date : 2024-04-10 DOI:10.1134/s1054661824010048

L. Aslanyan

{"title":"Sequential Data Classification under Dynamic Emission","authors":"L. Aslanyan","doi":"10.1134/s1054661824010048","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Sequential data are ubiquitous and widely available in a range of applications in almost all areas. We aim at considering medical, metrological and motion capturing type applications in terms of sequential data analytics in general, and classification in particular. Two scenarios are considered. The first starts with a pass through the initial sequential data database, performing training/learning of set of classes–medical conditions of patients in case of the dynamic treatment regime problems. This learned procedure will be used during the automated classification of new patients. Before starting the next, second pass, we form a confusion matrix based on the learned classification algorithm, and we form a transition matrix, which can be obtained in two ways: by the original database and alternatively by the data classified by the trained algorithm. The second pass is designed to correct the original classification with help of an additional hidden Markov type model (HMM), based on the mentioned two matrices as transition and emission matrices. The database (set of trellises, the training set) has a lattice structure. A part of the trellis tracks end at the target class (important in dynamic treatment regime applications, sometime associated with the healthy class). The trained classification, applied to the training set, can change the set of tracks ending at the target class, which forms one of the performance indicators of this algorithm. The next scenario also is based on the HMM type model. If one takes a lattice track, treating it as a sequence of observations, then HMM can improve that sequence by generating a complementary sequence, similar to the sequence of Viterbi states of HMM. It can also change the set of tracks ending at the target class, which forms the next performance measure, this time for the HMM procedure. Convergence to the target class is characterized by the convergence of the degrees of the transition matrices to the simple special case of such matrices. Alternatively, by extracting the root of the convergent matrices, the corresponding characterization of the transition matrix can be obtained so that the convergence is guaranteed. This work is mostly methodological than innovative being a complementary part to our previous work on target class classification topics. In the experimental part of this work we considered a root-oriented directed acyclic graphs that correspond to the target class classification policy. On the model of this graphs, a random set of tracks is generated, forming a so-called synthetic training set, synthetic trellis.</p>","PeriodicalId":35400,"journal":{"name":"PATTERN RECOGNITION AND IMAGE ANALYSIS","volume":"102 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PATTERN RECOGNITION AND IMAGE ANALYSIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1134/s1054661824010048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Sequential data are ubiquitous and widely available in a range of applications in almost all areas. We aim at considering medical, metrological and motion capturing type applications in terms of sequential data analytics in general, and classification in particular. Two scenarios are considered. The first starts with a pass through the initial sequential data database, performing training/learning of set of classes–medical conditions of patients in case of the dynamic treatment regime problems. This learned procedure will be used during the automated classification of new patients. Before starting the next, second pass, we form a confusion matrix based on the learned classification algorithm, and we form a transition matrix, which can be obtained in two ways: by the original database and alternatively by the data classified by the trained algorithm. The second pass is designed to correct the original classification with help of an additional hidden Markov type model (HMM), based on the mentioned two matrices as transition and emission matrices. The database (set of trellises, the training set) has a lattice structure. A part of the trellis tracks end at the target class (important in dynamic treatment regime applications, sometime associated with the healthy class). The trained classification, applied to the training set, can change the set of tracks ending at the target class, which forms one of the performance indicators of this algorithm. The next scenario also is based on the HMM type model. If one takes a lattice track, treating it as a sequence of observations, then HMM can improve that sequence by generating a complementary sequence, similar to the sequence of Viterbi states of HMM. It can also change the set of tracks ending at the target class, which forms the next performance measure, this time for the HMM procedure. Convergence to the target class is characterized by the convergence of the degrees of the transition matrices to the simple special case of such matrices. Alternatively, by extracting the root of the convergent matrices, the corresponding characterization of the transition matrix can be obtained so that the convergence is guaranteed. This work is mostly methodological than innovative being a complementary part to our previous work on target class classification topics. In the experimental part of this work we considered a root-oriented directed acyclic graphs that correspond to the target class classification policy. On the model of this graphs, a random set of tracks is generated, forming a so-called synthetic training set, synthetic trellis.

Abstract Image

查看原文本刊更多论文

动态排放下的序列数据分类

摘要序列数据无处不在，广泛应用于几乎所有领域。我们旨在从顺序数据分析，特别是分类的角度来考虑医疗、计量和运动捕捉类型的应用。我们考虑了两种情况。第一种情况是从通过初始序列数据数据库开始，在动态治疗机制问题的情况下，对一组类别--患者的医疗条件--进行训练/学习。在对新病人进行自动分类时，将使用这一学习程序。在开始下一步，即第二步之前，我们会根据学习到的分类算法形成混淆矩阵，并形成过渡矩阵，过渡矩阵可以通过两种方式获得：通过原始数据库和通过训练算法分类的数据。第二步的目的是借助额外的隐马尔可夫模型（HMM）修正原始分类，该模型以上述两个矩阵为过渡矩阵和排放矩阵。数据库（花架集，即训练集）具有网格结构。部分花架轨迹以目标类别为终点（在动态处理系统应用中很重要，有时与健康类别相关）。应用于训练集的训练分类法可以改变以目标类别为终点的轨迹集，这也是该算法的性能指标之一。下一种情况也是基于 HMM 模型。如果将网格轨迹视为观测序列，那么 HMM 可以通过生成一个互补序列来改进该序列，这与 HMM 的维特比状态序列类似。它还能改变以目标类别为终点的轨迹集，这就形成了下一个性能指标，这次是针对 HMM 程序的。收敛到目标类别的特征是过渡矩阵的度数收敛到这种矩阵的简单特例。或者，通过提取收敛矩阵的根，可以得到过渡矩阵的相应特征，从而保证收敛。这项工作主要是方法论方面的，而不是创新方面的，是对我们之前关于目标类分类主题工作的补充。在这项工作的实验部分，我们考虑了与目标类别分类策略相对应的面向根的有向无环图。在该图的模型上，随机生成一组轨迹，形成所谓的合成训练集，即合成树状图。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PATTERN RECOGNITION AND IMAGE ANALYSIS Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

1.80

自引率

20.00%

发文量

期刊介绍： The purpose of the journal is to publish high-quality peer-reviewed scientific and technical materials that present the results of fundamental and applied scientific research in the field of image processing, recognition, analysis and understanding, pattern recognition, artificial intelligence, and related fields of theoretical and applied computer science and applied mathematics. The policy of the journal provides for the rapid publication of original scientific articles, analytical reviews, articles of the world''s leading scientists and specialists on the subject of the journal solicited by the editorial board, special thematic issues, proceedings of the world''s leading scientific conferences and seminars, as well as short reports containing new results of fundamental and applied research in the field of mathematical theory and methodology of image analysis, mathematical theory and methodology of image recognition, and mathematical foundations and methodology of artificial intelligence. The journal also publishes articles on the use of the apparatus and methods of the mathematical theory of image analysis and the mathematical theory of image recognition for the development of new information technologies and their supporting software and algorithmic complexes and systems for solving complex and particularly important applied problems. The main scientific areas are the mathematical theory of image analysis and the mathematical theory of pattern recognition. The journal also embraces the problems of analyzing and evaluating poorly formalized, poorly structured, incomplete, contradictory and noisy information, including artificial intelligence, bioinformatics, medical informatics, data mining, big data analysis, machine vision, data representation and modeling, data and knowledge extraction from images, machine learning, forecasting, machine graphics, databases, knowledge bases, medical and technical diagnostics, neural networks, specialized software, specialized computational architectures for information analysis and evaluation, linguistic, psychological, psychophysical, and physiological aspects of image analysis and pattern recognition, applied problems, and related problems. Articles can be submitted either in English or Russian. The English language is preferable. Pattern Recognition and Image Analysis is a hybrid journal that publishes mostly subscription articles that are free of charge for the authors, but also accepts Open Access articles with article processing charges. The journal is one of the top 10 global periodicals on image analysis and pattern recognition and is the only publication on this topic in the Russian Federation, Central and Eastern Europe.