{"title":"Sequential Data Classification under Dynamic Emission","authors":"L. Aslanyan","doi":"10.1134/s1054661824010048","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Sequential data are ubiquitous and widely available in a range of applications in almost all areas. We aim at considering medical, metrological and motion capturing type applications in terms of sequential data analytics in general, and classification in particular. Two scenarios are considered. The first starts with a pass through the initial sequential data database, performing training/learning of set of classes–medical conditions of patients in case of the dynamic treatment regime problems. This learned procedure will be used during the automated classification of new patients. Before starting the next, second pass, we form a confusion matrix based on the learned classification algorithm, and we form a transition matrix, which can be obtained in two ways: by the original database and alternatively by the data classified by the trained algorithm. The second pass is designed to correct the original classification with help of an additional hidden Markov type model (HMM), based on the mentioned two matrices as transition and emission matrices. The database (set of trellises, the training set) has a lattice structure. A part of the trellis tracks end at the target class (important in dynamic treatment regime applications, sometime associated with the healthy class). The trained classification, applied to the training set, can change the set of tracks ending at the target class, which forms one of the performance indicators of this algorithm. The next scenario also is based on the HMM type model. If one takes a lattice track, treating it as a sequence of observations, then HMM can improve that sequence by generating a complementary sequence, similar to the sequence of Viterbi states of HMM. It can also change the set of tracks ending at the target class, which forms the next performance measure, this time for the HMM procedure. Convergence to the target class is characterized by the convergence of the degrees of the transition matrices to the simple special case of such matrices. Alternatively, by extracting the root of the convergent matrices, the corresponding characterization of the transition matrix can be obtained so that the convergence is guaranteed. This work is mostly methodological than innovative being a complementary part to our previous work on target class classification topics. In the experimental part of this work we considered a root-oriented directed acyclic graphs that correspond to the target class classification policy. On the model of this graphs, a random set of tracks is generated, forming a so-called synthetic training set, synthetic trellis.</p>","PeriodicalId":35400,"journal":{"name":"PATTERN RECOGNITION AND IMAGE ANALYSIS","volume":"102 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PATTERN RECOGNITION AND IMAGE ANALYSIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1134/s1054661824010048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Sequential data are ubiquitous and widely available in a range of applications in almost all areas. We aim at considering medical, metrological and motion capturing type applications in terms of sequential data analytics in general, and classification in particular. Two scenarios are considered. The first starts with a pass through the initial sequential data database, performing training/learning of set of classes–medical conditions of patients in case of the dynamic treatment regime problems. This learned procedure will be used during the automated classification of new patients. Before starting the next, second pass, we form a confusion matrix based on the learned classification algorithm, and we form a transition matrix, which can be obtained in two ways: by the original database and alternatively by the data classified by the trained algorithm. The second pass is designed to correct the original classification with help of an additional hidden Markov type model (HMM), based on the mentioned two matrices as transition and emission matrices. The database (set of trellises, the training set) has a lattice structure. A part of the trellis tracks end at the target class (important in dynamic treatment regime applications, sometime associated with the healthy class). The trained classification, applied to the training set, can change the set of tracks ending at the target class, which forms one of the performance indicators of this algorithm. The next scenario also is based on the HMM type model. If one takes a lattice track, treating it as a sequence of observations, then HMM can improve that sequence by generating a complementary sequence, similar to the sequence of Viterbi states of HMM. It can also change the set of tracks ending at the target class, which forms the next performance measure, this time for the HMM procedure. Convergence to the target class is characterized by the convergence of the degrees of the transition matrices to the simple special case of such matrices. Alternatively, by extracting the root of the convergent matrices, the corresponding characterization of the transition matrix can be obtained so that the convergence is guaranteed. This work is mostly methodological than innovative being a complementary part to our previous work on target class classification topics. In the experimental part of this work we considered a root-oriented directed acyclic graphs that correspond to the target class classification policy. On the model of this graphs, a random set of tracks is generated, forming a so-called synthetic training set, synthetic trellis.
期刊介绍:
The purpose of the journal is to publish high-quality peer-reviewed scientific and technical materials that present the results of fundamental and applied scientific research in the field of image processing, recognition, analysis and understanding, pattern recognition, artificial intelligence, and related fields of theoretical and applied computer science and applied mathematics. The policy of the journal provides for the rapid publication of original scientific articles, analytical reviews, articles of the world''s leading scientists and specialists on the subject of the journal solicited by the editorial board, special thematic issues, proceedings of the world''s leading scientific conferences and seminars, as well as short reports containing new results of fundamental and applied research in the field of mathematical theory and methodology of image analysis, mathematical theory and methodology of image recognition, and mathematical foundations and methodology of artificial intelligence. The journal also publishes articles on the use of the apparatus and methods of the mathematical theory of image analysis and the mathematical theory of image recognition for the development of new information technologies and their supporting software and algorithmic complexes and systems for solving complex and particularly important applied problems. The main scientific areas are the mathematical theory of image analysis and the mathematical theory of pattern recognition. The journal also embraces the problems of analyzing and evaluating poorly formalized, poorly structured, incomplete, contradictory and noisy information, including artificial intelligence, bioinformatics, medical informatics, data mining, big data analysis, machine vision, data representation and modeling, data and knowledge extraction from images, machine learning, forecasting, machine graphics, databases, knowledge bases, medical and technical diagnostics, neural networks, specialized software, specialized computational architectures for information analysis and evaluation, linguistic, psychological, psychophysical, and physiological aspects of image analysis and pattern recognition, applied problems, and related problems. Articles can be submitted either in English or Russian. The English language is preferable. Pattern Recognition and Image Analysis is a hybrid journal that publishes mostly subscription articles that are free of charge for the authors, but also accepts Open Access articles with article processing charges. The journal is one of the top 10 global periodicals on image analysis and pattern recognition and is the only publication on this topic in the Russian Federation, Central and Eastern Europe.