{"title":"Comparing finite sequences of discrete events with non-uniform time intervals","authors":"Alexander C. Murph, A. Flynt, Brian R. King","doi":"10.1080/07474946.2021.1940491","DOIUrl":null,"url":null,"abstract":"Abstract Algorithms that quantify the similarity between two sequences of data date back to the mid 20th century. Sequence comparison continues to be active area of research in mathematics, statistics, and computer science, with applications to a wide number of fields including, biology, marketing, and linguistics. While many methods exist for comparing sequences of discrete events, this paper presents a novel method to compare such sequences while also utilizing information available from non-uniform time intervals between events. The Sequence Alignment with Non-Uniform Time Intervals (SAWNUTI) method, an extension of the Smith-Waterman and Needleman-Wunch algorithms, is described and evaluated using a simulation study and two real-world medical data sets (diabetes and eye tracking). Results illustrate the necessity of this method when time is important to consider in the comparison of sequences.","PeriodicalId":48879,"journal":{"name":"Sequential Analysis-Design Methods and Applications","volume":"40 1","pages":"291 - 313"},"PeriodicalIF":0.6000,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sequential Analysis-Design Methods and Applications","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/07474946.2021.1940491","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Algorithms that quantify the similarity between two sequences of data date back to the mid 20th century. Sequence comparison continues to be active area of research in mathematics, statistics, and computer science, with applications to a wide number of fields including, biology, marketing, and linguistics. While many methods exist for comparing sequences of discrete events, this paper presents a novel method to compare such sequences while also utilizing information available from non-uniform time intervals between events. The Sequence Alignment with Non-Uniform Time Intervals (SAWNUTI) method, an extension of the Smith-Waterman and Needleman-Wunch algorithms, is described and evaluated using a simulation study and two real-world medical data sets (diabetes and eye tracking). Results illustrate the necessity of this method when time is important to consider in the comparison of sequences.
期刊介绍:
The purpose of Sequential Analysis is to contribute to theoretical and applied aspects of sequential methodologies in all areas of statistical science. Published papers highlight the development of new and important sequential approaches.
Interdisciplinary articles that emphasize the methodology of practical value to applied researchers and statistical consultants are highly encouraged. Papers that cover contemporary areas of applications including animal abundance, bioequivalence, communication science, computer simulations, data mining, directional data, disease mapping, environmental sampling, genome, imaging, microarrays, networking, parallel processing, pest management, sonar detection, spatial statistics, tracking, and engineering are deemed especially important. Of particular value are expository review articles that critically synthesize broad-based statistical issues. Papers on case-studies are also considered. All papers are refereed.