{"title":"Synopsis for microbiological data stream analysis","authors":"Gianfranco Cellarosi, Claudio Sartori","doi":"10.1109/CBMS.2005.96","DOIUrl":null,"url":null,"abstract":"This paper derives from the extensive analysis of the results of microbiological laboratory output in a large hospital. In this environment data items are automatically collected by transactions linked to the analysis devices and the data flow rate can be very high. In addition, the time of the data items is extremely relevant, since a variation of the number of positive findings can be generated by dangerous events, such as outbreaks. The typical setting of data mining applications is to process data, in search of some hidden information. Unfortunately in this environment, we have many transactions and everyone of these is a potentially important information. When the data set changes, some kind of re-computation, either from scratch or according to differences only, has to be done. Things change when the change rate increases, so as to make difficult to process changes before new changes arrive. We define this particular situation as a data stream, and we devise a framework allowing efficient analysis of data streams. We propose a base set of techniques, which can be used to analyze data streams in Euclidean space, putting together information processing and statistical techniques. In particular, we are interested in detecting \"alarms\" in microbiological time series, that is points in time where the detected data differ from the values expected on the basis of past history. We provide also experimental results, based on real and data.","PeriodicalId":119367,"journal":{"name":"18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2005.96","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper derives from the extensive analysis of the results of microbiological laboratory output in a large hospital. In this environment data items are automatically collected by transactions linked to the analysis devices and the data flow rate can be very high. In addition, the time of the data items is extremely relevant, since a variation of the number of positive findings can be generated by dangerous events, such as outbreaks. The typical setting of data mining applications is to process data, in search of some hidden information. Unfortunately in this environment, we have many transactions and everyone of these is a potentially important information. When the data set changes, some kind of re-computation, either from scratch or according to differences only, has to be done. Things change when the change rate increases, so as to make difficult to process changes before new changes arrive. We define this particular situation as a data stream, and we devise a framework allowing efficient analysis of data streams. We propose a base set of techniques, which can be used to analyze data streams in Euclidean space, putting together information processing and statistical techniques. In particular, we are interested in detecting "alarms" in microbiological time series, that is points in time where the detected data differ from the values expected on the basis of past history. We provide also experimental results, based on real and data.