{"title":"基于改进LOF算法的收银数据异常检测","authors":"Kelin Long, Yuhang Wu, Y. Gui","doi":"10.4236/am.2018.96049","DOIUrl":null,"url":null,"abstract":"As the cash register system gradually prevailed in shopping malls, detecting \nthe abnormal status of the cash register system has gradually become a hotspot \nissue. This paper analyzes the transaction data of a shopping mall. When \ncalculating the degree of data difference, the coefficient of variation is used as \nthe attribute weight; the weighted Euclidean distance is used to calculate the \ndegree of difference; and k-means clustering is used to classify different time \nperiods. It applies the LOF algorithm to detect the outlier degree of transaction \ndata at each time period, sets the initial threshold to detect outliers, deletes \nthe outliers, and then performs SAX detection on the data set. If it does \nnot pass the test, then it will gradually expand the outlying domain and repeat \nthe above process to optimize the outlier threshold to improve the sensitivity \nof detection algorithm and reduce false positives.","PeriodicalId":55568,"journal":{"name":"Applied Mathematics-A Journal of Chinese Universities Series B","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2018-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Anomaly Detection of Store Cash Register Data Based on Improved LOF Algorithm\",\"authors\":\"Kelin Long, Yuhang Wu, Y. Gui\",\"doi\":\"10.4236/am.2018.96049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As the cash register system gradually prevailed in shopping malls, detecting \\nthe abnormal status of the cash register system has gradually become a hotspot \\nissue. This paper analyzes the transaction data of a shopping mall. When \\ncalculating the degree of data difference, the coefficient of variation is used as \\nthe attribute weight; the weighted Euclidean distance is used to calculate the \\ndegree of difference; and k-means clustering is used to classify different time \\nperiods. It applies the LOF algorithm to detect the outlier degree of transaction \\ndata at each time period, sets the initial threshold to detect outliers, deletes \\nthe outliers, and then performs SAX detection on the data set. If it does \\nnot pass the test, then it will gradually expand the outlying domain and repeat \\nthe above process to optimize the outlier threshold to improve the sensitivity \\nof detection algorithm and reduce false positives.\",\"PeriodicalId\":55568,\"journal\":{\"name\":\"Applied Mathematics-A Journal of Chinese Universities Series B\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2018-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Mathematics-A Journal of Chinese Universities Series B\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.4236/am.2018.96049\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematics-A Journal of Chinese Universities Series B","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.4236/am.2018.96049","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Anomaly Detection of Store Cash Register Data Based on Improved LOF Algorithm
As the cash register system gradually prevailed in shopping malls, detecting
the abnormal status of the cash register system has gradually become a hotspot
issue. This paper analyzes the transaction data of a shopping mall. When
calculating the degree of data difference, the coefficient of variation is used as
the attribute weight; the weighted Euclidean distance is used to calculate the
degree of difference; and k-means clustering is used to classify different time
periods. It applies the LOF algorithm to detect the outlier degree of transaction
data at each time period, sets the initial threshold to detect outliers, deletes
the outliers, and then performs SAX detection on the data set. If it does
not pass the test, then it will gradually expand the outlying domain and repeat
the above process to optimize the outlier threshold to improve the sensitivity
of detection algorithm and reduce false positives.
期刊介绍:
Applied Mathematics promotes the integration of mathematics with other scientific disciplines, expanding its fields of study and promoting the development of relevant interdisciplinary subjects.
The journal mainly publishes original research papers that apply mathematical concepts, theories and methods to other subjects such as physics, chemistry, biology, information science, energy, environmental science, economics, and finance. In addition, it also reports the latest developments and trends in which mathematics interacts with other disciplines. Readers include professors and students, professionals in applied mathematics, and engineers at research institutes and in industry.
Applied Mathematics - A Journal of Chinese Universities has been an English-language quarterly since 1993. The English edition, abbreviated as Series B, has different contents than this Chinese edition, Series A.