Max Z. Li, Karthik Gopalakrishnan, Yanjun Wang, H. Balakrishnan
{"title":"美国和中国机场延误分布的离群值分析","authors":"Max Z. Li, Karthik Gopalakrishnan, Yanjun Wang, H. Balakrishnan","doi":"10.1109/AIDA-AT48540.2020.9049208","DOIUrl":null,"url":null,"abstract":"Outlier detection is a key component of several machine learning approaches. However, many existing techniques, especially for multi-dimensional signals, are not interpretable and do not explain why a specific classification was assigned to a particular data point. Another limitation is that most methods only consider the magnitude or intensity of the signal, and not its spatial distribution. We present a spectral approach to identify outliers based on the spatial distribution of a signal across the nodes of a graph without any explicit assumptions on the underlying probability distribution of the signal. By applying these techniques to airport delays, we not only identify outliers in the spatial distribution of delays, but also gain insights into the delay dynamics. Specifically, we compare spatial delay distributions in the US and China during the period 2012–17, and identify several interesting characteristics pertaining to critical airports for outlier detection. We characterize typical variabilities in the delay distributions, and the frequency of occurrence of outliers. Our results highlight the differences between the operational dynamics of the US and Chinese air transportation systems, and contribute to performance benchmarking between different airspace systems.","PeriodicalId":106277,"journal":{"name":"2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Outlier Analysis of Airport Delay Distributions in US and China\",\"authors\":\"Max Z. Li, Karthik Gopalakrishnan, Yanjun Wang, H. Balakrishnan\",\"doi\":\"10.1109/AIDA-AT48540.2020.9049208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Outlier detection is a key component of several machine learning approaches. However, many existing techniques, especially for multi-dimensional signals, are not interpretable and do not explain why a specific classification was assigned to a particular data point. Another limitation is that most methods only consider the magnitude or intensity of the signal, and not its spatial distribution. We present a spectral approach to identify outliers based on the spatial distribution of a signal across the nodes of a graph without any explicit assumptions on the underlying probability distribution of the signal. By applying these techniques to airport delays, we not only identify outliers in the spatial distribution of delays, but also gain insights into the delay dynamics. Specifically, we compare spatial delay distributions in the US and China during the period 2012–17, and identify several interesting characteristics pertaining to critical airports for outlier detection. We characterize typical variabilities in the delay distributions, and the frequency of occurrence of outliers. Our results highlight the differences between the operational dynamics of the US and Chinese air transportation systems, and contribute to performance benchmarking between different airspace systems.\",\"PeriodicalId\":106277,\"journal\":{\"name\":\"2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AIDA-AT48540.2020.9049208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIDA-AT48540.2020.9049208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Outlier Analysis of Airport Delay Distributions in US and China
Outlier detection is a key component of several machine learning approaches. However, many existing techniques, especially for multi-dimensional signals, are not interpretable and do not explain why a specific classification was assigned to a particular data point. Another limitation is that most methods only consider the magnitude or intensity of the signal, and not its spatial distribution. We present a spectral approach to identify outliers based on the spatial distribution of a signal across the nodes of a graph without any explicit assumptions on the underlying probability distribution of the signal. By applying these techniques to airport delays, we not only identify outliers in the spatial distribution of delays, but also gain insights into the delay dynamics. Specifically, we compare spatial delay distributions in the US and China during the period 2012–17, and identify several interesting characteristics pertaining to critical airports for outlier detection. We characterize typical variabilities in the delay distributions, and the frequency of occurrence of outliers. Our results highlight the differences between the operational dynamics of the US and Chinese air transportation systems, and contribute to performance benchmarking between different airspace systems.