{"title":"Trustworthy semi-supervised anomaly detection for online-to-offline logistics business in merchant identification","authors":"Yong Li, Shuhang Wang, Shijie Xu, Jiao Yin","doi":"10.1049/cit2.12301","DOIUrl":null,"url":null,"abstract":"<p>The rise of online-to-offline (O2O) e-commerce business has brought tremendous opportunities to the logistics industry. In the online-to-offline logistics business, it is essential to detect anomaly merchants with fraudulent shipping behaviours, such as sending other merchants' packages for profit with their low discounts. This can help reduce the financial losses of platforms and ensure a healthy environment. Existing anomaly detection studies have mainly focused on online fraud behaviour detection, such as fraudulent purchase and comment behaviours in e-commerce. However, these methods are not suitable for anomaly merchant detection in logistics due to the more complex online and offline operation of package-sending behaviours and the interpretable requirements of offline deployment in logistics. MultiDet, a semi-supervised multi-view fusion-based Anomaly Detection framework in online-to-offline logistics is proposed, which consists of a basic version SemiDet and an attention-enhanced multi-view fusion model. In SemiDet, pair-wise data augmentation is first conducted to promote model robustness and address the challenge of limited labelled anomaly instances. Then, SemiDet calculates the anomaly scoring of each merchant with an auto-encoder framework. Considering the multi-relationships among logistics merchants, a multi-view attention fusion-based anomaly detection network is further designed to capture merchants' mutual influences and improve the anomaly merchant detection performance. A post-hoc perturbation-based interpretation model is designed to output the importance of different views and ensure the trustworthiness of end-to-end anomaly detection. The framework based on an eight-month real-world dataset collected from one of the largest logistics platforms in China is evaluated, involving 6128 merchants and 16 million historical order consignor records in Beijing. Experimental results show that the proposed model outperforms other baselines in both AUC-ROC and AUC-PR metrics.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 3","pages":"544-556"},"PeriodicalIF":8.4000,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12301","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.12301","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The rise of online-to-offline (O2O) e-commerce business has brought tremendous opportunities to the logistics industry. In the online-to-offline logistics business, it is essential to detect anomaly merchants with fraudulent shipping behaviours, such as sending other merchants' packages for profit with their low discounts. This can help reduce the financial losses of platforms and ensure a healthy environment. Existing anomaly detection studies have mainly focused on online fraud behaviour detection, such as fraudulent purchase and comment behaviours in e-commerce. However, these methods are not suitable for anomaly merchant detection in logistics due to the more complex online and offline operation of package-sending behaviours and the interpretable requirements of offline deployment in logistics. MultiDet, a semi-supervised multi-view fusion-based Anomaly Detection framework in online-to-offline logistics is proposed, which consists of a basic version SemiDet and an attention-enhanced multi-view fusion model. In SemiDet, pair-wise data augmentation is first conducted to promote model robustness and address the challenge of limited labelled anomaly instances. Then, SemiDet calculates the anomaly scoring of each merchant with an auto-encoder framework. Considering the multi-relationships among logistics merchants, a multi-view attention fusion-based anomaly detection network is further designed to capture merchants' mutual influences and improve the anomaly merchant detection performance. A post-hoc perturbation-based interpretation model is designed to output the importance of different views and ensure the trustworthiness of end-to-end anomaly detection. The framework based on an eight-month real-world dataset collected from one of the largest logistics platforms in China is evaluated, involving 6128 merchants and 16 million historical order consignor records in Beijing. Experimental results show that the proposed model outperforms other baselines in both AUC-ROC and AUC-PR metrics.
期刊介绍:
CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.