Proceedings of the 2020 5th International Conference on Machine Learning Technologies最新文献

筛选
英文 中文
Comparative Analysis Using Supervised Learning Methods for Anti-Money Laundering in Bitcoin 基于监督学习方法的比特币反洗钱比较分析
Ismail Alarab, S. Prakoonwit, Mohamed Ikbal Nacer
{"title":"Comparative Analysis Using Supervised Learning Methods for Anti-Money Laundering in Bitcoin","authors":"Ismail Alarab, S. Prakoonwit, Mohamed Ikbal Nacer","doi":"10.1145/3409073.3409078","DOIUrl":"https://doi.org/10.1145/3409073.3409078","url":null,"abstract":"With the advance of Bitcoin technology, money laundering has been incentivised as a den of Bitcoin blockchain, in which the user's identity is hidden behind a pseudonym known as address. Although this trait permits concealing in the plain sight, the public ledger of Bitcoin blockchain provides more power for investigators and allows collective intelligence for anti-money laundering and forensic analysis. This fascinating paradox arises in the strength of Bitcoin technology. Machine learning techniques have attained promising results in forensic analysis, in order to spot suspicious behaviour in Bitcoin blockchain. This paper presents a comparative analysis of the performance of classical supervised learning methods using a recently published data set derived from Bitcoin blockchain, to predict licit and illicit transactions in the network. Besides, an ensemble learning method is utilised using a combination of the given supervised learning models, which outperforms the given classical methods. This experiment is performed using a newly published data set derived from Bitcoin blockchain. Our main contribution points out that using ensemble learning approach outperforms the performance of the classical learning models used in the original paper, using Elliptic data set, a time series of Bitcoin transaction graph with node transactions and directed payments flow edges. Using the same data set, we show that we are able to predict licit/illicit transactions with an accuracy of 98.13% and F1 score equals to 83.36% using the proposed method. We discuss the variety of supervised learning methods, and their capabilities of assisting forensic analysis, and propose future work directions.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114979794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Byte Visualization Method for Malware Classification 恶意软件分类的字节可视化方法
Zhuojun Ren, Guang Chen, Wenke Lu
{"title":"Byte Visualization Method for Malware Classification","authors":"Zhuojun Ren, Guang Chen, Wenke Lu","doi":"10.1145/3409073.3409093","DOIUrl":"https://doi.org/10.1145/3409073.3409093","url":null,"abstract":"The exponential increase in the number of malware stems from the fact that attackers often create malware variants with automated tools. And automated tools generally tend to reuse similar function modules. It is essential, therefore, that security analysts distinguish malware families by recognizing similar modules. For this reason, we present a new visualization method for malware pedigree analysis, using visual similarities in the byte distributions of malware to implement classification. The method converts malware samples into dot plot patterns, and then searches for k-nearest neighbors of every tested sample with the Jaccard distance to determine its family. To evaluate the classification performance of the proposed method, we randomly collected 771 harmful binary files from 72 malware families on the VX Heavens website. With the value of k varying between 1 and 9, our method had the best accuracy of 92.48% when k = 1.The experimental results show that the proposed method can distinguish malware families effectively.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129515107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
YU-net Lung Segment Image Preprocess Methods Used for Common Chest Diseases Prediction 用于常见胸部疾病预测的YU-net肺段图像预处理方法
Haoxiong Yu, Xianbo Xu, Ziqi Zhao, Dancheng Li
{"title":"YU-net Lung Segment Image Preprocess Methods Used for Common Chest Diseases Prediction","authors":"Haoxiong Yu, Xianbo Xu, Ziqi Zhao, Dancheng Li","doi":"10.1145/3409073.3409074","DOIUrl":"https://doi.org/10.1145/3409073.3409074","url":null,"abstract":"With the availability of large-scale data set of X-ray images and development of CNNs(Convolutional Neural Networks), using CNNs assist diagnose become more and more popular. But training CNNs using global image may be affected by the excessive irrelevant noisy areas. Due to the poor alignment of some Chest X-ray(CXR) images, the existence of irregular border hinders the neural network performance. In our work, we address the above problems by proposing a YU-net to segment lung fields on CXR images based on U-net, remove those areas of the images outside the lungs. In order to prove the effectiveness of YU-net, we trained, validated and tested the same 112,120 pictures of 30,536 patients on ResNet-50 and DenseNet-121 with both original Chest X-ray images and YU-net cleaned images. Compare the predicted result of DenseNet-121 and ResNet-50 with both YU-net processed images and original dataset, we found that use the YU-net cleaned images improve the performance of CNNs to recognize the multiple common thorax diseases.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"579 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116205729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Application of Analytic Hierarchy Process-Fuzzy Comprehensive Evaluation in Public Transport of Ulaanbaatar City, Mongolia 层次分析法-模糊综合评价在蒙古乌兰巴托市公共交通中的应用
Khaliun Nasanjargal, Jing Lu
{"title":"Application of Analytic Hierarchy Process-Fuzzy Comprehensive Evaluation in Public Transport of Ulaanbaatar City, Mongolia","authors":"Khaliun Nasanjargal, Jing Lu","doi":"10.1145/3409073.3409075","DOIUrl":"https://doi.org/10.1145/3409073.3409075","url":null,"abstract":"Ulaanbaatar is considered as one of the most congested cities in the world. The public transport system of a city is a great indicator of its relative level of development. This article applies a combined model composed of the Analytic Hierarchy Process method and the Fuzzy Comprehensive Evaluation method to evaluate the current condition of the bus public transport in Ulaanbaatar, Mongolia. The results found that the current condition of bus public transport in Ulaanbaatar is average, with satisfaction factor as the main determining factor of Ulaanbaatar's bus public transport condition.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126487416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Logistic Regression with L2 Regularization using ADMM on Spark 基于Spark的ADMM高效L2正则化逻辑回归
Xiao Su
{"title":"Efficient Logistic Regression with L2 Regularization using ADMM on Spark","authors":"Xiao Su","doi":"10.1145/3409073.3409077","DOIUrl":"https://doi.org/10.1145/3409073.3409077","url":null,"abstract":"Linear classification has demonstrated success in many areas of applications. Modern algorithms for linear classification can train reasonably good models while going through the data in only tens of rounds. However, large data often does not fit in the memory of a single machine, which makes the bottleneck in large-scale learning the disk I/O, not the CPU. In this paper, we describe a specific implementation of the Alternating Direction Method of Multipliers (ADMM) algorithm for distributed optimization. This implementation runs logistic regression with L2 regularization over large datasets and does not require a user-tuned learning rate meta-parameter or any tools beyond Spark. We implement this framework in Apache Spark and compare it with the widely used Machine Learning LIBrary (MLLIB) in Apache Spark 2.4","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122221404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Short Text Classification Approach with Event Detection and Conceptual Information 基于事件检测和概念信息的短文本分类方法
Wei Yin, Liping Shen
{"title":"A Short Text Classification Approach with Event Detection and Conceptual Information","authors":"Wei Yin, Liping Shen","doi":"10.1145/3409073.3409091","DOIUrl":"https://doi.org/10.1145/3409073.3409091","url":null,"abstract":"Text classification is an elementary task in Natural Language Processing (NLP). Existing methods, such as Long Short-Term Memory Networks (LSTM) and Attention Mechanism have recently achieved strong performance on multiple NLP related tasks. However, in the field of text classification, their results are often limited by the quality of feature extraction. This phenomenon is particularly prominent in short text classification tasks, since short text does not have enough contextual information compared to paragraphs and documents. To address this challenge, in this article, we propose a method to enhance the semantic information of short text with two aspects: event-level information extracted from text and conceptual information retrieved from external knowledge base. We take event and conceptual information as a type of supplementary knowledge and incorporate it into deep neural networks. Attention mechanism is utilized to measure the importance of the supplementary knowledge. Meanwhile, we have discussed the granularity selection for Chinese word segmentation, and select char-based models. Finally, we classify a short text with the help of event and conceptual information. The experimental results show that the proposed method outperforms the state-of-the-art methods.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133343455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Data Assimilation by Artificial Neural Network using Conventional Observation for WRF Model WRF模型常规观测数据的人工神经网络同化
Shijin Yuan, Bo Shi, Bin Mu
{"title":"Data Assimilation by Artificial Neural Network using Conventional Observation for WRF Model","authors":"Shijin Yuan, Bo Shi, Bin Mu","doi":"10.1145/3409073.3409097","DOIUrl":"https://doi.org/10.1145/3409073.3409097","url":null,"abstract":"In this paper, artificial neural network(ANN) are introduced to data assimilation for WRF model, which is a mesoscale complex model. A particle swarm optimization optimized Multilayer Perception data assimilation (MLP-PSO-DA) model is proposed in order to emulate the ensemble square root filter (EnSRF) analysis. Multilayer Perception is employed and the optimal parameter configurations are automatic obtained by particle swarm optimization (PSO) algorithm. The MLP-PSO-DA is integrated with WRF modeling system for assimilation cycle. The EnSRF analysis fields from July of 2004, 2005 and 2006 are taking as samples to train the model. The ANN-based data assimilation is conducted at July, 2007 with interval of 6h. The prognostic variables analysis fields of MLP-PSO-DA and EnSRF are very similar and the difference between two method is within a small scope. The results prove the effectiveness of MLP-PSO-DA. Meanwhile, the MLP-PSO-DA model has great advantage to speed up the DA process.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116139043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Soybean Yield using Self-normalizing Neural Networks 利用自归一化神经网络预测大豆产量
Kaki Shu
{"title":"Prediction of Soybean Yield using Self-normalizing Neural Networks","authors":"Kaki Shu","doi":"10.1145/3409073.3409092","DOIUrl":"https://doi.org/10.1145/3409073.3409092","url":null,"abstract":"Nowadays, agriculture around the world is facing severe challenges because of global warming and rapid population growth. In order to maximize the agricultural production and minimize the environmental degradation at the same time, careful land-use planning and crop selection come to be crucial, where accurate crop yield prediction plays a key role. This research applies an emerging Deep Learning architecture called Self-normalizing neural networks (SNN) for yield prediction of the soybean, using numerical data obtained from official statistics to ensure high reliability and availability of data. Plentiful work has been done to improve the performance, including careful parameter tuning and application of early stopping and the learning rate scheduler. This study conducts an experiment to evaluate the performance of the model, and the results show that SNN can achieve a lower prediction error with sufficiently large training data compared with traditional machine learning methods, such as Support Vector Regression, and other Deep Learning techniques, such as Batch Normalization.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123470306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Domain Adaptation Based Person-Specific Face Anti-spoofing Using Color Texture Features 基于领域自适应的人脸颜色纹理特征防欺骗
Junwei Zhou, Ke Shu, Dongdong Zhao, Zhe Xia
{"title":"Domain Adaptation Based Person-Specific Face Anti-spoofing Using Color Texture Features","authors":"Junwei Zhou, Ke Shu, Dongdong Zhao, Zhe Xia","doi":"10.1145/3409073.3409087","DOIUrl":"https://doi.org/10.1145/3409073.3409087","url":null,"abstract":"Face anti-spoofing technology is indispensable for the face recognition system, which is vulnerable to malicious spoofing attacks such as printed attacks and replayed video attacks. In this paper, we focus on more challenging cross-database generalization, which can reflect the performance of face anti-spoofing methods in the practical application scenario. In general, the discrepancy between the sample distribution of different databases can be reduced by domain adaptation algorithms. For texture-based face anti-spoofing methods, the identity of subjects can significantly influence the effectiveness of domain adaptation. Thus, in this paper, we propose a domain adaptation based person-specific face anti-spoofing method to improve cross-database generalization. The feature used here is the local directional number pattern (LDN) extracted from HSV and YCbCr color spaces. For each target subject, we synthesize virtual fake samples for training using subject domain adaptation method. To further improve the generalization performance, we use the domain adaptation method to reduce the discrepancy between sample distribution of training and testing samples. To evaluate the performance of the proposed method, we perform cross-database experiments on CASIA and Replay-Attack database. Our method can realize promising generalization performance, outperforming most of the recently proposed methods.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129814720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Top-down Feature Aggregation Block Fusion Network for Salient Object Detection 基于自顶向下特征聚合块融合网络的显著目标检测
Meiyi Li, Lide Zhou
{"title":"Top-down Feature Aggregation Block Fusion Network for Salient Object Detection","authors":"Meiyi Li, Lide Zhou","doi":"10.1145/3409073.3409076","DOIUrl":"https://doi.org/10.1145/3409073.3409076","url":null,"abstract":"The emergence of deep neural networks and full convolutional neural networks has brought great progress to salient object detection. In this paper, we propose a new type of deep full convolutional neural network structure, named top-down feature aggregation block fusion network, which aims to fuse the rich features of feature aggregation blocks at each layer. In addition to the features of this layer, the feature aggregation blocks have other layer features, that is, each layer of feature aggregation blocks has both strong semantic information of the deep network and detailed features of the shallow network. In the top-down fusion process, the residual information of each layer can be learned like ResNet. At the same time, a non-local attention mechanism is introduced to improve the relevance of the context, and multiple auxiliary supervision connections are added to the intermediate layers, so that the network can more easily optimize and accelerate convergence. We have performed experiments on six benchmark datasets, and the results of the experiments show that our model is superior to the state-of-the-art methods both quantitatively and qualitatively.","PeriodicalId":229746,"journal":{"name":"Proceedings of the 2020 5th International Conference on Machine Learning Technologies","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127938201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信