{"title":"Performance comparison and optimization of mainstream NIDS systems in offline mode based on parallel processing technology","authors":"Tianyang Zhou","doi":"10.1109/CDS52072.2021.00030","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00030","url":null,"abstract":"For the network intrusion detection system (NIDS), improving the performance of the analysis process has always been one of the primary goals that NIDS needs to solve. An important method to improve performance is to use parallel processing technology to maximize the usage of multi-core CPU resources. In this paper, by splitting Pcap data packets, the NIDS software Snort3 can process Pcap packets in parallel mode. On this basis, this paper compares the performance between Snort2, Suricata, and Snort3 with different CPU cores in processing different sizes of Pcap data packets. At the same time, a parallel unpacking algorithm is proposed to further improve the parallel processing performance of Snort3.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125863546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-language Handwritten Numeral Recognition with Convolutional Neural Network","authors":"Yihan Wang","doi":"10.1109/CDS52072.2021.00082","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00082","url":null,"abstract":"Compared with handcrafted features, convolutional neural network (CNN) is a more effective model to solve the handwritten numeral recognition problem. In recent years, many different datasets have appeared, but there is a lack of a collection of multi-language handwritten numeral datasets, and the evaluation of multi-language handwritten numeral recognition for CNN is lacking. In this paper, we collect and present the biggest dataset for the multi-language handwritten numeral recognition problem ever, consisting of 15 different languages. We also contribute two baseline CNNs and evaluate them in this newly combined dataset. We found that LeNet is more effective than a more complex CNN. We also found that Devanagari and Telugu are the most difficult to distinguish when mixed with other similar languages.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122194606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predictive Modeling of Wildfires in the United States","authors":"Lang Qin, W. Shao, Guofei Du, Junlin Mou, R. Bi","doi":"10.1109/CDS52072.2021.00102","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00102","url":null,"abstract":"This research utilizes wildfire records between 1911 and 2015 to train various models to predict fire size through using temperature, wind, humidity, and precipitation as features. Our results show 1) Decision Tree based Classifier outperforms both linear and ridge regression 2) Government entities can leverage our methodology to manage wildfires more efficiently, effectively, and decreasing monetary damages.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"186 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123011670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High Precision Autocollimation Measurement Technology Based on Image Recognition","authors":"Li Zhongtang, Liu Mingyao, Luan Jianfeng","doi":"10.1109/CDS52072.2021.00028","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00028","url":null,"abstract":"Autocollimator is an important optical detection instrument, which is mainly used to measure small angle, and plays an important role in the field of optical equipment manufacturing and debugging. For many years, the autocollimation accuracy of collimator has been one of the factors affecting its performance. In order to improve the autocollimation accuracy of collimator optical axis, an auxiliary calibration method based on image recognition is proposed in this paper. And the concrete realization idea of this method is given. The accuracy of this method to detect and identify the optical axis eccentricity error can reach 0.1 pixel. Taking the detection data as reference to adjust the optical axis, the autocollimation accuracy of collimator optical axis can be greatly improved.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"17 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120857099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparison of stock price prediction ability based on GARCH and BP_ANN","authors":"Ercheng Liu","doi":"10.1109/CDS52072.2021.00021","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00021","url":null,"abstract":"This article first summarizes the history and development of two types of methods in the field of forecasting, GARCH model and artificial neural network ANA; then conducts data analysis and research in the Chinese stock market background which use matlab software and eviews software to run the double types respectively. This obvious BP-ANN model and GARCH model use to predict then continue to determine the trend of the Shanghai index which is A-share, and the price trend of a stock of Great Wisdom respectively. In the end, compare the output results to deepen the understanding of financial data analysis methods. Through the research of this article, it can be polished, whether it is in terms of stock index forecasting or single stock forecasting.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132640225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hardware-friendly model compression technique of DNN for edge computing","authors":"Xinyun Liu","doi":"10.1109/CDS52072.2021.00066","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00066","url":null,"abstract":"The research proposes a design methodology to compress the existing DNN models for low-cost edge devices. To reduce the computation complexity and memory cost, several novel model compression techniques are proposed. (1) A DNN model used to conduct image classification tasks is quantized into integer-based model for both the inference and training. 8-bit quantization is chosen in this work to balance the model training accuracy and cost. (2) A stochastic rounding scheme is implemented during the gradient backpropagation process to relieve the gradient diminishing risk. (3) To further reduce the training error caused by the gradient diminishing problem, a dynamic backpropagation algorithm is implemented. By dynamically scaling the magnitudes of gradient during the backpropagation, e.g. enlarging the magnitude of the gradient when it's too small to be quantized, it can effectively overcome the information loss due to the quantization error. As a result, such a DNN model for image classification is quantized into 8-bit model including training, which reduces the computation complexity by 8X and decreases the memory size by 6X Owing to the proposed dynamic backpropagation and stochastic training algorithms, the gradient diminishing issue during backpropagation is relieved. The training speed is reduced by 3X while classification error rates of state-of-art databases, e.g. ImageNet and CIFAR-10, are maintained similarly compared to the original model without quantization.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134151767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Applications of BP, Convolutional and RBF Networks","authors":"Zebu Lan","doi":"10.1109/CDS52072.2021.00099","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00099","url":null,"abstract":"By studying the effects of different types of feed forward neural networks in different fields, the applicable environment of different neural networks can be judged, which will make it easier for people to choose appropriate neural network when it's needed. To achieve this, in this article I summarize and classify the existing neural network experiences and feedback results, and compare the data before and after using the neural network. The data shows that BP networks can improve the resolution or accuracy of problems with no obvious influencing factors. Convolutional networks can increase the accuracy of image processing to more than 95%, and RBF networks can calculate high-precision data curves. Thus, it can be concluded that the BP network is suitable for solving problems with unclear influencing factors, the convolutional network has more image processing problems, and the RBF network has a higher frequency of use when higher results are required.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133421172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Flexible Job Shop Scheduling Rules Mining Based on Random Forest","authors":"Yizhong Wang","doi":"10.1109/CDS52072.2021.00045","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00045","url":null,"abstract":"With the development of the global economy and customization, the manufacturing scheduling problem is increasingly complicated. Flexible job shops (FJSs) have to be more flexible and dynamic to handle these complex and various manufacturing environments. Aiming at the dynamic scheduling problem of FJS, a method of mining scheduling rules from scheduling related historical data with industrial big data characteristics is proposed. In the mining of scheduling rules, an improved random forest algorithm is proposed, which is suitable for mining scheduling rules from historical data related to large-scale, high-dimensional, and noisy scheduling. Experimental results show that the scheduling rules obtained by the mining method have good performance in terms of scheduling performance and computational efficiency.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131396331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new perspective of credit scoring for small and medium-sized enterprises based on invoice data","authors":"Yuan Sun, Weifeng Jian, Yufeng Fu, Huiping Sun, Yuesheng Zhu, Zhiqiang Bai","doi":"10.1109/CDS52072.2021.00088","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00088","url":null,"abstract":"Credit Scoring takes a prominent part in the finance of small and medium-sized enterprises (SMEs), and it is also an invaluable tool to predict credit default. However, due to the variety of market size, capital scale, and the limitations of Credit Scoring Model, it is difficult for SMEs to refer to large enterprises for Credit Scoring. And as an important reference, financial statements are insufficient and time window is inflexible, which would fail to make data reflect the enterprise's operating conditions correctly and timely, inaccurate credit prediction arising. Therefore, we offer an inspiring perspective to search elastic and time-independent evidence. Served as an indispensable basis of accounting in China, invoices take full notes on taxes of economic business, with more details about financial statements and more flexibility over periods, which can develop a sustainable approach to master the operation information of SMEs in time. To deal with invoice data of SMEs, we study influential variables under the first digit law inspired by Benford's law, apply machine learning techniques, and guide experiment by the construction of score card. It shows that our method formed by easy-to-accomplish steps is of applicability and effectiveness, to support powerfully the existing Credit Scoring system.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125334298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Node Augmentation Methods for Graph Neural Network based Object Classification","authors":"Yifan Xue, Yixuan Liao, Xiaoxin Chen, Jingwei Zhao","doi":"10.1109/CDS52072.2021.00101","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00101","url":null,"abstract":"Graph neural networks (GNNs) are powerful models for learning representations of relational data, and have achieved impressive results both academically and industrially. To further enhance the performance of GNNs on the most studied node classification problem, we present NodeAug, a novel augmentation method that operates on graph-structured data, yielding virtual nodes by mixing pairs of nodes and corresponding graph structures. We first generate isolated virtual nodes with features and labels being convex combinations of existing nodes, without considering edges, which is termed as NodeAug-I and can enhance many GNN variants, demonstrating the effectiveness of node augmentation. Still, it does not exploit the graph structure, which is essential in GNNs while incorporating the structure in augmentation is a key challenge. Regarding this, we further propose two novel algorithms that can mix information of neighbors as well, taking graph structures into account. We conduct experiments on a wide range of benchmarks, including Cora, CiteSeer, Pubmed, CoraFull, Amazon and CLUSTER, and obtain considerable enhancement for well-known models, including GCN, GraphSAGE, GIN and GAT.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116743727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}