{"title":"A Comparative Study of Credit Card Fraud Detection Using the Combination of Machine Learning Techniques with Data Imbalance Solution","authors":"Faroque Ahmed, R. Shamsuddin","doi":"10.1109/CDS52072.2021.00026","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00026","url":null,"abstract":"Due to the rapid spread of fraud and cybersecurity risks in digital economy, fraud detection stands as a prime issue of modern technology. However, the analysis of fraud cases is computationally difficult because, fraud cases conjure less than 0.2% of the transactions. Thus to figure out the best classification technique to use for fraud detection, this paper has conducted a thorough experimentation of Machine Learning (ML) techniques. It has implemented six ML techniques i.e. Logistic Regression (LR), Support Vector Machine (SVM), Naíve Bayes (NB), Random forest (RF), Decision Tree (DT), and K-nearest neighbour (KNN) classifiers to detect credit card fraud. The investigation used five type of datasets i.e. imbalanced data, Under Sampled (US) data, Over Sampled (OS) data, sampled data using Synthetic Minority Over Sampling Technique (SMOTE) and Adaptive Synthetic Sampling Method for Imbalanced Data (ADASYN). The best combination of these classification approaches is selected based on five performance evaluation criteria i.e. Accuracy, Area Under the Curve (AUC), Precision, Recall score and fl-score. After evaluation of the classifiers it has showed that among 30 different classification approaches, RF classifier with over sampling (OS) technique was found to be the best approach in terms of all the performance criteria. It showed 99.99 % accurate and precise results with 99.99 % AUC, fl-score and 100 % Recall rate. Our choosen approach has obtained the highest accuracy over other studies on the same dataset. The banking sector as well as other financial institutions might use this suggested machine learning based combination approach to minimize (debit/credit card) frauds.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122140303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital Character CAPTCHA Recognition Using Convolution Network","authors":"Yu Cao","doi":"10.1109/CDS52072.2021.00029","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00029","url":null,"abstract":"Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) is a type of automatic program to determine whether the user is human or not. The most common type of CAPTCHA is a kind of message interpretation by twisting the letters and adding slight noises in the background, plays a role of verification code. In this paper, we will introduce the basis of Convolutional Neural Network first. Then based on the handwritten digit recognition using CNN, we will develop a network for CAPTCHA image recognition.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115793709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on software defect prediction technology based on deep learning","authors":"Pengcheng Jiang","doi":"10.1109/CDS52072.2021.00024","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00024","url":null,"abstract":"To solve the problem that the traditional feature selection methods, such as PCA and LDA, are unable to get the nonlinear relationship between characteristics. Deep belief networks cannot eliminate the noise and missing value, which affect the accuracy of the software defect prediction (SDP) model. Not only the methods of feature selection, but data preprocessing and learning algorithm can also affect the precision of the defect prediction model. This thesis uses deep belief networks and SVM to construct an SDP model (DBN-SVM) to increase prediction precision. Using denoising autoencoders and SVM to build an SDP model (DA - SVM), compared with the DBN - SVM, DA - SVM model not only improves the prediction precision, but also enhance the robustness of the model. The thesis also proposes an SDP model framework which includes data preprocessing, feature selection and learning algorithm.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"93 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120998065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Application of Blockchain in Civil Aviation","authors":"Yucheng Peng","doi":"10.1109/CDS52072.2021.00042","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00042","url":null,"abstract":"In recent years, blockchain technology has increasingly attracted attention in the field of civil aviation, whose features such as decentralization, openness, transparency, traceability, and anonymity provide new ideas for solving problems in civil aviation. In this paper, from the perspective of the alliance chain, we design and analyze the application of blockchain technology in six scenarios, including passenger identity racking, baggage handling and indemnity, ticket selling, aircraft maintenance record, passenger privacy and security, and opaque examination and approval of airline resources.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"12 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125468993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Development of Obstacle Avoidance for Autonomous Vehicles and an Optimization Scheme for the Artificial Potential Field Method","authors":"Xuefei Yang","doi":"10.1109/CDS52072.2021.00010","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00010","url":null,"abstract":"The research on the driving process of autonomous vehicles involves the combination of multiple subjects including sensor perception, artificial intelligence, high-precision map, and intelligent obstacle avoidance, which is an important development direction of current road driving. To achieve the purpose of autonomous driving without human control, the high adaptability and robustness of the vehicle, as well as the ability to identify and avoid obstacles while driving are required. The purpose of this paper is to study the development process of the obstacle avoidance system for autonomous vehicles and propose an optimization scheme for the obstacle avoidance algorithm.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124108646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Traffic Congestion Prevention Model Based on Internet of Things Virtualization Technology","authors":"Jianbing Chen, Bo Xu, Wei Tang","doi":"10.1109/CDS52072.2021.00072","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00072","url":null,"abstract":"This paper summarizes the virtualization of traffic road Internet of things, sensor perception of Intelligent Transportation Road, multi-source sensor fusion technology of intelligent traffic road, and problems existing in perception virtualization of traffic road Internet of things. It puts forward traffic road virtualization based on multi-source sensor and its resource management model and its working environment, and traffic road governance framework based on Internet of things virtualization Finally, this paper summarizes the work done in order to provide reference for the research on traffic congestion prevention based on the virtualization technology of the Internet of thing.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115368331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Business Intelligence based novel Marketing Strategy Approach using Automatic Speech Recognition and Text Summarization","authors":"Pratidnya Hegdepatil, K. Davuluri","doi":"10.1109/CDS52072.2021.00108","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00108","url":null,"abstract":"A novel marketing strategy approach is proposed that offers distinctive features than other competitive models. Text converted from speech between customer and sales executive is fed into text summarizer to automate customer strategy into various priorities. The approach shows that there is no need of data entry for emulating marketing tools. The need of account for customer's preferences is high, which can be stored by capturing the customer's context. Data mining and advanced analytical aspects give accurate results to help marketing managers and improve firms with solid customer strategies. The experimental results for the implementation show high outcomes in comparison to native advances in the technologies. A total of 62 test cases have been performed where an average of 12% Word Error Rate has been observed for Speech Recognition, which is optimal than other standard Speech Datasets. Furthermore, summarization compressed rate has been reported as 60%. The automated categorization module has achieved 70 % accuracy and overall F-score is 0.81349.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115528904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Investigation on Key Technologies for On-chip Optical Interconnection","authors":"Lehao Wang","doi":"10.1109/CDS52072.2021.00044","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00044","url":null,"abstract":"With the rapid development of communication technology, higher demands are required for the communication networks in the “big data era”. Integrated optics has the advantages of compactness, low power consumption and large bandwidth, which are widely applied for optical interconnection and communication. As higher and higher computing ability is demanded by more and more applications such as could computing, virtual reality, it is necessary to broaden the bandwidth of on-chip optical interconnection. In this paper, there advanced modulation formats are investigated to increase the transmission capacity for on-chip optical interconnection, including Pulse Amplitude Modulation (PAM), Discrete Multi-tone Modulation (DMT) and Carrier-free amplitude modulation (CAP). On this basis, mode division multiplexing (MDM) can be used to further increase the transmission capacity, in which different spatial modes are exploited for data transmission. Moreover, flexible data exchange function in op-chip MDM networks is studied to increase the network flexibility.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125443237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rate My Professors: A Study Of Bias and Inaccuracies In Anonymous Self-Reporting","authors":"Alexander Katrompas, V. Metsis","doi":"10.1109/CDS52072.2021.00098","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00098","url":null,"abstract":"Commentary-driven and opinion-driven “ratings” websites such as Yelp, Amazon Reviews, and Rate My Professors are ubiquitous on the Web. These review sites engage in voluntary self-reporting which is well known to be fraught with bias and inaccurate representations. This study is an investigation into the website known as Rate My Professors which ostensibly seeks to collect and report on college and university professor quality. Rate My Professors (RMP), like many review sites, is anonymous. While it is commonly believed that anonymity increases accuracy, studies have shown otherwise. Studies using anonymous self-reporting are well known to be unreliable, so much so that researchers are required to take compensatory measures to validate their results. Rate My Professors takes no such measures, and in fact there is no guarantee that a “student reviewer” is even a student. This study investigates Rate My Professors for bias, inaccuracy, and invalid data. The study will show compelling evidence which supports the idea that anonymous self-reporting, without compensatory validation measures, is flawed and unsuitable for use in a decision making process.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126199083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Deep Learning Based Hybrid Model for Sales Prediction of E-commerce with Sentiment Analysis","authors":"Haotian Zhu","doi":"10.1109/CDS52072.2021.00091","DOIUrl":"https://doi.org/10.1109/CDS52072.2021.00091","url":null,"abstract":"The market of E-commerce has developed rapidly since the emergence of Internet. Many companies or shops of E-commerce want to seek a way to predict the sales of their products. The prediction of sales can help the merchants to formulate a sales strategy, in order to obtain a bigger profit and attract more investment. However, many studies simply use the daily sales or some very basic daily information of the product to predict the sales, without considering the effects of reviews. In this paper, a hybrid network including Bi-directional Long Short-Term Memory (BiLSTM) and Convolutional Neural Network (CNN) is proposed to solve this prediction task. Through careful selection, several attributes and comments of the products are utilized. Feature engineering is used to normalize the different kinds of data. BiLSTM is conducted to analyze all the comments. CNN is utilized to make predictions by using the data provided by feature engineering. Analysis is described to show the advantages and effectiveness of this network. By using the attributes of the product, along with the polarization of comments, CNN can predict the sales of the product.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"1988 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125490298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}