2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)最新文献

筛选
英文 中文
A Transportation Analytic Solution for Predicting Flight Cancellations 预测航班取消的运输解析解
Shawn J. Lanting, C. Leung, Khush Bhrugesh Patel, Sanskar Raval, Liza Yashin
{"title":"A Transportation Analytic Solution for Predicting Flight Cancellations","authors":"Shawn J. Lanting, C. Leung, Khush Bhrugesh Patel, Sanskar Raval, Liza Yashin","doi":"10.1109/IRI58017.2023.00050","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00050","url":null,"abstract":"Flight cancellations can negatively impact passengers and airlines by causing stress, time loss, financial losses, and providing a disruptive travelling experience. Airlines pay for crewmembers, provide refunds for passengers, and need to account for other unexpected expenses. Passengers might have a connection and need to get to a specific place for an important event such as a work conference, wedding, funeral, or vacation. Applying advanced transportation data analytical techniques to develop practical solutions can contribute to the ongoing development of more efficient and reliable air travel. In this paper, we present a data science solution, which integrates flight data, weather data, and other related data to determine key factors contributing to flight cancellations. In particular, we focus on weather-related factors such as precipitation and wind speed. Evaluation results on real data show the practicality and accuracy of our solution in predicting flight cancellations.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130507783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Similarity Measure and Hierarchical Clustering Approach to Color Image Segmentation 一种新的相似度量和层次聚类方法用于彩色图像分割
Radhwane Gherbaoui, Nacéra Benamrane, Mohammed Ouali
{"title":"A New Similarity Measure and Hierarchical Clustering Approach to Color Image Segmentation","authors":"Radhwane Gherbaoui, Nacéra Benamrane, Mohammed Ouali","doi":"10.1109/IRI58017.2023.00014","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00014","url":null,"abstract":"Cluster analysis is an important task in data analysis and machine learning. Traditional clustering methods, such as partitioning and density-based approaches, have limitations in identifying natural clusters in datasets with elliptical and chained shapes. In this paper, we propose a novel hierarchical clustering algorithm for color image segmentation that addresses these limitations by quantifying the degree of overlap between clusters as a similarity measure for the merging process.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131627509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Objective optimization of Stock Portfolio Using Evolutionary Computation under COVID-19 COVID-19下基于进化计算的股票投资组合多目标优化
Asuka Tai, Koki Yoshioka, H. Dozono
{"title":"Multiple Objective optimization of Stock Portfolio Using Evolutionary Computation under COVID-19","authors":"Asuka Tai, Koki Yoshioka, H. Dozono","doi":"10.1109/IRI58017.2023.00020","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00020","url":null,"abstract":"Japanese people are facing the problem of saving for old age, and the government recommends asset formation through investment from a young age. However, this approach has not become popular. Therefore, the purpose of this study is to build a system that can find a combination that can obtain income gain with reduced risk from long-term stock holdings. We used the vector-evaluated-genetic-algorithm to find the Pareto optimal solutions from time-series data of stock prices and attempted to verify the effectiveness of this method.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133609069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Improvements of Mouse Dynamics Based Continuous User Authentication 基于连续用户认证的鼠标动力学改进研究
Hayri Durmaz, Mehmet Keskinöz
{"title":"On the Improvements of Mouse Dynamics Based Continuous User Authentication","authors":"Hayri Durmaz, Mehmet Keskinöz","doi":"10.1109/IRI58017.2023.00010","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00010","url":null,"abstract":"Traditional authentication methods are vulnerable when users leave their devices unattended or if their credentials are compromised. In contrast, continuous authentication offers a perpetual strategy for user validation, ensuring that only authorized users access critical information throughout their entire usage. The problem of continuous authentication boils down to a binary classification task: determining whether the usage is legal or illegal. Deep learning presents a promising solution for this problem, although the use of convolutional neural networks (CNNs) in continuous authentication still has room for improvement. In this study, we employ residual learning to train and test a user authentication model. To further enhance the accuracy of the results, we implement a realistic augmentation method and employ a superior image mapping technique compared to existing literature. As a result, we achieve significantly more accurate results than those reported in the referenced studies. On average, our tests yield a False Accept Rate of 0.45 and a False Reject Rate of 0.34, which are 6.5 times better than the referenced studies. These findings demonstrate a substantial improvement in the usability and effectiveness of real-world cybersecurity applications.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116695644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Credit Card Fraud Detection Through a Novel Ensemble Feature Selection Technique 通过一种新的集成特征选择技术增强信用卡欺诈检测
Huanjing Wang, Qianxin Liang, John T. Hancock, T. Khoshgoftaar
{"title":"Enhancing Credit Card Fraud Detection Through a Novel Ensemble Feature Selection Technique","authors":"Huanjing Wang, Qianxin Liang, John T. Hancock, T. Khoshgoftaar","doi":"10.1109/IRI58017.2023.00028","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00028","url":null,"abstract":"Identifying fraudulent activities in credit card transactions is an inherent component of financial computing. The focus of our research is on the Credit Card Fraud Detection Dataset, which is widely used due to its authentic transaction data. In numerous machine learning applications, feature selection has become a crucial step. To improve the chance of discovering the globally optimal feature set, we employ ensembles of feature ranking methods. These ensemble methods merge multiple feature ranking lists through a median approach. We conduct a comprehensive empirical study that examines two different ensembles of feature ranking techniques, including an ensemble of twelve threshold-based feature selection (TBFS) techniques and an ensemble of five supervised feature selection (SFS) techniques. Additionally, we present results where all features are used. We construct classification models using two Decision Tree-based classifiers, CatBoost and XGBoost, and evaluate them using two different performance metrics, the Area Under the Receiver Operating Characteristic Curve (AUC) and the Area under the Precision-Recall Curve (AUPRC). Since AUPRC provides a more accurate representation of the number of false positives, especially for highly imbalanced datasets, evaluating models for AUPRC is a wise choice. The experimental results demonstrate that the ensemble of SFS and all features performs similarly or better than the ensemble of TBFS. Moreover, we find that XGBoost outperforms CatBoost in terms of AUPRC.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129703656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Learning for UAV Detection: Developing a Multi-Class Multimodal Dataset 无人机检测的集成学习:多类多模态数据集的开发
J. McCoy, A. Rawal, D. Rawat
{"title":"Ensemble Learning for UAV Detection: Developing a Multi-Class Multimodal Dataset","authors":"J. McCoy, A. Rawal, D. Rawat","doi":"10.1109/IRI58017.2023.00025","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00025","url":null,"abstract":"Unmanned aerial vehicles (UAVs) are a growing threat to public safety if used maliciously. In this study, we present our multimodal data set containing image, audio, and radio frequency (RF) data, which can serve as a valuable resource for researchers and developers in the field of UAV detection. We present a multiclass multimodal ensemble approach to address the need to improve UAV identification and detection. Our approach is novel as we integrated multiple deep-learning classifiers into a single ensemble classifier. We evaluate the performance of our proposed solution with a hard-voting model and a soft-voted model to evaluate the effectiveness of the proposed solution. Overall, our ensemble approach performed better than the single-modality classifier and when combined, could mitigate the low accuracy of the RF (CNN) accuracy score of 67%. This study has shown how effective ensemble approaches can be used to mitigate limitations when predicting UAV based on multimodal signatures.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128339985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resource-Centric Goal Model Slicing for Detecting Feature Interactions 以资源为中心的目标模型切片检测特征交互
Zedong Peng, Mahima Dahiya, Tessneem Khalil, Nan Niu, Tanmay Bhowmik, Yilong Yang
{"title":"Resource-Centric Goal Model Slicing for Detecting Feature Interactions","authors":"Zedong Peng, Mahima Dahiya, Tessneem Khalil, Nan Niu, Tanmay Bhowmik, Yilong Yang","doi":"10.1109/IRI58017.2023.00018","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00018","url":null,"abstract":"Feature interaction (FI) occurs when the requirements are satisfied by the features in isolation but not in composition. We present a novel approach to FI detection via a lightweight modeling of two features’ resource dependency. Our preliminary study on two Zoom features shows three types of resource dependency: produce-and-use, state-changing, and mutual-exclusion. We present the testing pattern associated with each type, report the FI testing results, and discuss our long-term directions toward using real-world software’s features to ground and evaluate requirements engineering research.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128711173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using BERT to Understand TikTok Users’ ADHD Discussion 使用BERT理解TikTok用户对ADHD的讨论
Kayla Pineda, A. M. Perrotti, Faryaneh Poursardar, Danielle Graber, S. Jayarathna
{"title":"Using BERT to Understand TikTok Users’ ADHD Discussion","authors":"Kayla Pineda, A. M. Perrotti, Faryaneh Poursardar, Danielle Graber, S. Jayarathna","doi":"10.1109/IRI58017.2023.00043","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00043","url":null,"abstract":"The rapid rise in popularity of the social media platform TikTok has allowed information to reach a wider audience at a faster rate compared to traditional news outlets. With this growth, more young people are connected with each other, allowing discussion on issues of mental health, a once stigmatized topic, to grow among young people. However, not all information about mental health issues is done properly or accurately. A commonly discussed mental health issue on TikTok is Attention-Deficit/Hyperactivity Disorder, known as ADHD. In this paper, we briefly describe how ADHD is portrayed on TikTok. Then, we analyze ADHD discussion around themes of self-diagnosis, symptoms, and self-help through text analysis using BERT. We analyze the common responses to ADHD TikToks and set up a multi-label classifier to understand the general range of responses.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115215865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ballot Tabulation Using Deep Learning 使用深度学习的选票制表
Fei Zhao, Chengcui Zhang, Nitesh Saxena, D. Wallach, AKM SHAHARIAR AZAD RABBY
{"title":"Ballot Tabulation Using Deep Learning","authors":"Fei Zhao, Chengcui Zhang, Nitesh Saxena, D. Wallach, AKM SHAHARIAR AZAD RABBY","doi":"10.1109/IRI58017.2023.00026","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00026","url":null,"abstract":"Currently deployed election systems that scan and process hand-marked ballots are not sophisticated enough to handle marks insufficiently filled in (e.g., partially filled-in), improper marks (e.g., using check marks or crosses instead of filling in bubbles), or marks outside of bubbles, other than setting a threshold to detect whether the pixels inside bubbles are dark and dense enough to be counted as a vote. The current works along this line are still largely limited by their degree of automation and require substantial manpower for annotation and adjudication. In this study, we propose a highly automated deep learning (DL) mark segmentation model-based ballot tabulation assistant able to accurately identify legitimate ballot marks. For comparison purposes, a highly customized traditional computer vision (T-CV) mark segmentation-based method has also been developed to compare with the DL-based tabulator, with a detailed discussion included. Our experiments conducted on two real election datasets achieved the highest accuracy of 99.984% on ballot tabulation. In order to further enhance our DL model’s capability of detecting the marks that are underrepresented in training datasets, e.g., insufficiently or improperly filled marks, we propose a Siamese network architecture that enables our DL model to exploit the contrasting features between a hand-marked ballot image and its corresponding blank template image to detect marks. Without the need for extra data collection, by incorporating this novel network architecture, our DL model-based tabulation method not only achieved a higher accuracy score but also substantially reduced the overall false negative rate.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126585149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
“What You See Is What You Test”: Recommending Features from GUIs for Requirements-Based Testing “所见即所测”:为基于需求的测试推荐gui特性
Zedong Peng, J. Savolainen, Jianzhang Zhang, Nan Niu
{"title":"“What You See Is What You Test”: Recommending Features from GUIs for Requirements-Based Testing","authors":"Zedong Peng, J. Savolainen, Jianzhang Zhang, Nan Niu","doi":"10.1109/IRI58017.2023.00057","DOIUrl":"https://doi.org/10.1109/IRI58017.2023.00057","url":null,"abstract":"Requirements-based testing (RBT) advocates the design of test cases in order to adequately exercise the behavior of a software system without regard to the internal details of the implementation. To address the challenge that requirements descriptions may be inaccurate in practice, we align requirements engineering and software testing in a novel way by not counting on a complete and up-to-date requirements documentation. Rather, we maintain the black-box nature of RBT to recommend features as the units of testing from software’s graphical user interfaces (GUIs). In particular, we exploit optical character recognition (OCR) to identify the textual information from GUIs, and further build the GUI-feature correspondences based on software’s user-centric documentation which may exhibit partial correctness. Such correspondences from multiple software systems in the same domain serve as a foundation for our recommendation engine, which suggests the to-be-tested features related to a given GUI. We demonstrate our recommender’s feasibility with a study of five products in the web conferencing domain, and the results show the more complete set of features against which a GUI needs to be tested.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131394677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信