{"title":"MC-LRNN:一种基于逻辑的多类软件漏洞预测神经网络","authors":"Yuxiang Shang , Shaoying Liu","doi":"10.1016/j.jss.2025.112627","DOIUrl":null,"url":null,"abstract":"<div><div>Software vulnerabilities are a major threat to information systems. Detecting them early and accurately is critical. Software metrics are commonly used in vulnerability prediction, but choosing the most relevant features remains a major challenge. In this paper, we present Multi-Class Logic Rules Neural Network (MC-LRNN), a novel model that combines logic-based reasoning with neural networks for software vulnerability prediction. MC-LRNN uses a Top-Down Hill-Climbing Greedy Algorithm to extract first-order logic rules from software metrics, forming an interpretable reasoning layer that guides the learning process. The dataset is divided into a Logic Rule Dataset for rule generation and a Learning Dataset for model training and evaluation.</div><div>We evaluate MC-LRNN on three benchmark datasets — Juliet, SARD, and REVEAL — under both binary and multi-class classification settings. The results show that MC-LRNN consistently outperforms traditional baselines, handles class imbalance effectively, and generalizes well across projects. Its design provides both interpretability and strong generalization capabilities, making it well-suited for real-world vulnerability prediction. Code and datasets are available at: <span><span>https://github.com/Seansyx123/LRNN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"231 ","pages":"Article 112627"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MC-LRNN: A logic-based neural network for multi-class software vulnerability prediction\",\"authors\":\"Yuxiang Shang , Shaoying Liu\",\"doi\":\"10.1016/j.jss.2025.112627\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Software vulnerabilities are a major threat to information systems. Detecting them early and accurately is critical. Software metrics are commonly used in vulnerability prediction, but choosing the most relevant features remains a major challenge. In this paper, we present Multi-Class Logic Rules Neural Network (MC-LRNN), a novel model that combines logic-based reasoning with neural networks for software vulnerability prediction. MC-LRNN uses a Top-Down Hill-Climbing Greedy Algorithm to extract first-order logic rules from software metrics, forming an interpretable reasoning layer that guides the learning process. The dataset is divided into a Logic Rule Dataset for rule generation and a Learning Dataset for model training and evaluation.</div><div>We evaluate MC-LRNN on three benchmark datasets — Juliet, SARD, and REVEAL — under both binary and multi-class classification settings. The results show that MC-LRNN consistently outperforms traditional baselines, handles class imbalance effectively, and generalizes well across projects. Its design provides both interpretability and strong generalization capabilities, making it well-suited for real-world vulnerability prediction. Code and datasets are available at: <span><span>https://github.com/Seansyx123/LRNN</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":51099,\"journal\":{\"name\":\"Journal of Systems and Software\",\"volume\":\"231 \",\"pages\":\"Article 112627\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems and Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0164121225002961\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225002961","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
MC-LRNN: A logic-based neural network for multi-class software vulnerability prediction
Software vulnerabilities are a major threat to information systems. Detecting them early and accurately is critical. Software metrics are commonly used in vulnerability prediction, but choosing the most relevant features remains a major challenge. In this paper, we present Multi-Class Logic Rules Neural Network (MC-LRNN), a novel model that combines logic-based reasoning with neural networks for software vulnerability prediction. MC-LRNN uses a Top-Down Hill-Climbing Greedy Algorithm to extract first-order logic rules from software metrics, forming an interpretable reasoning layer that guides the learning process. The dataset is divided into a Logic Rule Dataset for rule generation and a Learning Dataset for model training and evaluation.
We evaluate MC-LRNN on three benchmark datasets — Juliet, SARD, and REVEAL — under both binary and multi-class classification settings. The results show that MC-LRNN consistently outperforms traditional baselines, handles class imbalance effectively, and generalizes well across projects. Its design provides both interpretability and strong generalization capabilities, making it well-suited for real-world vulnerability prediction. Code and datasets are available at: https://github.com/Seansyx123/LRNN.
期刊介绍:
The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to:
•Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution
•Agile, model-driven, service-oriented, open source and global software development
•Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems
•Human factors and management concerns of software development
•Data management and big data issues of software systems
•Metrics and evaluation, data mining of software development resources
•Business and economic aspects of software development processes
The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.