Zixin Tang , Haihui Fan , Xiaoyan Gu , Jiang Zhou , Hui Ma , Athanasios V. Vasilakos , Bo Li
{"title":"Enabling efficient and accurate semantic search over encrypted cloud data","authors":"Zixin Tang , Haihui Fan , Xiaoyan Gu , Jiang Zhou , Hui Ma , Athanasios V. Vasilakos , Bo Li","doi":"10.1016/j.ins.2025.122437","DOIUrl":null,"url":null,"abstract":"<div><div>The privacy and security of cloud data have drawn much attention, leading to more data owners outsourcing encrypted data. However, the common practice of encryption can reduce data searchability. Semantic searchable encryption aims to support flexible queries over encrypted data and achieve efficient search while ensuring that search results match the user's search intent. Although semantic searchable encryption schemes have made progress, they still have limitations in properly balancing accuracy, efficiency, and security. In this paper, we propose a novel <strong>C</strong>ontext-<strong>E</strong>nhanced <strong>S</strong>emantic <strong>S</strong>earchable <strong>E</strong>ncryption (<strong>CESSE</strong>) scheme to achieve accurate and highly efficient secure semantic search over encrypted cloud data. To achieve it, we first adopt a context-enhanced pre-trained model component to mine the relevance between queries and documents by contrastive learning and obtain context-enhanced vector representations to improve search accuracy. Then, to ensure privacy protection, we utilize an optimized asymmetric scalar-product-preserving encryption (optimized ASPE) algorithm to encrypt vectors before outsourcing to the cloud. Additionally, we construct the approximate nearest neighbor (ANN) index to accelerate vector searching. At last, we give a formal definition of security and theoretically prove the safety of our scheme under a more practical threat model. Extensive experiments demonstrate that the CESSE outperforms state-of-the-art baselines with better accuracy and efficiency.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"719 ","pages":"Article 122437"},"PeriodicalIF":6.8000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525005699","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The privacy and security of cloud data have drawn much attention, leading to more data owners outsourcing encrypted data. However, the common practice of encryption can reduce data searchability. Semantic searchable encryption aims to support flexible queries over encrypted data and achieve efficient search while ensuring that search results match the user's search intent. Although semantic searchable encryption schemes have made progress, they still have limitations in properly balancing accuracy, efficiency, and security. In this paper, we propose a novel Context-Enhanced Semantic Searchable Encryption (CESSE) scheme to achieve accurate and highly efficient secure semantic search over encrypted cloud data. To achieve it, we first adopt a context-enhanced pre-trained model component to mine the relevance between queries and documents by contrastive learning and obtain context-enhanced vector representations to improve search accuracy. Then, to ensure privacy protection, we utilize an optimized asymmetric scalar-product-preserving encryption (optimized ASPE) algorithm to encrypt vectors before outsourcing to the cloud. Additionally, we construct the approximate nearest neighbor (ANN) index to accelerate vector searching. At last, we give a formal definition of security and theoretically prove the safety of our scheme under a more practical threat model. Extensive experiments demonstrate that the CESSE outperforms state-of-the-art baselines with better accuracy and efficiency.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.