{"title":"Hybrid Attention Based Shuffle-Net V2 with Global Convolutional Context Networks and Improved Pyramid Scene Parsing Network for Image Forgery Detection","authors":"Henerita Khumallambam, Durgamohon Polem, Rajeev Rajkumar","doi":"10.3103/S1060992X25601459","DOIUrl":"10.3103/S1060992X25601459","url":null,"abstract":"<p>Image forgery detection is the process of identifying manipulated or altered images to determine their authenticity, ensuring the integrity of digital media. Image forgery was predicted to maintain trust in digital media, prevent misinformation, and ensure the credibility of visual evidence in various fields like journalism and law enforcement. Traditional methods for predicting image forgery relied on manual inspection and basic image processing methods, those techniques were time-consuming, required expert knowledge, and were prone to human error, making them less reliable and efficient. Recently presented Artificial Intelligence (AI) techniques, automate the detection process by learning from large datasets, offering increased accuracy and faster processing times. However, such methods need large amounts of labelled data and a good amount of computational power to train the models properly. To address these limitations, proposed Global Convolutional Context Networks (GCCNet) were employed to identify a fake images. The image is first taken from a dataset that was used to detect copy moves. The Quantum Wavelet Transform Filter (QWTF) and RetiNex algorithm are used to pre-process the input image. That is the original image’s noise is eliminated using QWTF, and the pixel contrast is improved using RetiNex. The copied portion of the pre-processed images is then segmented using the Pyramid Scene Parsing Network (CAP-PSPNet), which is based on Contour Aware Processing to make the segmentation edges sharper. The image’s segmented portion is then passed to a hybrid Attention-based Shuffle-Net V2 (ATSNetV2) with Global Convolutional Context Networks (GCCNet) to predict real or fake images. As a result, the model attained an accuracy of 96.60%, an error rate of 3.40%, and F1-score of 96.7%. These highlight the proposed model offer high precision and reliability in image forgery detection.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"35 1","pages":"223 - 236"},"PeriodicalIF":0.8,"publicationDate":"2026-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Venkata Ramana Kaneti, P. Neelakantan, Malige Gangappa
{"title":"Hybrid QRE-RHIBE Encryption with Capability Based Access Control Model and Optimized ENN for Secure EHR Retrieval and Disease Detection in a Healthcare Cloud","authors":"Venkata Ramana Kaneti, P. Neelakantan, Malige Gangappa","doi":"10.3103/S1060992X25600211","DOIUrl":"10.3103/S1060992X25600211","url":null,"abstract":"<p>Healthcare practitioners use Electronic Health Records (EHRs), which are digital records of a patient’s medical history that are kept up to date, to manage patient care. Integration of cloud computing in healthcare enables efficient storage, retrieval, and analysis of EHRs while ensuring robust security and enabling advanced disease detection. These days, security and control over data access are some of the main issues with cloud storage, particularly in the medical industry. To avoid these challenges Optimized Deep learning approach with hybrid cryptography algorithm is developed to secure EHR retrieval and disease detection. Initially, EHR data is collected and pre-processed using Linear Non-Gaussian Acyclic Model (LiNGAM) to fill the missing values and Local Context Normalization (LCN) approach to normalize the data. Then Modified Elman Neural Network (MENN) is used to detect the disease. To enhance the performance of Elman Neural Network, Social Spider Optimization Algorithm (SSOA) is utilized for optimally selecting the weight decay and dropout rate. Following that, to improve the security of data and safeguard patient privacy in the cloud server, a QRE-RHIBE is utilized. Quantum Resistant Encryption is used to generate a keys and the RHIBE is used for both encryption and decryption process based on the generated key. Then for accessing the data stored in the cloud server, Capability based Access Control Mechanism (CBACM) is employed for identifying and removing unwanted users’ access. The model that employed, which achieves an accuracy of 98.53%, precision of 98.02% and execution time of 1.2 sec. This approach ensures data privacy, precise access control, and enhanced diagnostic accuracy which promoting trust and innovation in healthcare systems.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"35 1","pages":"113 - 129"},"PeriodicalIF":0.8,"publicationDate":"2026-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Attention-Residual Multi-Modal Fusion Framework for Crisis Categorization in Social Media Feeds","authors":"S. Sheeba Rachel, S. Srinivasan","doi":"10.3103/S1060992X25600764","DOIUrl":"10.3103/S1060992X25600764","url":null,"abstract":"<p>Crisis categorization in social media feeds perform an important part in modern disaster management and response techniques. With the increasing employ of social platforms as a primary source of information during crises, effective categorization algorithms are essential for quickly and accurately assessing the severity and impact of events. This study introduces the Attention Residual Multi Modal (ARMM) Fusion Framework, which addresses difficulties in MM data processing for damage assessment. For image processing, the system uses Visual Refinement with Feature Forge, which includes Bilateral Filtering for noise reduction and edge preservation, Bicubic Interpolation for upscaling, and Residual Network with Drop Block for detailed and robust image feature extraction. The framework cleans and pre-processes text using an LSTM-Residual with Embedding Network, converting it into compact vector representations, and then uses residual LSTM connections to capture temporal dependencies and maintain feature integrity for robust text feature extraction. Image and text information are then combined and processed using a MM channel attention method, which improves sensitivity to informative features. The proposed method produces outstanding performance metrics, including precision of 98.00%, recall of 94.12%, F1 score of 95.86%, and accuracy of 96.13%. This method efficiently identifies damage severity (severe, medium, or minor) in tweets that include both images and text, leading risk management strategies (rescue, volunteering and contribution) depending on the assessed damage.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"35 1","pages":"1 - 17"},"PeriodicalIF":0.8,"publicationDate":"2026-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reinforcement-Trained Neural Network for Various Manufacturers AV Mixed Traffic Flow Simulation","authors":"O. P. Bobrovskaya, T. V. Gavrilenko, V. A. Galkin","doi":"10.3103/S1060992X25603173","DOIUrl":"10.3103/S1060992X25603173","url":null,"abstract":"<p>The problem of modeling the movement of autopiloted vehicles in a mixed traffic stream of autopilots of various manufacturers, in which there are no collisions, is solved. A neural network model is proposed that implements reinforcement learning for an unmanned vehicle model integrated into a microscopic model of a mixed traffic flow. Computational experiments are being conducted with the proposed model. It was hypothesized that trained agents would work worse together than with the original automated objects in which they were trained. In the first experiment, for a closed multi-corridor circle, the percentage of implementation of trained agents gradually increased. In the second experiment, different trained agents were trained on a single-lane circle and then launched together. In the third experiment, agents are trained in different environments and run together on a track with multiple corridors. As a result of the experiments, the hypothesis was not confirmed. When using different learning environments, agents trained in these environments interact more effectively with each other in a common system than with other types of agents.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"35 1","pages":"55 - 77"},"PeriodicalIF":0.8,"publicationDate":"2026-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Caged CrafText: Language-Grounded Safety Constraints for Multimodal Reinforcement Learning in CrafText","authors":"G. Gorbov, D. Lukashevsky, A. Skrynnik, A. Panov","doi":"10.3103/S1060992X25602799","DOIUrl":"10.3103/S1060992X25602799","url":null,"abstract":"<p>Safe reinforcement learning under natural language instructions remains challenging. Current benchmarks are limited by providing only safety constraints in natural language while using structured goal representations, and by focusing on simple navigation tasks in static environments. We introduce <b>Caged CrafText</b>, the first benchmark where both task objectives and safety constraints are specified entirely through natural language. Our benchmark features complex, long-horizon tasks requiring reasoning in dynamic environments, and is structured around two dataset levels: the Main dataset, designed to evaluate language understanding and safe exploration capabilities across aggregated constraint types, and the Debug dataset, which enables detailed diagnosis of specific failure modes in individual constraint scenarios. Evaluation of existing safety approaches reveals their insufficient safe exploration capability, highlighting the need for improved methods capable of handling complex language-guided constraints. Our experiments also demonstrate that while Large Language Model(LLM) agents show promising reasoning capabilities, they suffer from grounding issues that lead to frequent constraint violations.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"35 1","pages":"18 - 30"},"PeriodicalIF":0.8,"publicationDate":"2026-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modelling the Tight Focusing of Laser Beams with Radial Polarization Using Binary Mask with Optimal Configuration","authors":"D. Y. Kalashnikov, S. S. Stafeev, V. V. Kotlyar","doi":"10.3103/S1060992X25602337","DOIUrl":"10.3103/S1060992X25602337","url":null,"abstract":"<p>The paper is dedicated to the study of tight focusing of laser beams with radial polarization and finding the optimal configuration of a binary phase mask to obtain the minimum size of the focal spot according to the full-width at half-maximum of intensity in the focus of an aplanatic lens. The simulation was implemented by the Richards-Wolf formulas for radially polarized light. Optimization of the binary phase mask was implemented by genetic algorithm. Optimal parameters were selected for binary masks consisting of 108 and 150 zones. The sizes of the obtained focal spots were equal to FWHM = 0.36 λ and FWHM = 0.32 λ, respectively.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"S385 - S391"},"PeriodicalIF":0.8,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simulating the Behavior of Forest Animal Groups to Generate Synthetic Data in Unreal Engine 5","authors":"K. E. Vorobyev, D. A. Sharin, L. A. Taskina","doi":"10.3103/S1060992X25602994","DOIUrl":"10.3103/S1060992X25602994","url":null,"abstract":"<p>The article presents the development of a system for simulating the behavior of groups of forest animals in the three-dimensional environment of Unreal Engine 5 in order to generate synthetic data for computer vision tasks. The simulation includes realistic animal behavior, group dynamics, and the use of a virtual drone with a camera to simulate aerial photography. Annotations are automatically generated for each object, suitable for subsequent training of neural networks. The experiments demonstrated the effectiveness of mixed datasets containing both real and synthetic images when training the YOLO model. The results obtained confirm that the generation of synthetic data can significantly improve the quality of learning with a limited amount of real data. The experiments demonstrated that adding 30% synthetic data to real datasets increased mAP50 from 0.1866 to 0.3605, while Precision improved from 0.7529 to 0.8411. In contrast, training solely on synthetic data reduced mAP50 to 0.0286, confirming the necessity of combining real and generated data. The results indicate that synthetic data can significantly enhance training effectiveness under conditions of limited real-world images.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"S450 - S456"},"PeriodicalIF":0.8,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Random Bit Generation in VCSEL with Delayed Optoelectronic Feedback","authors":"A. A. Krents, E. A. Yarunova, N. E. Molevich","doi":"10.3103/S1060992X25601964","DOIUrl":"10.3103/S1060992X25601964","url":null,"abstract":"<p>This paper presents a method for generating random bits using a semiconductor laser with optoelectronic feedback, which induces chaotic dynamics. Numerical simulations of the system equations, incorporating the alpha factor and time delay, revealed that for τ > 2.5 ns, the system transitions to a chaotic regime, confirmed by a continuous intensity spectrum. For bit generation, the chaotic signal was sampled at 40 GHz, followed by 8-bit quantization and extraction of the 4 least significant bits. The resulting sequence of bits successfully passed all 15 NIST SP 800-22 tests, including frequency, entropy, and correlation assessments. The implemented scheme demonstrates a generation rate of up to 160 Gbit/s, highlighting its potential for high-performance security systems.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"S478 - S484"},"PeriodicalIF":0.8,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Methods of Anomalous Data Detection in Datasets","authors":"Akbar Soliev, Akmal Akhatov, Akbar Rashidov","doi":"10.3103/S1060992X25603045","DOIUrl":"10.3103/S1060992X25603045","url":null,"abstract":"<p>It is known that the accuracy of data analysis and artificial intelligence models that trained and tuned on the basis of data is closely related to the quality of the data set. The quality of the data set depends on several factors, one of the most important of which is the absence or elimination of anomalous data in the data set. Anomalous data has such a property that artificial intelligence models work normally with a data set with this anomalous data. That is, artificial intelligence models do not notice at all that they are working with incorrect data. As a result, the artificial intelligence model returns an incorrect result, which may lead to incorrect conclusions about the object. Therefore, today, the detection of anomalous data in the datasets is one of the studies that has retained its relevance. This research paper discusses anomalous data, their negative consequences, and the types of anomalies in the data set. It also studies methods for detecting anomalous data in datasets and analyzes their use cases.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"S514 - S521"},"PeriodicalIF":0.8,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. A. Sharin, K. E. Vorobyev, T. D. Kazarkin, L. A. Taskina
{"title":"Modeling the Behavior of a Flock of Birds to Generate Synthetic Data in Unreal Engine 5","authors":"D. A. Sharin, K. E. Vorobyev, T. D. Kazarkin, L. A. Taskina","doi":"10.3103/S1060992X25700274","DOIUrl":"10.3103/S1060992X25700274","url":null,"abstract":"<p>The paper proposes a system for modeling the collective behavior of flocks of birds to generate synthetic data in Unreal Engine 5. The system provides flexible adjustment of flock dynamics parameters, including species, speed and flight radius, as well as simulation of complex interactions in a virtual environment. Automated data markup is performed in YOLO (You Only Look Once) formats and with the preservation of spatial metadata of objects. A comparative analysis of the influence of the proportion of synthetic data in training samples on the quality of detection of computer vision models using the example of YOLOv11 is carried out. The results showed that the inclusion of up to 60 to 80 percent of synthetically generated data in the training dataset allows achieving an optimal balance between the accuracy of the model and the cost of collecting real data. The experiments showed that combining real and synthetic data significantly improves detection quality. For example, with 20% real and 80% synthetic data the YOLO v11 model achieved mAP@0.5 = 0.7157, Precision = 0.8157, and Recall = 0.6396, which substantially exceeds the results obtained when training only on real images. The developed system opens up prospects for application in the tasks of airspace monitoring, environmental research and virtual reality.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"S457 - S464"},"PeriodicalIF":0.8,"publicationDate":"2026-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}