Journal of Computer Science and Technology最新文献

Balancing Accuracy and Training Time in Federated Learning for Violence Detection in Surveillance Videos: A Study of Neural Network Architectures 平衡用于监控视频暴力检测的联合学习的准确性和训练时间：神经网络架构研究

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-09-13 DOI: 10.1007/s11390-024-3702-7

Quentin Pajon, Swan Serre, Hugo Wissocq, Léo Rabaud, Siba Haidar, Antoun Yaacoub

{"title":"Balancing Accuracy and Training Time in Federated Learning for Violence Detection in Surveillance Videos: A Study of Neural Network Architectures","authors":"Quentin Pajon, Swan Serre, Hugo Wissocq, Léo Rabaud, Siba Haidar, Antoun Yaacoub","doi":"10.1007/s11390-024-3702-7","DOIUrl":"https://doi.org/10.1007/s11390-024-3702-7","url":null,"abstract":"This paper presents an original investigation into the domain of violence detection in videos, introducing an innovative approach tailored to the unique challenges of a federated learning environment. The study encompasses a comprehensive exploration of machine learning techniques, leveraging spatio-temporal features extracted from benchmark video datasets. In a notable departure from conventional methodologies, we introduce a novel architecture, the “Diff Gated” network, designed to streamline preprocessing and training while simultaneously enhancing accuracy. Our exploration of advanced machine learning techniques, such as super-convergence and transfer learning, expands the horizons of federated learning, offering a broader range of practical applications. Moreover, our research introduces a method for seamlessly adapting centralized datasets to the federated learning context, bridging the gap between traditional machine learning and federated learning approaches. The outcome of this study is a remarkable advancement in the field of violence detection, with our federated learning model consistently outperforming state-of-the-art models, underscoring the transformative potential of our contributions. This work represents a significant step forward in the application of machine learning techniques to critical societal challenges.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"152 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142178639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Survey of LLM Datasets: From Autoregressive Model to AI Chatbot LLM 数据集调查：从自回归模型到人工智能聊天机器人

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-3767-3

Fei Du, Xin-Jian Ma, Jing-Ru Yang, Yi Liu, Chao-Ran Luo, Xue-Bin Wang, Hai-Ou Jiang, Xiang Jing

{"title":"A Survey of LLM Datasets: From Autoregressive Model to AI Chatbot","authors":"Fei Du, Xin-Jian Ma, Jing-Ru Yang, Yi Liu, Chao-Ran Luo, Xue-Bin Wang, Hai-Ou Jiang, Xiang Jing","doi":"10.1007/s11390-024-3767-3","DOIUrl":"https://doi.org/10.1007/s11390-024-3767-3","url":null,"abstract":"Since OpenAI opened access to ChatGPT, large language models (LLMs) become an increasingly popular topic attracting researchers’ attention from abundant domains. However, public researchers meet some problems when developing LLMs given that most of the LLMs are produced by industries and the training details are typically unrevealed. Since datasets are an important setup of LLMs, this paper does a holistic survey on the training datasets used in both the pre-train and fine-tune processes. The paper first summarizes 16 pre-train datasets and 16 fine-tune datasets used in the state-of-the-art LLMs. Secondly, based on the properties of the pre-train and fine-tune processes, it comments on pre-train datasets from quality, quantity, and relation with models, and comments on fine-tune datasets from quality, quantity, and concerns. This study then critically figures out the problems and research trends that exist in current LLM datasets. The study helps public researchers train and investigate LLMs by visual cases and provides useful comments to the research community regarding data development. To the best of our knowledge, this paper is the first to summarize and discuss datasets used in both autoregressive and chat LLMs. The survey offers insights and suggestions to researchers and LLM developers as they build their models, and contributes to the LLM study by pointing out the existing problems of LLM studies from the perspective of data.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"16 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Video Colorization: A Survey 视频着色：一项调查

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-4143-z

Zhong-Zheng Peng, Yi-Xin Yang, Jin-Hui Tang, Jin-Shan Pan

{"title":"Video Colorization: A Survey","authors":"Zhong-Zheng Peng, Yi-Xin Yang, Jin-Hui Tang, Jin-Shan Pan","doi":"10.1007/s11390-024-4143-z","DOIUrl":"https://doi.org/10.1007/s11390-024-4143-z","url":null,"abstract":"Video colorization aims to add color to grayscale or monochrome videos. Although existing methods have achieved substantial and noteworthy results in the field of image colorization, video colorization presents more formidable obstacles due to the additional necessity for temporal consistency. Moreover, there is rarely a systematic review of video colorization methods. In this paper, we aim to review existing state-of-the-art video colorization methods. In addition, maintaining spatial-temporal consistency is pivotal to the process of video colorization. To gain deeper insight into the evolution of existing methods in terms of spatial-temporal consistency, we further review video colorization methods from a novel perspective. Video colorization methods can be categorized into four main categories: optical-flow based methods, scribble-based methods, exemplar-based methods, and fully automatic methods. However, optical-flow based methods rely heavily on accurate optical-flow estimation, scribble-based methods require extensive user interaction and modifications, exemplar-based methods face challenges in obtaining suitable reference images, and fully automatic methods often struggle to meet specific colorization requirements. We also discuss the existing challenges and highlight several future research opportunities worth exploring.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"64 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advances of Pipeline Model Parallelism for Deep Learning Training: An Overview 用于深度学习训练的管道模型并行性的进步：概述

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-3872-3

Lei Guan, Dong-Sheng Li, Ji-Ye Liang, Wen-Jian Wang, Ke-Shi Ge, Xi-Cheng Lu

引用次数: 0

Knowledge-Enhanced Conversational Agents 知识增强型对话代理

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-2883-4

Fabio Caffaro, Giuseppe Rizzo

引用次数: 0

A Survey of Multimodal Controllable Diffusion Models 多模式可控扩散模型概览

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-3814-0

Rui Jiang, Guang-Cong Zheng, Teng Li, Tian-Rui Yang, Jing-Dong Wang, Xi Li

{"title":"A Survey of Multimodal Controllable Diffusion Models","authors":"Rui Jiang, Guang-Cong Zheng, Teng Li, Tian-Rui Yang, Jing-Dong Wang, Xi Li","doi":"10.1007/s11390-024-3814-0","DOIUrl":"https://doi.org/10.1007/s11390-024-3814-0","url":null,"abstract":"Diffusion models have recently emerged as powerful generative models, producing high-fidelity samples across domains. Despite this, they have two key challenges, including improving the time-consuming iterative generation process and controlling and steering the generation process. Existing surveys provide broad overviews of diffusion model advancements. However, they lack comprehensive coverage specifically centered on techniques for controllable generation. This survey seeks to address this gap by providing a comprehensive and coherent review on controllable generation in diffusion models. We provide a detailed taxonomy defining controlled generation for diffusion models. Controllable generation is categorized based on the formulation, methodologies, and evaluation metrics. By enumerating the range of methods researchers have developed for enhanced control, we aim to establish controllable diffusion generation as a distinct subfield warranting dedicated focus. With this survey, we contextualize recent results, provide the dedicated treatment of controllable diffusion model generation, and outline limitations and future directions. To demonstrate applicability, we highlight controllable diffusion techniques for major computer vision tasks application. By consolidating methods and applications for controllable diffusion models, we hope to catalyze further innovations in reliable and scalable controllable generation.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"13 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

When Crowdsourcing Meets Data Markets: A Fair Data Value Metric for Data Trading 当众包遇上数据市场：数据交易的公平数据价值度量

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-023-2519-0

Yang-Su Liu, Zhen-Zhe Zheng, Fan Wu, Gui-Hai Chen

{"title":"When Crowdsourcing Meets Data Markets: A Fair Data Value Metric for Data Trading","authors":"Yang-Su Liu, Zhen-Zhe Zheng, Fan Wu, Gui-Hai Chen","doi":"10.1007/s11390-023-2519-0","DOIUrl":"https://doi.org/10.1007/s11390-023-2519-0","url":null,"abstract":"Large-quantity and high-quality data is critical to the success of machine learning in diverse applications. Faced with the dilemma of data silos where data is difficult to circulate, emerging data markets attempt to break the dilemma by facilitating data exchange on the Internet. Crowdsourcing, on the other hand, is one of the important methods to efficiently collect large amounts of data with high-value in data markets. In this paper, we investigate the joint problem of efficient data acquisition and fair budget distribution across the crowdsourcing and data markets. We propose a new metric of data value as the uncertainty reduction of a Bayesian machine learning model by integrating the data into model training. Guided by this data value metric, we design a mechanism called Shapley Value Mechanism with Individual Rationality (SV-IR), in which we design a greedy algorithm with a constant approximation ratio to greedily select the most cost-efficient data brokers, and a fair compensation determination rule based on the Shapley value, respecting the individual rationality constraints. We further propose a fair reward distribution method for the data holders with various effort levels under the charge of a data broker. We demonstrate the fairness of the compensation determination rule and reward distribution rule by evaluating our mechanisms on two real-world datasets. The evaluation results also show that the selection algorithm in SV-IR could approach the optimal solution, and outperforms state-of-the-art methods.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"50 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Age-of-Information-Aware Federated Learning 感知信息时代的联合学习

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-024-3914-x

Yin Xu, Ming-Jun Xiao, Chen Wu, Jie Wu, Jin-Rui Zhou, He Sun

{"title":"Age-of-Information-Aware Federated Learning","authors":"Yin Xu, Ming-Jun Xiao, Chen Wu, Jie Wu, Jin-Rui Zhou, He Sun","doi":"10.1007/s11390-024-3914-x","DOIUrl":"https://doi.org/10.1007/s11390-024-3914-x","url":null,"abstract":"Federated learning (FL) is an emerging privacy-preserving distributed computing paradigm, enabling numerous clients to collaboratively train machine learning models without the necessity of transmitting clients’ private datasets to the central server. Unlike most existing research where the local datasets of clients are assumed to be unchanged over time throughout the whole FL process, our study addresses such scenarios in this paper where clients’ datasets need to be updated periodically, and the server can incentivize clients to employ as fresh as possible datasets for local model training. Our primary objective is to design a client selection strategy to minimize the loss of the global model for FL loss within a constrained budget. To this end, we introduce the concept of “Age of Information” (AoI) to quantitatively assess the freshness of local datasets and conduct a theoretical analysis of the convergence bound in our AoI-aware FL system. Based on the convergence bound, we further formulate our problem as a restless multi-armed bandit (RMAB) problem. Next, we relax the RMAB problem and apply the Lagrangian Dual approach to decouple it into multiple subproblems. Finally, we propose a Whittle’s Index Based Client Selection (WICS) algorithm to determine the set of selected clients. In addition, comprehensive simulations substantiate that the proposed algorithm can effectively reduce training loss and enhance the learning accuracy compared with some state-of-the-art methods.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"29 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DCFNet: Discriminant Correlation Filters Network for Visual Tracking DCFNet：用于视觉跟踪的判别相关滤波器网络

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-023-3788-3

Wei-Ming Hu, Qiang Wang, Jin Gao, Bing Li, Stephen Maybank

{"title":"DCFNet: Discriminant Correlation Filters Network for Visual Tracking","authors":"Wei-Ming Hu, Qiang Wang, Jin Gao, Bing Li, Stephen Maybank","doi":"10.1007/s11390-023-3788-3","DOIUrl":"https://doi.org/10.1007/s11390-023-3788-3","url":null,"abstract":"CNN (convolutional neural network) based real time trackers usually do not carry out online network update in order to maintain rapid tracking speed. This inevitably influences the adaptability to changes in object appearance. Correlation filter based trackers can update the model parameters online in real time. In this paper, we present an end-to-end lightweight network architecture, namely Discriminant Correlation Filter Network (DCFNet). A differentiable DCF (discriminant correlation filter) layer is incorporated into a Siamese network architecture in order to learn the convolutional features and the correlation filter simultaneously. The correlation filter can be efficiently updated online. In previous work, we introduced a joint scale-position space to the DCFNet, forming a scale DCFNet which carries out the predictions of object scale and position simultaneously. We combine the scale DCFNet with the convolutional-deconvolutional network, learning both the high-level embedding space representations and the low-level fine-grained representations for images. The adaptability of the fine-grained correlation analysis and the generalization capability of the semantic embedding are complementary for visual tracking. The back-propagation is derived in the Fourier frequency domain throughout the entire work, preserving the efficiency of the DCF. Extensive evaluations on the OTB (Object Tracking Benchmark) and VOT (Visual Object Tracking Challenge) datasets demonstrate that the proposed trackers have fast speeds, while maintaining tracking accuracy.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"2 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Neighborhood Combination Search for Single-Machine Scheduling with Sequence-Dependent Setup Time 取决于序列设置时间的单机调度的邻域组合搜索

IF 1.9 3区计算机科学

Journal of Computer Science and Technology Pub Date : 2024-07-22 DOI: 10.1007/s11390-023-2007-6

Xiao-Lu Liu, Hong-Yun Xu, Jia-Ming Chen, Zhou-Xing Su, Zhi-Peng Lyu, Jun-Wen Ding

{"title":"Neighborhood Combination Search for Single-Machine Scheduling with Sequence-Dependent Setup Time","authors":"Xiao-Lu Liu, Hong-Yun Xu, Jia-Ming Chen, Zhou-Xing Su, Zhi-Peng Lyu, Jun-Wen Ding","doi":"10.1007/s11390-023-2007-6","DOIUrl":"https://doi.org/10.1007/s11390-023-2007-6","url":null,"abstract":"In a local search algorithm, one of its most important features is the definition of its neighborhood which is crucial to the algorithm’s performance. In this paper, we present an analysis of neighborhood combination search for solving the single-machine scheduling problem with sequence-dependent setup time with the objective of minimizing total weighted tardiness (SMSWT). First, We propose a new neighborhood structure named Block Swap (B1) which can be considered as an extension of the previously widely used Block Move (B2) neighborhood, and a fast incremental evaluation technique to enhance its evaluation efficiency. Second, based on the Block Swap and Block Move neighborhoods, we present two kinds of neighborhood structures: neighborhood union (denoted by B1⋃B2) and token-ring search (denoted by B1 → B2), both of which are combinations of B1 and B2. Third, we incorporate the neighborhood union and token-ring search into two representative metaheuristic algorithms: the Iterated Local Search Algorithm (ILSnew) and the Hybrid Evolutionary Algorithm (HEAnew) to investigate the performance of the neighborhood union and token-ring search. Extensive experiments show the competitiveness of the token-ring search combination mechanism of the two neighborhoods. Tested on the 120 public benchmark instances, our HEAnew has a highly competitive performance in solution quality and computational time compared with both the exact algorithms and recent metaheuristics. We have also tested the HEAnew algorithm with the selected neighborhood combination search to deal with the 64 public benchmark instances of the single-machine scheduling problem with sequence-dependent setup time. HEAnew is able to match the optimal or the best known results for all the 64 instances. In particular, the computational time for reaching the best well-known results for five challenging instances is reduced by at least 61.25%.","PeriodicalId":50222,"journal":{"name":"Journal of Computer Science and Technology","volume":"64 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0