Wachara Fungwacharakorn, Kanae Tsushima, Hiroshi Hosobe, Hideaki Takeda, Ken Satoh
{"title":"An Argumentative Approach for Explaining Preemption in Soft-Constraint Based Norms","authors":"Wachara Fungwacharakorn, Kanae Tsushima, Hiroshi Hosobe, Hideaki Takeda, Ken Satoh","doi":"arxiv-2409.04065","DOIUrl":"https://doi.org/arxiv-2409.04065","url":null,"abstract":"Although various aspects of soft-constraint based norms have been explored,\u0000it is still challenging to understand preemption. Preemption is a situation\u0000where higher-level norms override lower-level norms when new information\u0000emerges. To address this, we propose a derivation state argumentation framework\u0000(DSA-framework). DSA-framework incorporates derivation states to explain how\u0000preemption arises based on evolving situational knowledge. Based on\u0000DSA-framework, we present an argumentative approach for explaining preemption.\u0000We formally prove that, under local optimality, DSA-framework can provide\u0000explanations why one consequence is obligatory or forbidden by soft-constraint\u0000based norms represented as logical constraint hierarchies.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"156 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems","authors":"Stylianos Loukas Vasileiou, William Yeoh","doi":"arxiv-2409.03671","DOIUrl":"https://doi.org/arxiv-2409.03671","url":null,"abstract":"We present TRACE-cs, a novel hybrid system that combines symbolic reasoning\u0000with large language models (LLMs) to address contrastive queries in scheduling\u0000problems. TRACE-cs leverages SAT solving techniques to encode scheduling\u0000constraints and generate explanations for user queries, while utilizing an LLM\u0000to process the user queries into logical clauses as well as refine the\u0000explanations generated by the symbolic solver to natural language sentences. By\u0000integrating these components, our approach demonstrates the potential of\u0000combining symbolic methods with LLMs to create explainable AI agents with\u0000correctness guarantees.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shashank Tripathi, Omid Taheri, Christoph Lassner, Michael J. Black, Daniel Holden, Carsten Stoll
{"title":"HUMOS: Human Motion Model Conditioned on Body Shape","authors":"Shashank Tripathi, Omid Taheri, Christoph Lassner, Michael J. Black, Daniel Holden, Carsten Stoll","doi":"arxiv-2409.03944","DOIUrl":"https://doi.org/arxiv-2409.03944","url":null,"abstract":"Generating realistic human motion is essential for many computer vision and\u0000graphics applications. The wide variety of human body shapes and sizes greatly\u0000impacts how people move. However, most existing motion models ignore these\u0000differences, relying on a standardized, average body. This leads to uniform\u0000motion across different body types, where movements don't match their physical\u0000characteristics, limiting diversity. To solve this, we introduce a new approach\u0000to develop a generative motion model based on body shape. We show that it's\u0000possible to train this model using unpaired data by applying cycle consistency,\u0000intuitive physics, and stability constraints, which capture the relationship\u0000between identity and movement. The resulting model generates diverse,\u0000physically plausible, and dynamically stable human motions that are both\u0000quantitatively and qualitatively more realistic than current state-of-the-art\u0000methods. More details are available on our project page\u0000https://CarstenEpic.github.io/humos/.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"496 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search","authors":"Emir Demirović, Christian Schilling, Anna Lukina","doi":"arxiv-2409.03260","DOIUrl":"https://doi.org/arxiv-2409.03260","url":null,"abstract":"Decision trees, owing to their interpretability, are attractive as control\u0000policies for (dynamical) systems. Unfortunately, constructing, or synthesising,\u0000such policies is a challenging task. Previous approaches do so by imitating a\u0000neural-network policy, approximating a tabular policy obtained via formal\u0000synthesis, employing reinforcement learning, or modelling the problem as a\u0000mixed-integer linear program. However, these works may require access to a\u0000hard-to-obtain accurate policy or a formal model of the environment (within\u0000reach of formal synthesis), and may not provide guarantees on the quality or\u0000size of the final tree policy. In contrast, we present an approach to\u0000synthesise optimal decision-tree policies given a black-box environment and\u0000specification, and a discretisation of the tree predicates, where optimality is\u0000defined with respect to the number of steps to achieve the goal. Our approach\u0000is a specialised search algorithm which systematically explores the\u0000(exponentially large) space of decision trees under the given discretisation.\u0000The key component is a novel pruning mechanism that significantly reduces the\u0000search space. Our approach represents a conceptually novel way of synthesising\u0000small decision-tree policies with optimality guarantees even for black-box\u0000environments with black-box specifications.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"54 2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mingze Gao, Jingyu Liu, Mingda Li, Jiangtao Xie, Qingbin Liu, Bo Zhao, Xi Chen, Hui Xiong
{"title":"TC-LLaVA: Rethinking the Transfer from Image to Video Understanding with Temporal Considerations","authors":"Mingze Gao, Jingyu Liu, Mingda Li, Jiangtao Xie, Qingbin Liu, Bo Zhao, Xi Chen, Hui Xiong","doi":"arxiv-2409.03206","DOIUrl":"https://doi.org/arxiv-2409.03206","url":null,"abstract":"Multimodal Large Language Models (MLLMs) have significantly improved\u0000performance across various image-language applications. Recently, there has\u0000been a growing interest in adapting image pre-trained MLLMs for video-related\u0000tasks. However, most efforts concentrate on enhancing the vision encoder and\u0000projector components, while the core part, Large Language Models (LLMs),\u0000remains comparatively under-explored. In this paper, we propose two strategies\u0000to enhance the model's capability in video understanding tasks by improving\u0000inter-layer attention computation in LLMs. Specifically, the first approach\u0000focuses on the enhancement of Rotary Position Embedding (RoPE) with\u0000Temporal-Aware Dual RoPE, which introduces temporal position information to\u0000strengthen the MLLM's temporal modeling capabilities while preserving the\u0000relative position relationships of both visual and text tokens. The second\u0000approach involves enhancing the Attention Mask with the Frame-wise Block Causal\u0000Attention Mask, a simple yet effective method that broadens visual token\u0000interactions within and across video frames while maintaining the causal\u0000inference mechanism. Based on these proposed methods, we adapt LLaVA for video\u0000understanding tasks, naming it Temporal-Considered LLaVA (TC-LLaVA). Our\u0000TC-LLaVA achieves new state-of-the-art performance across various video\u0000understanding benchmarks with only supervised fine-tuning (SFT) on\u0000video-related datasets.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"84 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Harnessing LLMs for Cross-City OD Flow Prediction","authors":"Chenyang Yu, Xinpeng Xie, Yan Huang, Chenxi Qiu","doi":"arxiv-2409.03937","DOIUrl":"https://doi.org/arxiv-2409.03937","url":null,"abstract":"Understanding and predicting Origin-Destination (OD) flows is crucial for\u0000urban planning and transportation management. Traditional OD prediction models,\u0000while effective within single cities, often face limitations when applied\u0000across different cities due to varied traffic conditions, urban layouts, and\u0000socio-economic factors. In this paper, by employing Large Language Models\u0000(LLMs), we introduce a new method for cross-city OD flow prediction. Our\u0000approach leverages the advanced semantic understanding and contextual learning\u0000capabilities of LLMs to bridge the gap between cities with different\u0000characteristics, providing a robust and adaptable solution for accurate OD flow\u0000prediction that can be transferred from one city to another. Our novel\u0000framework involves four major components: collecting OD training datasets from\u0000a source city, instruction-tuning the LLMs, predicting destination POIs in a\u0000target city, and identifying the locations that best match the predicted\u0000destination POIs. We introduce a new loss function that integrates POI\u0000semantics and trip distance during training. By extracting high-quality\u0000semantic features from human mobility and POI data, the model understands\u0000spatial and functional relationships within urban spaces and captures\u0000interactions between individuals and various POIs. Extensive experimental\u0000results demonstrate the superiority of our approach over the state-of-the-art\u0000learning-based methods in cross-city OD flow prediction.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"81 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pranay Thangeda, Trevor S. Betz, Michael N. Grussing, Melkior Ornik
{"title":"InfraLib: Enabling Reinforcement Learning and Decision Making for Large Scale Infrastructure Management","authors":"Pranay Thangeda, Trevor S. Betz, Michael N. Grussing, Melkior Ornik","doi":"arxiv-2409.03167","DOIUrl":"https://doi.org/arxiv-2409.03167","url":null,"abstract":"Efficient management of infrastructure systems is crucial for economic\u0000stability, sustainability, and public safety. However, infrastructure\u0000management is challenging due to the vast scale of systems, stochastic\u0000deterioration of components, partial observability, and resource constraints.\u0000While data-driven approaches like reinforcement learning (RL) offer a promising\u0000avenue for optimizing management policies, their application to infrastructure\u0000has been limited by the lack of suitable simulation environments. We introduce\u0000InfraLib, a comprehensive framework for modeling and analyzing infrastructure\u0000management problems. InfraLib employs a hierarchical, stochastic approach to\u0000realistically model infrastructure systems and their deterioration. It supports\u0000practical functionality such as modeling component unavailability, cyclical\u0000budgets, and catastrophic failures. To facilitate research, InfraLib provides\u0000tools for expert data collection, simulation-driven analysis, and\u0000visualization. We demonstrate InfraLib's capabilities through case studies on a\u0000real-world road network and a synthetic benchmark with 100,000 components.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Sequential Decision-Making Model for Perimeter Identification","authors":"Ayal Taitler","doi":"arxiv-2409.02549","DOIUrl":"https://doi.org/arxiv-2409.02549","url":null,"abstract":"Perimeter identification involves ascertaining the boundaries of a designated\u0000area or zone, requiring traffic flow monitoring, control, or optimization.\u0000Various methodologies and technologies exist for accurately defining these\u0000perimeters; however, they often necessitate specialized equipment, precise\u0000mapping, or comprehensive data for effective problem delineation. In this\u0000study, we propose a sequential decision-making framework for perimeter search,\u0000designed to operate efficiently in real-time and require only publicly\u0000accessible information. We conceptualize the perimeter search as a game between\u0000a playing agent and an artificial environment, where the agent's objective is\u0000to identify the optimal perimeter by sequentially improving the current\u0000perimeter. We detail the model for the game and discuss its adaptability in\u0000determining the definition of an optimal perimeter. Ultimately, we showcase the\u0000model's efficacy through a real-world scenario, highlighting the identification\u0000of corresponding optimal perimeters.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Constantin Waubert de Puiseau, Fabian Wolz, Merlin Montag, Jannik Peters, Hasan Tercan, Tobias Meisen
{"title":"Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem","authors":"Constantin Waubert de Puiseau, Fabian Wolz, Merlin Montag, Jannik Peters, Hasan Tercan, Tobias Meisen","doi":"arxiv-2409.02697","DOIUrl":"https://doi.org/arxiv-2409.02697","url":null,"abstract":"The job shop scheduling problem (JSSP) and its solution algorithms have been\u0000of enduring interest in both academia and industry for decades. In recent\u0000years, machine learning (ML) is playing an increasingly important role in\u0000advancing existing and building new heuristic solutions for the JSSP, aiming to\u0000find better solutions in shorter computation times. In this paper we build on\u0000top of a state-of-the-art deep reinforcement learning (DRL) agent, called\u0000Neural Local Search (NLS), which can efficiently and effectively control a\u0000large local neighborhood search on the JSSP. In particular, we develop a method\u0000for training the decision transformer (DT) algorithm on search trajectories\u0000taken by a trained NLS agent to further improve upon the learned\u0000decision-making sequences. Our experiments show that the DT successfully learns\u0000local search strategies that are different and, in many cases, more effective\u0000than those of the NLS agent itself. In terms of the tradeoff between solution\u0000quality and acceptable computational time needed for the search, the DT is\u0000particularly superior in application scenarios where longer computational times\u0000are acceptable. In this case, it makes up for the longer inference times\u0000required per search step, which are caused by the larger neural network\u0000architecture, through better quality decisions per step. Thereby, the DT\u0000achieves state-of-the-art results for solving the JSSP with ML-enhanced search.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL","authors":"Mohammad Reshadati","doi":"arxiv-2409.02711","DOIUrl":"https://doi.org/arxiv-2409.02711","url":null,"abstract":"The developments in the field of generative AI has brought a lot of\u0000opportunities for companies, for instance to improve efficiency in customer\u0000service and automating tasks. PostNL, the biggest parcel and E-commerce\u0000corporation of the Netherlands wants to use generative AI to enhance the\u0000communication around track and trace of parcels. During the internship a\u0000Minimal Viable Product (MVP) is created to showcase the value of using\u0000generative AI technologies, to enhance parcel tracking, analyzing the parcel's\u0000journey and being able to communicate about it in an easy to understand manner.\u0000The primary goal was to develop an in-house LLM-based system, reducing\u0000dependency on external platforms and establishing the feasibility of a\u0000dedicated generative AI team within the company. This multi-agent LLM based\u0000system aimed to construct parcel journey stories and identify logistical\u0000disruptions with heightened efficiency and accuracy. The research involved\u0000deploying a sophisticated AI-driven communication system, employing\u0000Retrieval-Augmented Generation (RAG) for enhanced response precision, and\u0000optimizing large language models (LLMs) tailored to domain specific tasks. The MVP successfully implemented a multi-agent open-source LLM system, called\u0000SuperTracy. SuperTracy is capable of autonomously managing a broad spectrum of\u0000user inquiries and improving internal knowledge handling. Results and\u0000evaluation demonstrated technological innovation and feasibility, notably in\u0000communication about the track and trace of a parcel, which exceeded initial\u0000expectations. These advancements highlight the potential of AI-driven solutions\u0000in logistics, suggesting many opportunities for further refinement and broader\u0000implementation within PostNL operational framework.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"53 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142193871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}