{"title":"Impact of Data Sampling on Stability of Feature Selection for Software Measurement Data","authors":"Kehan Gao, T. Khoshgoftaar, Amri Napolitano","doi":"10.1109/ICTAI.2011.172","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.172","url":null,"abstract":"Software defect prediction can be considered a binary classification problem. Generally, practitioners utilize historical software data, including metric and fault data collected during the software development process, to build a classification model and then employ this model to predict new program modules as either fault-prone (fp) or not-fault-prone (nfp). Limited project resources can then be allocated according to the prediction results by (for example) assigning more reviews and testing to the modules predicted to be potentially defective. Two challenges often come with the modeling process: (1) high-dimensionality of software measurement data and (2) skewed or imbalanced distributions between the two types of modules (fp and nfp) in those datasets. To overcome these problems, extensive studies have been dedicated towards improving the quality of training data. The commonly used techniques are feature selection and data sampling. Usually, researchers focus on evaluating classification performance after the training data is modified. The present study assesses a feature selection technique from a different perspective. We are more interested in studying the stability of a feature selection method, especially in understanding the impact of data sampling techniques on the stability of feature selection when using the sampled data. Some interesting findings are found based on two case studies performed on datasets from two real-world software projects.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132373625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Hybrid Consensus and Clustering Method for Protein Structure Selection","authors":"Qingguo Wang, Yingzi Shang, Dong Xu","doi":"10.1109/ICTAI.2011.10","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.10","url":null,"abstract":"In protein tertiary structure prediction, a crucial step is to select near-native structures from a large number of predicted structural models. Over the years, many methods have been proposed for the protein structure selection problem. Despite significant advances, the discerning power of current approaches is still unsatisfactory. In this paper, we propose a new algorithm, CC-Select, that combines consensus with clustering techniques. Given a set of predicted models, CC-Select first calculates a consensus score for each structure based on its average pair wise structural similarity to other models. Then, similar structures are grouped into clusters using multidimensional scaling and clustering algorithms. In each cluster, the one with the highest consensus score is selected as a candidate model. Using extensive benchmark sets of a large collection of predicted models, we compare CC-Select with existing state-of-the-art quality assessment methods and show significant improvement.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128384644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Developing Strategies for Improving Planning and Scheduling of Actions in RTS Games","authors":"A. A. Branquinho, C. R. Lopes, Thiago F. Naves","doi":"10.1109/ICTAI.2011.21","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.21","url":null,"abstract":"This article describes techniques developed for generating and scheduling actions using partial order planning and SLA* in the production of resources for Real-time Strategy (RTS) Games. RTS games are characterized by two important steps. In the first step a plan of action should be carried out to produce resources. In the second step, the resources produced in the former step are employed in battles against the enemy. Resource production is vital to succeed in this sort of game. The developed algorithms significantly decrease the make span, which is the time required for execution of the actions that achieve the goal for resource production.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130655150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Incremental Kernel Mapping Algorithms for Scalable Recommender Systems","authors":"M. Ghazanfar, S. Szedmák, A. Prügel-Bennett","doi":"10.1109/ICTAI.2011.183","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.183","url":null,"abstract":"Recommender systems apply machine learning techniques for filtering unseen information and can predict whether a user would like a given item. Kernel Mapping Recommender (KMR) system algorithms have been proposed, which offer state-of-the-art performance. One potential drawback of the KMR algorithms is that the training is done in one step and hence they cannot accommodate the incremental update with the arrival of new data making them unsuitable for the dynamic environments. From this line of research, we propose a new heuristic, which can build the model incrementally without retraining the whole model from scratch when new data (item or user) are added to the recommender system dataset. Furthermore, we proposed a novel perceptron-type algorithm, which is a fast incremental algorithm for building the model that maintains a good level of accuracy and scales well with the data. We show empirically over two datasets that the proposed algorithms give quite accurate results while providing significant computation savings.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121481609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sergio Esparcia, Roberto Centeno, Ramón Hermoso, E. Argente
{"title":"Artifacting and Regulating the Environment of a Virtual Organization","authors":"Sergio Esparcia, Roberto Centeno, Ramón Hermoso, E. Argente","doi":"10.1109/ICTAI.2011.88","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.88","url":null,"abstract":"This work presents an extension of the Environment Dimension of the Virtual Organization Model, which is an Organization Modeling Language to define Organization-Centered Multi-Agent Systems. This extension allows this model to regulate the environment by supporting artifacts for organizational mechanisms, an approach based on the Agents & Artifacts conceptual framework. The three main entities of this framework are agents, artifacts and workspaces, which have been integrated in this work inside the Virtual Organization Model. Additionally, this paper presents an application to the health care setting and an analysis of the related work on this topic.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"158 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121824556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fuad Mousse Abinader Jr., A. C. S. D. Queiroz, Daniel W. Honda
{"title":"Self-Organized Hierarchical Methods for Time Series Forecasting","authors":"Fuad Mousse Abinader Jr., A. C. S. D. Queiroz, Daniel W. Honda","doi":"10.1109/ICTAI.2011.180","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.180","url":null,"abstract":"Time series forecasting with the use of Artificial Neural Networks (ANN), in special with self-organized maps (SOM), has been explored in the literature with good results. One good strategy for improving computational cost and specialization of SOMs in general is constructing it via hierarchical structures. This work presents four different heuristics for constructing hierarchical SOMs for time series prediction, evaluating their computational cost and forecast precision and providing insight on future enhancements.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123647405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal One-Max Strategy with Dynamic Island Models","authors":"Adrien Goëffon, F. Lardeux","doi":"10.1109/ICTAI.2011.79","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.79","url":null,"abstract":"In this paper, we recall the dynamic island model concept, in order to dynamically select local search operators within a multi-operator genetic algorithm. We use a fully-connected island model, where each island is assigned to a local search operator. Selection of operators is simulated by migration steps, whose policies depend on a learning process. The efficiency of this approach is assessed in comparing, for the One-Max Problem, theoretical and ideal results to those obtained by the model. Experiments show that the model has the expected behavior and is able to regain the optimal local search strategy for this well-known problem.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125017749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adding Constraints and Flexible Preferences in Travel Reservation Systems","authors":"S. Benferhat, Abdelhamid Boudjelida","doi":"10.1109/ICTAI.2011.181","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.181","url":null,"abstract":"Preferences play an important role in many applications. This paper focuses on the use of preferences in travel agency systems such as E-travel system. We show that adding flexible constraints and preferences allows to reach solutions that better fit users's desires. We study simple languages for expressing preferences and queries. We then provide criteria to rank-order solutions returned by a travel agency system. Lastly, we provide experimental results showing the merits of our approach.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"126 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129796846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-view Transfer Learning with Adaboost","authors":"Zhijie Xu, Shiliang Sun","doi":"10.1109/ICTAI.2011.65","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.65","url":null,"abstract":"Transfer learning, serving as one of the most important research directions in machine learning, has been studied in various fields in recent years. In this paper, we integrate the theory of multi-view learning into transfer learning and propose a new algorithm named Multi-View Transfer Learning with Adaboost (MV-TL Adaboost). Different from many previous works on transfer learning, we not only focus on using the labeled data from one task to help to learn another task, but also consider how to transfer them in different views synchronously. We regard both the source and target task as a collection of several constituent views and each of these two tasks can be learned from every views at the same time. Moreover, this kind of multi-view transfer learning is implemented with adaboost algorithm. Furthermore, we analyze the effectiveness and feasibility of MV-TL Adaboost. Experimental results also validate the effectiveness of our proposed approach.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129981079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hadrien Cambazard, D. Mehta, B. O’Sullivan, L. Quesada, M. Ruffini, D. Payne, L. Doyle
{"title":"A Combinatorial Optimisation Approach to the Design of Dual Parented Long-Reach Passive Optical Networks","authors":"Hadrien Cambazard, D. Mehta, B. O’Sullivan, L. Quesada, M. Ruffini, D. Payne, L. Doyle","doi":"10.1109/ICTAI.2011.123","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.123","url":null,"abstract":"We present an application focused on the design of resilient long-reach passive optical networks. We specifically consider dual parented networks whereby each customer must be connected to two metro sites via a local exchange sites. An important property of such a placement is resilience to single metro node failure. The objective of the application is to determine the optimal position of a set of metro-nodes such that the total optical fibre length is minimised. We prove that the decision variant of this problem is NP-Complete. We present three alternative combinatorial optimisation approaches to finding an optimal metro node placement using: a mixed integer linear programming formulation of the problem, a hybrid approach that uses clustering as a preprocessing step, and, finally, a local search approach. We consider a detailed case-study based on a network for Ireland. The hybrid approach scales well and finds solutions that are close to optimal, with a runtime that is two orders-of-magnitude better than the MIP model. The local search approach is consistently good on all benchmarks.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"143 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124578993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}