{"title":"Identifying text reuse using word net-based extended named entity recognition","authors":"Eunji Lee, Pankoo Kim","doi":"10.1145/3264746.3264811","DOIUrl":"https://doi.org/10.1145/3264746.3264811","url":null,"abstract":"Text reuse is an unethical practice that has become prominent in information content digitization owing to the spread of the internet and smartphones. One challenge with text reuse is that it can be difficult to detect if there are changes in the word order and words are inserted, deleted, or replaced. To resolve the issue of words being excluded from similarity measurement targets when they are replaced with words having a similar meaning, this paper proposes a method of measuring similarity in which named entity recognition is performed on the words appearing in the target document and named entity tags are annotated to them. However, typical named entity recognition only targets proper nouns, so when common nouns are replaced with similar words, they are not classified as named entities belonging to the same class. To resolve this problem, we have expanded the range of WordNetbased named entity recognition.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"34 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123093179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimization of vehicle routing problem with fatigue driving based on genetic algorithm","authors":"Jiacheng Li, Jiaoman Du, Lei Li","doi":"10.1145/3264746.3264782","DOIUrl":"https://doi.org/10.1145/3264746.3264782","url":null,"abstract":"In order to better solve the logistics distribution problems and improve customer satisfaction, aiming at minimizing total cost, the author adds a variable that restricts drivers' fatigue driving in the model, and builds a model of route optimization based on heterogeneous vehicles, so as to design a single-parent genetic algorithm for this model, and validates the algorithm by the delivery case of Japan Takkyubin Corporation. The numerical results of the example show that the logistics distribution route optimization scheme based on the single-parent genetic algorithm can meet the customer's cargo and time requirements, and can reduce vehicle use costs, save early or late penalty costs, and improve the company's economic interests. This study provides new solution ideas for improving delivery issues.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115052579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jingyi Chen, Xin Su, Xuewu Zhang, Chang Choi, Dongmin Choi
{"title":"A survey on ocean observatory networks","authors":"Jingyi Chen, Xin Su, Xuewu Zhang, Chang Choi, Dongmin Choi","doi":"10.1145/3264746.3264800","DOIUrl":"https://doi.org/10.1145/3264746.3264800","url":null,"abstract":"Ocean observation networks have become important data acquisition platform for marine scientific research. This paper first summarizes the characteristics of the ocean observation networks, briefly introduces the development history of the world in the field of ocean observatory networks, and then gives a detailed introduction to the research progress of the ocean observatory networks in various countries. The paper also points out the key technologies and initial solutions of the ocean observatory network system, and discusses the next step development of the seabed observatory networks.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"223 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124588321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An efficient load balancing multicast scheduling for solving congestion problem in social data center networks","authors":"Hsueh-Wen Tseng, Ya-Ju Yu, Kai-Hsu Hsieh","doi":"10.1145/3264746.3264763","DOIUrl":"https://doi.org/10.1145/3264746.3264763","url":null,"abstract":"Recently, social network sites have become more popular. The web site traffic on Facebook reaches 22.36% proportion of global data traffic in the world. There are various and complex kinds of data types such as texts, photographs, and videos which are transmitted in social network sites. In social networks, cloud services are generally accomplished by multicast-based group communications. The extensive data of social networks is generated within a relatively short period of time and is concentrated on partial servers. Subsequently, the rate of multicast congestion increases substantially, resulting in severe packet loss and transmission error. Therefore, we study the congestion problem of multicast-based group communications in the social data center network. Then, we propose an efficient load balancing multicast scheduling (LBMS) by observing users' behaviors on the social network to alleviate the congestion problems of the multicast traffic. Simulation results shows that LBMS can achieve load balance and significantly improve throughput and average delay.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123835525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Energy-aware task scheduling strategies with QoS constraint for green computing in cloud data centers","authors":"Xing Liu, Panwen Liu, Hongjing Li, Zheng Li, Chengming Zou, Haiying Zhou, Xin Yan, Ruoshi Xia","doi":"10.1145/3264746.3264792","DOIUrl":"https://doi.org/10.1145/3264746.3264792","url":null,"abstract":"Energy optimization with Quality-of-Service (QoS) constraint has become a timely and significant challenge for the cloud datacenters. In this paper, a hardware and software collaborative optimization strategy is implemented to minimize the energy cost while satisfying the time constraint of the cloud-computing datacenters. In the hardware aspect, a DVFS-capable CPU/GPU/FPGA heterogeneous cloud infrastructure is built. This infrastructure has high flexibility, and can adjust its hardware characteristics dynamically in terms of the software run-time contexts, so that a hardware platform which matches the software can be built. Based on this hardware platform, the cloud applications can be executed more efficiently with less energy cost. In the software aspect, the deadline-aware energy-efficient task scheduling algorithms are investigated. Different from the traditional approaches which search for the optimal scheduling solution by the heuristic approaches, a new scheduling approach based on the improved Mathematical Morphology (MM) algorithm is investigated in this paper. To evaluate the performance of our work, we calculated the energy cost of the Fourier transform (FT) and Gaussian elimination (GE) applications on the homogeneous and heterogeneous cloud computing platforms by applying the GA and MM algorithms, respectively. The results proved the MM algorithms running on the DVFS-capable heterogeneous cloud infrastructure could decrease the energy cost of the FT application and GE application respectively by 24.7% and 37.8%, if compared with the GA algorithm running on the DVFS-incapable homogeneous cloud infrastructure.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"44 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120922904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Abstractive summarization by neural attention model with document content memory","authors":"YunSeok Choi, Dahae Kim, Jee-Hyong Lee","doi":"10.1145/3264746.3264773","DOIUrl":"https://doi.org/10.1145/3264746.3264773","url":null,"abstract":"In this paper, we propose a generative approach for abstractive summarization, which creates summaries based on a language model. The main goal of our paper is to generate a long sequence of words with coherent sentences by reflecting the key concepts of the original document and the characteristics of summaries. To achieve this goal, we propose an attention mechanism that uses Document Content Memory for learning the language model effectively. To evaluate its effectiveness, the proposed methods are compared with other language models and an extractive summarization method. The results demonstrated that the proposed methods could be competitive with other approaches.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115978836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dimensionality reduction by bayesian eigenvalue-analysis for state prediction in large sensor systems: with application in wind turbines","authors":"J. Herp, E. Nadimi","doi":"10.1145/3264746.3264753","DOIUrl":"https://doi.org/10.1145/3264746.3264753","url":null,"abstract":"The potential of the theory of random matrices are presented and evaluated as a statistical tool to represent the empirical correlations in a study of multivariate time series. A new sub space state prediction framework is proposed, consisting of the combination of a Bayesian state prediction algorithm and the eigenvalues of the empirical correlation matrix. In an industrial use-case of wind turbines, remarkable agreement between the theoretical prediction (based on the assumption that the correlation matrix is random) and empirical data, concerning the density of eigenvalues associated with the time series of different sensors, are found. Finally, the proposed framework outperforms the existing Bayesian state prediction algorithm and is computationally more feasible than feeding unprocessed data.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127916475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
W. Abeysinghe, C. Hung, Slim Bechikh, Xiaosong Wang, Altaf Rattani
{"title":"Clustering algorithms on imbalanced data using the SMOTE technique for image segmentation","authors":"W. Abeysinghe, C. Hung, Slim Bechikh, Xiaosong Wang, Altaf Rattani","doi":"10.1145/3264746.3264774","DOIUrl":"https://doi.org/10.1145/3264746.3264774","url":null,"abstract":"Imbalanced data is a critical problem in machine learning. Most imbalanced dataset consists of one or more classes, called the minority class, which do not have enough number of samples for the recognition. Many traditional classification algorithms are unable to recognize the minority class effectively. Clustering algorithms used for image segmentation may have a high accuracy; however, none of samples in the minority class is classified correctly. In this study, we use three approaches, traditional oversampling technique, traditional undersampling technique, and the Synthetic Minority Over-sampling Technique (SMOTE), to reduce the significant difference of imbalance of the number of samples between the majority classes and the minority classes in the dataset. Fuzzy C-means algorithm (FCM) and Possibilistic Clustering Algorithm (PCA) are used to segment the images in which the samples are generated using above sampling methods. Experimental results are evaluated using the Kappa Coefficient and Confusion matrix. Our evaluation shows that the oversampling, undersampling, and SMOTE techniques can improve the imbalanced image segmentation problem with a higher accuracy[1].","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126828055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Carpentieri, Arcangelo Castiglione, A. D. Santis, F. Palmieri, Raffaele Pizzolante
{"title":"Data hiding using compressed archives","authors":"B. Carpentieri, Arcangelo Castiglione, A. D. Santis, F. Palmieri, Raffaele Pizzolante","doi":"10.1145/3264746.3264752","DOIUrl":"https://doi.org/10.1145/3264746.3264752","url":null,"abstract":"The need to avoid suspicion in a potential observer/investigator, though it can be seen as a weak requirement for the protection of transmitted data, it can be extremely useful. For example, once discovered that the data has been protected by means of certain security tools, a malicious user that is strongly interested in such data could make a great effort in trying to remove the protection, for example, by cracking the password or by exploiting the vulnerabilities of the algorithm that was used to protect the data. Hence, to transfer information in a secure way, it is sometimes better to hide it inside other data, rather than protecting it in a conventional manner. One of the most important techniques in data hiding is steganography. In this work we propose a novel steganographic method which enables to hide information by using a compressed archive as information carrier. The method we propose uses these compressed archives, together with their relative hierarchical structure, to hide bits of information. We remark that by means of our method the hidden information does not have any semantic relation with respect to the content of the compressed archive, whereas such relation will concern how the archive (and the relative sub-archives) was compressed, i.e., what compression algorithms and relative parameters were used, as well as how this operation was performed. Since it is a common practice to send compressed data by means of a communication channel, we believe that our proposal can be a valid tool for steganographic purposes, since the suspect generated in a potential observer could be considered almost negligible.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114237727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sae-Han Suh, Ji-Eun Jhang, Kwanghee Won, Sung Y. Shin, C. Sung
{"title":"Development of vegetation mapping with deep convolutional neural network","authors":"Sae-Han Suh, Ji-Eun Jhang, Kwanghee Won, Sung Y. Shin, C. Sung","doi":"10.1145/3264746.3264791","DOIUrl":"https://doi.org/10.1145/3264746.3264791","url":null,"abstract":"The Precision Agriculture (PA) plays a crucial part in the agricultural industry about improving the decision-making process. It aims to optimally allocate the resources to maintain the sustainable productivity of farmland and reduce the use of chemical compounds. [17] However, the on-site inspection of vegetations often falls to researchers' trained eye and experience, when it deals with the identification of the non-crop vegetations. Deep Convolution Neural Network (CNN) can be deployed to mitigate the cost of manual classification. Although CNN outperforms the other traditional classifiers, such as Support Vector Machine, it is still in question whether CNN can be deployable in an industrial environment. In this paper, we conducted a study on the feasibility of CNN for Vegetation Mapping on lawn inspection for weeds. We would like to study the possibility of expanding the concept to the on-site, near realtime, crop site inspections, by evaluating the generated results.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114840697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}