Vânia Cecchini, T. Nguyen, Thomas Pfau, S. D. Landtsheer, T. Sauter
{"title":"An Efficient Machine Learning Method to Solve Imbalanced Data in Metabolic Disease Prediction","authors":"Vânia Cecchini, T. Nguyen, Thomas Pfau, S. D. Landtsheer, T. Sauter","doi":"10.1109/KSE.2019.8919337","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919337","url":null,"abstract":"The increase of obesity, its related diseases and the high incidence of metabolic diseases as a whole, constitute a major public health problem on a global scale. New strategies that allow for the discovery of novel metabolic disease-related genes are necessary to develop new treatments. In this paper, we proposed an efficient method to predict metabolic disease genes, solving the problem of imbalanced data. The method combined protein-protein interactions and miRNA-target interactions to construct integrated networks, whose topological properties can be used as features to train machine learning classifiers. We applied different strategies to optimize imbalanced class. The best model of gradient boosting achieved a significant F1-score of 0.82. When testing the model with non-disease genes, we predicted 549 candidates, out of which 123 were validated indirectly from literature to be related to metabolic diseases. The remaining genes’ functions were investigated by gene enrichment analysis, revealing their association with diseases known to co-occur with metabolic diseases, such as cancer and cardiovascular conditions. These results indicated that this method contributed to the identification of novel metabolic disease-related genes.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128780606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Annotation Intent Identification toward Enhancement of Marketing Campaign Performance","authors":"Sorratat Sirirattanajakarin, B. Suntisrivaraporn","doi":"10.1109/KSE.2019.8919386","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919386","url":null,"abstract":"An annotation is a short free text associating with each financial transaction made on SCB Easy, one of the leading mobile banking applications in Thailand. Identifying the intent(s) hidden within this text is a challenging task in Natural Language Processing (NLP). On the one hand, the availability of labeled data is limited, and on the other hand, an annotation oftentimes carries multiple intents. This paper describes AI2, an NLP framework for identifying multiple intents from short annotations. The framework is based on long short-term memory architectures. We conduct two sets of empirical experiments: one helps identify the best architecture between MIBC and SMLC; and the other finds the optimal model parameters. Apart from assuring experimental results, AI2 predictions are used in marketing campaigns with promising performance uplifts.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121835274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Less Visually Different Objects Recognition with Aggregated Residual Siamese Network","authors":"A. Nguyen, A. Yoshitaka","doi":"10.1109/KSE.2019.8919487","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919487","url":null,"abstract":"Deep learning has been increasingly achieved advanced success in recent years and becomes a robust technique in the field of computer vision and image processing. However, massive data sets are required to obtain such power when training deep neural networks. On the other hand, many studies succeed in the task of general object recognition, for example, ImageNet Large Scale Visual Recognition Challenge, but there are very few studies that directly deal with the problem of less visually different objects. In this paper, we introduce a challenging problem of visual classification, in which objects are very similar and out-reaching of human’s ability, based on a proposed definition of similarity / less visual difference. Besides, the research also focuses on the backbone drawback of deep learning that is data limitation. A challenging data set, which we call HAM10K-limited, is collected with very few samples and imbalance between classes for evaluating methods. To tackle those issues, a convolutional neural network is proposed, named Aggregated Residual Siamese Network. The empirical results show that the proposed architecture achieves a significant improvement from previous works.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129133499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. H. Hai, D. N. Thanh, N. Hien, H. Premachandra, V. Prasath
{"title":"A Fast Denoising Algorithm for X-Ray Images with Variance Stabilizing Transform","authors":"N. H. Hai, D. N. Thanh, N. Hien, H. Premachandra, V. Prasath","doi":"10.1109/KSE.2019.8919364","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919364","url":null,"abstract":"We propose a fast denoising algorithm for X-Ray images with variance stabilizing transformations. The variance stabilizing transformations are used to transform Poisson noisy images to Gaussian noisy images. Therefore, we can utilize advantages of the fast denoising algorithm based on the alternative direction method of multipliers. In experiments, we evaluate denoising quality by the Peak signal-to-noise ratio and the Structure Similarity metrics. Comparing results show that our method outperforms other similar denoising methods.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127646203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"KSE 2019 Technical Program Committee","authors":"","doi":"10.1109/kse.2019.8919402","DOIUrl":"https://doi.org/10.1109/kse.2019.8919402","url":null,"abstract":"","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129364217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extraction process of conceptual model from a document-oriented NoSQL database","authors":"A. A. Brahim, Rabah Tighilt Ferhat, G. Zurfluh","doi":"10.1109/KSE.2019.8919400","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919400","url":null,"abstract":"NoSQL systems are used to manage massive databases that verify 3V: Volume, Variety and Velocity. Generally, these systems are known by the characteristic \"schema less\" which means that we can create a database without defining the data schema beforehand. This property offers more flexibility and speed by allowing the evolution of the data model during the exploitation of the base. However, to formulate queries on the database, the user needs a precise knowledge of data model. In this article, we propose a process for the automatic extraction of the conceptual model of a document-oriented NoSQL database. To do this, we use the Model Driven Architecture (MDA) architecture that provides a formal framework for automatic model transformation. From a NoSQL database, we propose a set of transformation rules with QVT to generate the conceptual model in the form of a UML class diagram. An experimentation of the extraction process was carried out on an application in the medical field..","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"251 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129460431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning to Transform Vietnamese Natural Language Queries into SQL Commands","authors":"Thi-Hai-Yen Vuong, Thi-Thu-Trang Nguyen, Nhu-Thuat Tran, Le-Minh Nguyen, X. Phan","doi":"10.1109/KSE.2019.8919393","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919393","url":null,"abstract":"In the field of data management, users traditionally manipulates their data using structured query language (SQL). However, this method requires an understanding of relational database, data schema, and SQL syntax as well as the way it works. Database manipulation using natural language, therefore, is much more convenient since any normal user can interact with their data without a background of database and SQL. This is, however, really tough because transforming natural language commands into SQL queries is a challenging task in natural language processing and understanding. In this paper, we propose a novel two–phase approach to automatically analyzing and converting natural language queries into the corresponding SQL forms. In our approach, the first phase is component segmentation which identifies primary clauses in SQL such as SELECT, FROM, WHERE, ORDER BY, etc. The second phase is slot– filling that helps extract sub–components for each primary clause such as SELECT column(s), SELECT aggregation operation, etc. We carefully conducted an empirical evaluation for our method using conditional random fields (CRFs) on a medium–sized corpus of natural language queries in Vietnamese, and have achieved promising results with an average accuracy of more than 90%.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132896662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Thang, Duc-Man Nguyen, Nhu-Hang Ha, Trung-Kien Pham, Phuong-Thao Nguyen, Van-Dao Tran
{"title":"A Combinatorial Technique for Mobile Applications Software Testing","authors":"H. Thang, Duc-Man Nguyen, Nhu-Hang Ha, Trung-Kien Pham, Phuong-Thao Nguyen, Van-Dao Tran","doi":"10.1109/KSE.2019.8919456","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919456","url":null,"abstract":"In this article, we propose a combinatorial testing technique for mobile application, thereby suggesting a combination of test-driven data generation techniques as well as developing a support tool named CTGen which using the IPO algorithm to generate test data with input model files, the tool also supports code generation for JUnit testing. The experiment results were compared in terms of test coverage, effort to build a test case with the PICT tool for positive results. Depending on the purpose of developer or tester, testing effort or reliability of application to use appropriate approaches and tools.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130149894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Abstract Argumentation for Summarizing Product Reviews: A Case Study in Shopee Thailand","authors":"Teeradaj Racharak","doi":"10.1109/KSE.2019.8919483","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919483","url":null,"abstract":"When reading online reviews of products, restaurants, and hotels, internet users usually appreciate argumentative reviews rather than plain opinions. Building intelligent systems capable of summarizing useful reviews can help the users to grasp more information about features of a product and understand if it fits to their needs. In this paper, we conduct an empirical study on computational argumentation (Dung’s abstract argumentation) in Shopee.co.th reviews. For that, we first extract positive (in favor of purchase) and negative (against it) arguments from a particular randomly chosen product. Second, we link extracted arguments by hand to understand how they are connected. In such a way, we realize that summarizing product reviews that goes beyond plain opinions, i.e. being a collection of admissible arguments, can be helpful to both online buyers and sellers.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"57 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132531281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Recovery Data Technique on MLC NAND Flash Memory","authors":"Tran Van Dai, Jingi Park, Dong-Joo Park","doi":"10.1109/KSE.2019.8919382","DOIUrl":"https://doi.org/10.1109/KSE.2019.8919382","url":null,"abstract":"Flash memory today is more popular because of its advantages, such as low power consumption, high mobility and fast data access. In NAND flash memory, a solution called Flash Translation Layer (FTL) was proposed to solve its disadvantages like erase-before-write and unsymmetrical read or write response time. During the process of using the data, the data might be lost on the power failure in the systems. In some systems, the data is very important. Hence, recovery of data in the event of the system crash or a sudden power outage is of prime importance. One of the methods to fix is the error code correction (ECC), supplied with the flash device by the manufacturer. In the process of power off failure recovery, there have been previous schemes such as In-Page Backup, In-Block Backup, Hybrid Backup, A-PLR (Accumulation based Power Loss Recovery), HYFLUR (Hybrid FLUsh Recovery), and C-HYFLUR (Compression scheme for HYFLUR). In this paper, we introduce a technique based on the page leveling mapping using the spare area in FTL divided into ECC, map information, and reserved. To estimate the performance of this technique, we compare the recovery time and mapping information management cost of our approach with those of previous schemes such as In-Block Backup and In-Page Backup.","PeriodicalId":439841,"journal":{"name":"2019 11th International Conference on Knowledge and Systems Engineering (KSE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123843274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}