{"title":"HMM-Based Dari Named Entity Recognition for Information Extraction","authors":"Ghezal Ahmad Jan Zia, Ahmad Zia Sharif","doi":"10.5121/CSIT.2019.90706","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90706","url":null,"abstract":"Named Entity Recognition (NER) is the fundamental subtask of information extraction systems that labels elements into categories such as persons, organizations or locations. The task of NER is to detect and classify words that are parts of sentences. This paper describes a statistical approach to modeling NER in Dari language. Dari and Pashto are low resources languages, spoken as official languages in Afghanistan. Unlike other languages, named entity detection approaches differ in Dari. Since in Dari language there is no capitalization for identifying named entities. We seek to bridge the gap between Dari linguistic structure and supervised learning model that predict the sequences of words paired with a sequence of tags as outputs. Dari corpus was developed from the collection of news, reports and articles based on the original orthographic structure of the Dari language. The experimental result of named entity recognition performance presents 94% accuracy.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"55 31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123332796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Approach to Tracking Problem for Linear Control System Via Invariant Ellipsoids Method","authors":"M. Khlebnikov","doi":"10.5121/CSIT.2019.90714","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90714","url":null,"abstract":"In this paper, a simple yet universal approach to the tracking problem for linear control systems via the linear static combined feedback is proposed. The approach is based on the invariant ellipsoid concept and LMI technique, where the optimal control design reduced to finding the minimal invariant ellipsoid for the closed-loop system. With such an ideology, the control design problem directly reduces to a semidefinite programming and one-dimensional minimization. Another attractive property of the proposed approach is that it is equally applicable to discrete-time control systems. The efficacy of the technique is illustrated via a benchmark problem.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133089958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Machine Learning Algorithm in Automated Text Categorization of Legacy Archives","authors":"Dali Wang, Ying Bai, David Hamblin","doi":"10.5121/CSIT.2019.90701","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90701","url":null,"abstract":"The goal of this research is to develop an algorithm to automatically retrieve critical information from raw data files in NASA’s airborne measurement data archive. The product has to meet specific metrics in term of accuracy, robustness and usability, as the initial decision-tree based development has shown limited applicability due to its resource intensive characteristics. We have developed an innovative solution that is much less resource intensive while offering comparable performance. As with many practical applications, the data available are noisy and correlated; and there is a wide range of features that are associated with the information to be retrieved. The proposed algorithm uses a decision tree to select features and determine their weights. A weighted Naive Bayes is used due to the presence of highly correlated inputs. The development has been successfully deployed in an industrial scale, and the results show that the development is well-balanced in term of performance and resource requirements.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114128101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Factors Affecting Classification Algorithms Recommendation: A Survey","authors":"M. Reda, M. Nassef, A. Salah","doi":"10.5121/CSIT.2019.90707","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90707","url":null,"abstract":"A lot of classification algorithms are available in the area of data mining for solving the same kind of problem with a little guidance for recommending the most appropriate algorithm to use which gives best results for the dataset at hand. As a way of optimizing the chances of recommending the most appropriate classification algorithm for a dataset, this paper focuses on the different factors considered by data miners and researchers in different studies when selecting the classification algorithms that will yield desired knowledge for the dataset at hand. The paper divided the factors affecting classification algorithms recommendation into business and technical factors. The technical factors proposed are measurable and can be exploited by recommendation software tools.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127554560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CHEMCONNECT: An Ontology-Based Repository of Experimental Devices and Observations","authors":"Edward S. Blurock","doi":"10.5121/CSIT.2019.90709","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90709","url":null,"abstract":"CHEMCONNECT is an ontology cloud-based repository of experimental, theoretical and computational data for the experimental sciences domain. Currently, the emphasis is on the chemical combustion community, but in future work (in collaboration with domain experts) the domain will be expanded. CHEMCONNECT goes beyond traditional meta-data annotated scientific result repositories in that the data is parsed and analysed with respect to an extensive chemical and combustion knowledge base. The parsed data is then inter-linked allowing for efficient searching and comparison. The goal is to link all data associated with experiments, including the device description, the intermediate data (both computed and measured), the associated interpretations, procedures and methodologies used to produce the data and the final published results and references. Having published data linked to its dependent measurements and constants, devices, subsystems, sensors and even people and laboratories provides an effective accountability and more confidence in the data. Data entry and availability can range from private user, to user defined consortia to general public. These concepts are implemented at http://www.connectedsmartdata.info.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131764676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"AHP Under Uncertainty: A Modified Version of Cloud Delphi Hierarchical Analysis","authors":"A. A. Ahmad, Ghaida Rebdawi, Obaida Alsahli","doi":"10.5121/CSIT.2019.90703","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90703","url":null,"abstract":"Cloud Delphi Hierarchical Analysis (CDHA) is an Analytic Hierarchical Process (AHP) based method for group decision making under uncertain environments. CDHA adopts appropriate tools for such environments, namely Delphi method, and Cloud model. Adopting such tools makes it a promising AHP variant in handling uncertainty. In spite of CDHA is a promising method, it is still suffering from two main defects. The first one lies in its definition of the consistency index, the second one lies in the technique used in building the pairwise comparisons Cloud models. This paper will discuss these defects, and propose a modified version. To overcome the defects mentioned above, the modified version will depend more on the context of the interval pairwise comparisons matrix while building the corresponding Cloud pairwise comparisons matrix. A simple case study that involves reproducing the relative area sizes of four provinces in Syria will be used to illustrate the modified version and to compare it with the original one.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116158932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Resolution Enhancement of Electron Microscopic Volume by Volume Restoration Technique","authors":"A. Khan, Kishor Datta Gupta, Ariful Haque","doi":"10.5121/CSIT.2019.90710","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90710","url":null,"abstract":"The knowledge of structure of proteins, protein derived compounds and RNA structures in eukaryotic cell is mandatory to understand the functions of these macromolecules.With recent development of Direct Electron Detector Device (DDD) camera and application of maximum likelihood algorithms in volume reconstruction,cryo Electron Microscopy (cryo-EM) enables us to visualize the macromolecules in nearly physiological state. The current resolution limit of cryo-EM can be improved further by applying novel and effective signal processing algorithms after the EM workflow. In this work, a signal processing method has been developed to enhance the resolution of the EM volume through volume restoration techniques. We have proposed a novel technique to estimate the volume degradation function of the volume reconstruction system from the noise-only subvolumes of the observed EM volume. Then the volume is restored (utilizing the estimated volume degradation function) using a combination of regularized Richardson-Lucy and Wiener Deconvolution techniques. In addition to volume restoration, we have employed spatial de-noising techniques utilizing various morphological filters to reduce noise outside the main molecular structure. The experimental results demonstrate that the resolution (evaluated by ourier Shell Correlation curve) and visual quality of the EM volume can be significantly improved using our proposed technique.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127347795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ghezal Ahmad Jan Zia, Ahmad Zia Sharifi, Fazl Ahmad Amini, Niaz Mohammad Ramaki
{"title":"A Comparative Mention-Pair Models for Coreference Resolution in DARI Language for Information Extraction","authors":"Ghezal Ahmad Jan Zia, Ahmad Zia Sharifi, Fazl Ahmad Amini, Niaz Mohammad Ramaki","doi":"10.5121/CSIT.2019.90708","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90708","url":null,"abstract":"Coreference resolution plays an important role in Information Extraction.This paper covers the investigation of two strategies based on a mention-pair resolver using Decision Tree classifier on structured and unstructured dataset, targeting coreference resolution in Dari language. Strategies are (1) training separate models which is specialized in particular categories (e.g., lexical, syntactic and semantic) and types of mentions (e.g. pronouns, proper nouns) and (2) using a structured dataset on a machine learning library that is designed to classify numerical values. Moreover, these modifications and comparative models describe a contribution of comprehensive factors involved in the resolution of texts. Specifically, we developed the first Dari corpus (’DariCoref’) based on OntoNotes and WikiCoref scheme. Both strategies are produced f-score of state-of-the-art.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"129 13","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114065594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Solving the Binarization Challenges in Document Images Using OTSU Multilevel","authors":"Enas M. Elgbbas, M. Khalil, Hazem M. Abbas","doi":"10.5121/CSIT.2019.90715","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90715","url":null,"abstract":"This paper introduces a method for binarization of historical document images that suffer from non-uniform background, faint text, low contrast, stain, bleed-through, or shadow challenges. The proposed method adaptively detects the non-uniform background in the document image and eliminates it. Areas that contain missing text are adaptively identified and reprocessed separately. Stain and bleed-through objects are found depending on stroke width and locally binarized. Shadow is detected based on the image contrast. Otsu multilevel is applied for binarization. DIBCO series is used for testing. K EYWORDS Document image binarization, Otsu multilevel","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115354537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Safa Habibullah, Xiaodong Liu, Zhiyuan Simon Tan, Yonghong Zhang, Qi Liu
{"title":"Reviving Legacy Enterprise Systems with Micro service-Based Architecture with in Cloud Environments","authors":"Safa Habibullah, Xiaodong Liu, Zhiyuan Simon Tan, Yonghong Zhang, Qi Liu","doi":"10.5121/CSIT.2019.90713","DOIUrl":"https://doi.org/10.5121/CSIT.2019.90713","url":null,"abstract":"Evolution has always been a challenge for enterprise computing systems. The microservice based architecture is a new design model which is rapidly becoming one of the most effective means to re-architect legacy enterprise systems and to reengineer them into new modern systems at a relatively low cost. This architectural style has evolved based on a number of different approaches and standards. However, there are quite a few technical challenges which emerge when adopting microservices to revive a legacy enterprise system. In this paper, an evolution framework and a set of feature-driven microservices-oriented evolution rules have been proposed and applied to modernise legacy enterprise systems, with a special emphasis on analysing the implications as regards runtime performance, scalability, maintainability and testability. Testing and evaluation have been carried out in depth, aiming to provide a guidance for the evolution of legacy enterprise systems.","PeriodicalId":383682,"journal":{"name":"8th International Conference on Soft Computing, Artificial Intelligence and Applications","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127270858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}