Savaş Yıldırım, Dhanya Jothimani, Can Kavaklioglu, A. Bener
{"title":"Building Domain-Specific Lexicons: An Application to Financial News","authors":"Savaş Yıldırım, Dhanya Jothimani, Can Kavaklioglu, A. Bener","doi":"10.1109/Deep-ML.2019.00013","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00013","url":null,"abstract":"Natural Language Processing (NLP) has gained attention in the recent years. Previous research (such as WordNet and Cyc) has focused on developing an all purpose (generalised) polarised lexicons. However, these lexicons do not provide much information in different domains such as Finance and Medical Sciences. Using these lexicons for text classification could affect the prediction accuracy. Therefore, there is a need for building domain-and context-specific lexicons. To achieve this, in this work, a label based propagation based word embedding algorithm has been proposed to obtain positive and negative lexicons. The proposed algorithm works on the principle of graph theory and word embedding. The proposed algorithm is tested on Dow Jones news wires text feed to classify the Financial news as hot and non-hot. Three classifiers, namely, Logistic Regression, Random Forest and XGBoost, employing polarised lexicons, seed words and random words were used. The performance of classifiers in all cases was evaluated using accuracy. Lexicons generated using the proposed approach were effective in classifying the Financial news articles as hot and non-hot compared to classifiers using seed words and random words. Proposed label propagation with word embedding algorithm generates context-specific lexicons, which aids in helps in better representation of text in natural processing tasks and avoids the problem of dimensionality.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127282795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shahbaz Hassan, Ayesha Irfan, Ali Mirza, I. Siddiqi
{"title":"Cursive Handwritten Text Recognition using Bi-Directional LSTMs: A Case Study on Urdu Handwriting","authors":"Shahbaz Hassan, Ayesha Irfan, Ali Mirza, I. Siddiqi","doi":"10.1109/Deep-ML.2019.00021","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00021","url":null,"abstract":"Recognition of cursive handwritten text is a complex problem due challenges like context sensitive character shapes, non-uniform inter and intra word spacings, complex positioning of dots and diacritics and very low inter class variation among certain classes. This paper presents an effective technique for recognition of cursive handwritten text using Urdu as a case study (though findings can be generalized to other cursive scripts as well). We present an analytical approach based on implicit character segmentation where convolutional neural networks (CNNs) are employed as feature extractors while classification is carried out using a bi-directional Long-Short-Term Memory (LSTM) network. The proposed technique is validated on a dataset of 6000 unique handwritten text lines reporting promising character recognition rates.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"490 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116336170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Visual Similarity Recommendation System using Generative Adversarial Networks","authors":"Betul Ay, G. Aydin, Zeynep Koyun, Mehmet Demir","doi":"10.1109/Deep-ML.2019.00017","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00017","url":null,"abstract":"The goal of content-based recommendation system is to retrieve and rank the list of items that are closest to the query item. Today, almost every e-commerce platform has a recommendation system strategy for products that customers can decide to buy. In this paper we describe our work on creating a Generative Adversarial Network based image retrieval system for e-commerce platforms to retrieve best similar images for a given product image specifically for shoes. We compare state-of-the-art solutions and provide results for the proposed deep learning network on a standard data set.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129377544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DEEP-ML 2019 Organizing Committee","authors":"","doi":"10.1109/deep-ml.2019.00006","DOIUrl":"https://doi.org/10.1109/deep-ml.2019.00006","url":null,"abstract":"","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129409512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Comparison of Clustering Measures on Raw Signals of Welding Production Data","authors":"Selvine G. Mathias, Daniel Grossmann, G. Sequeira","doi":"10.1109/Deep-ML.2019.00019","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00019","url":null,"abstract":"Production data from industries today have a heterogeneous structure, which makes it difficult to analyze and derive some viable inferences. Because of the varying pattern of data, whether labeled or unlabeled, numerical or categorical, any strict standard or optimization procedure using production data is a difficult task. Applying machine learning (ML) algorithms to analyze production data has therefore become an essential requirement for industries. In this study, production data obtained from welding seams is used. We analyze raw signals of electrical current and voltage in the form of arrays obtained from welding processes by applying clustering algorithms. Each process is represented by a group number in the procured data and the corresponding welds of a group are divided into optimal number of clusters on the basis of results given by metrics such as Silhouette Scores and Adjusted Rand's Index. As a validation of the metrics, we use Davies-Bouldin Index to compare and optimize our results. We conclude that a multi-clustering technique can be devised to profile clusters of welding data using only current and voltage signals.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126325863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tamer Karatekin, S. Sancak, G. Celik, S. Topçuoğlu, G. Karatekin, Pınar Kırcı, A. Okatan
{"title":"Interpretable Machine Learning in Healthcare through Generalized Additive Model with Pairwise Interactions (GA2M): Predicting Severe Retinopathy of Prematurity","authors":"Tamer Karatekin, S. Sancak, G. Celik, S. Topçuoğlu, G. Karatekin, Pınar Kırcı, A. Okatan","doi":"10.1109/Deep-ML.2019.00020","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00020","url":null,"abstract":"We have investigated the risk factors that lead to severe retinopathy of prematurity using statistical analysis and logistic regression as a form of generalized additive model (GAM) with pairwise interaction terms (GA2M). In this process, we discuss the trade-off between accuracy and interpretability of these machine learning techniques on clinical data. We also confirm the intuition of expert neonatologists on a few risk factors, such as gender, that were previously deemed as clinically not significant in RoP prediction.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114580565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Deep Learning Based Distributed Smart Surveillance Architecture using Edge and Cloud Computing","authors":"Halil Can Kaskavalci, Sezer Gören","doi":"10.1109/Deep-ML.2019.00009","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00009","url":null,"abstract":"Smart surveillance is getting increasingly popular as technologies become easier to use and cheaper. Traditional surveillance records video footage to a storage device continuously. However, this generates enormous amount of data and reduces the life of the hard drive. Newer devices with Internet connection save footage to the Cloud. This feature comes with bandwidth requirements and extra Cloud costs. In this paper, we propose a deep learning based, distributed, and scalable surveillance architecture using Edge and Cloud computing. Our design reduces both the bandwidth and as well as the Cloud costs significantly by processing footage prior sending to the Cloud.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123183380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative Study of Neural Networks and Decision Trees for Application in Trading Financial Futures","authors":"Saulius Blaziunas, A. Raudys","doi":"10.1109/Deep-ML.2019.00015","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00015","url":null,"abstract":"The aim of this paper is to compare neural networks and decision trees in the ability to trade financial futures. Emphasis is made on correctness of comparison from practitioner's point of view. Contrary to other papers, we implemented actual trading strategy simulation with slippage and commission fees. Many research papers do not pay attention to correctness of experiments from practical point of view. Predictions are incorporated into trading algorithm and trading profits and Sharpe ratio are calculated. Experiments using 30 technical indicators and 45 different futures is repeated 16,895 times and evaluation is made using out of sample data. Results show that both methods have comparable accuracy and perform similarly.","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121018596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DEEP-ML 2019 Program Committee","authors":"Ahmad Javaid","doi":"10.1109/deep-ml.2019.00007","DOIUrl":"https://doi.org/10.1109/deep-ml.2019.00007","url":null,"abstract":"Ahmad Javaid, The University of Toledo, Spain Angelo Genovese, Università Degli Studi Di Milano, Italy Atul Negi, University of Hyderabad, India Bahadorreza Ofoghi, The University of Melbourne, Australia Bruno Veloso, INESC Technology and Science, Porto, Portugal Changwei Hu, Yahoo Research, USA Cihan Varol Sam, Houston State University, USA Daniele Apiletti, Polytechnic University of Turin, Italy Dhiya Al-Jumeily, Liverpool John Moores University, UK Fahimeh Farahnakian, University of Turku, Finland George Vouros, UNIPI, Greece Gulustan DOGAN, Yildiz Technical University, Turkey Huiru (Jane) Zheng, Ulster University, UK Huseyin Seker, The University of Northumbria at Newcastle, UK Jims Marchang, Sheffield Hallam University, UK Jose de Jesus Rubio Avila, SEPI-ESIME UA-IPN, Mexico Lei Zhang, East China Normal University, China Mahardhika Pratama, Nanyang Technology University, Singapore Nizar Bouguila, Concordia University, Canada Noor Akhmad Setiawan, Universitas Gadjah Mada, Indonesia Rabiah Ahmad, Universiti Teknikal Malaysia, Malaysia Santanu Pal, Universität des Saarlandes, Germany Serap SAHIN, Izmir Institute of Technology, Turkey Sevil SEN, Hacettepe University, Turkey Sotiris Kotsiantis, University of Patras, Greece Sung-Bae Cho, Yonsei University, Korea Tomoyuki Uchida, Hiroshima City University, Japan Valentina Emilia Balas, Aurel Vlaicu University of Arad, Romania Rosangela Ballini, University of Campinas, Brazil Yevgeni Bodyanskiy, Kharkiv National University of Radio Electronics, Ukraine Giovanna Castellano, University of Bari, Italy Antonio Dourado, University of Coimbra, Portugal Christian Eitzinger, Profactor GmbH, Steyr-Gleink, Austria Jus Kocijan, University of Nova Gorica, Slovenia Daniel Leite, Federal University of Lavras, Brazil Chin Teng Lin, University of Technology Sydney, Australia Seiichi Ozawa, Kobe University, Japan Daniel Sanchez, University of Granada, Spain Daniela Zaharie, West University of Timisoara, Romania Dhanya Jothimani, Ryerson University, Canada","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124374711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Risk Parity Models for Portfolio Optimization: A Study of the Toronto Stock Exchange","authors":"Dhanya Jothimani, A. Bener","doi":"10.1109/Deep-ML.2019.00014","DOIUrl":"https://doi.org/10.1109/Deep-ML.2019.00014","url":null,"abstract":"It has been more than 60 years since the development of Mean-Variance (MV) framework and inception of Modern Portfolio theory. Despite its wide acceptance and applicability, it suffers from few limitations. This paper addresses two issues of MV framework: (i) estimation errors of mean-variance model, and (ii) instability of covariance matrix. Risk parity models, robust statistics and robust optimization minimize the effects of estimation errors of parameters of MV framework. The paper presents two such risk parity models for portfolio optimization, namely, (a) Hierarchical Risk Parity model based on Historical correlation (HRP-HC), and (b) Hierarchical Risk parity model based on Gerber statistics (HRP-GS). The models are tested and analysed using stocks comprising the TSX complete index for a time period of 10 years ranging from 2007 to 2016. Results suggest that the proposed HRP-GS model outperforms HRP-HC model. This is due to the fact that the HRP-GS model integrates the advantages of a risk parity model (i.e. HRP) and robust statistics (i.e. Gerber statistics).","PeriodicalId":228378,"journal":{"name":"2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122605130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}