{"title":"Understanding Relations using Concepts and Semantics","authors":"Jouyon Park, Hyunsouk Cho, Seung-won Hwang","doi":"10.1145/3077240.3077250","DOIUrl":"https://doi.org/10.1145/3077240.3077250","url":null,"abstract":"The Financial Entity Identification and Information Integration (FEIII) task aims at the question of understanding relationships among financial entities and their roles using three sentences extracted from each financial contract containing the target word. FEIII task has two challenges - 1) data sparseness: small training sets (9% of test data) and 2) context sparseness: limited context (three sentences). Existing statistical approaches, such as Bayes and TF-IDF, cannot evaluate the imporatance of words unobservged in training data, which is vulnerable to the above challenges. We overcome each challenge by considering 1) the concepts of words from knowledge bases (Probase) in addition to the words themselves (conceptual feature) and 2) word semantics from distributed representations such as word2vec (semantic feature). We empirically evaluate the proposed classification model on the four-class classification (highly relevant, relevant, neutral, and irrelevant), and show that the proposed model increases 18% of F1-score compared to the statistical baselines.","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"128 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127395952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Roman, B. Ulicny, Yi Du, Srijith Poduval, A. Ko
{"title":"Thomson Reuters' Solution for Triple Ranking in the FEIII 2017 Challenge","authors":"E. Roman, B. Ulicny, Yi Du, Srijith Poduval, A. Ko","doi":"10.1145/3077240.3077253","DOIUrl":"https://doi.org/10.1145/3077240.3077253","url":null,"abstract":"In this paper we describe our approach to the triple ranking task of the FEIII 2017 challenge. Our method leveraged different machine learning classifiers in an ensemble as well as Thomson Reuters knowledge bases and information services to bring in external world knowledge of mentioned entities and extract information from the contextual sentences. Internal evaluation of our method was done by computing the Normalized Discounted Cumulative Gain (NDCG) as tracked by the challenge and classification accuracy. The official FEIII Challenge evaluation showed our system performed highly in single ranking of all triples, placing in 2nd or 3rd place out of 17 participants for 4 of 6 scoring variants; the system also performed above average in per role ranking for 4 of 6 average role NDCG scoring variants.","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130590993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Raschid, J. Langsam, Tharindu Pieris, Anushka Bandara
{"title":"Tensor Factors to Monitor the Co-Movement of Equity Prices","authors":"L. Raschid, J. Langsam, Tharindu Pieris, Anushka Bandara","doi":"10.1145/3077240.3077242","DOIUrl":"https://doi.org/10.1145/3077240.3077242","url":null,"abstract":"We identify a set of features that are related to extremes of price changes of individual equities. Our hypothesis is that these extreme features may be used to isolate co-movements of prices for groups of equities, reflecting systematic risk. The equities are classified within industry sectors and we create a three mode tensor to represent the dataset; the dimensions of the three mode tensor correspond to the equity, the industry sector and the day on which the feature occurred. We use a method for non-negative tensor factorization (NOTF) to identify factors or communities that are composed of multiple equities, and / or industry sectors. Our preliminary results indicate that our NOTF approach has the potential to identify such communities of price related features that may experience co-movement across industry sectors and temporal intervals.","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114375981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Balance Sheet Driven Probability Factorization for Inferring Bank Holdings: Extended Abstract","authors":"Shawn Mankad, Celso Brunetti, J. Harris","doi":"10.1145/3077240.3077243","DOIUrl":"https://doi.org/10.1145/3077240.3077243","url":null,"abstract":" Assistant Professor of Operations, Technology and Information Management, Samuel Curtis Johnson Graduate School of Management, Cornell University, 2015 – Present o Graduate Field Member in Statistics, 2017 – Present Assistant Professor of Business Analytics, Robert H. Smith School of Business, University of Maryland, 2013 – 2015 o Affiliate Faculty of Applied Mathematics and Scientific Computation, University of Maryland, 2014 – 2015 Federal Contractor, the U.S. Commodity Futures Trading Commission, 2009 – 2013 Dissertation Intern, Federal Reserve Board of Governors, Summer 2012","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"213 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124188821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting Role Relevance with Minimal Domain Expertise in a Financial Domain","authors":"M. Kejriwal","doi":"10.1145/3077240.3077249","DOIUrl":"https://doi.org/10.1145/3077240.3077249","url":null,"abstract":"Word embeddings have made enormous inroads in recent years in a wide variety of text mining applications. In this paper, we explore a word embedding-based architecture for predicting the relevance of a role between two financial entities within the context of natural language sentences. In this extended abstract, we propose a pooled approach that uses a collection of sentences to train word embeddings using the skip-gram word2vec architecture. We use the word embeddings to obtain context vectors that are assigned one or more labels based on manual annotations. We train a machine learning classifier using the labeled context vectors, and use the trained classifier to predict contextual role relevance on test data. Our approach serves as a good minimal-expertise baseline for the task as it is simple and intuitive, uses open-source modules, requires little feature crafting effort and performs well across roles.","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116719722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","authors":"","doi":"10.1145/3077240","DOIUrl":"https://doi.org/10.1145/3077240","url":null,"abstract":"","PeriodicalId":326424,"journal":{"name":"Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116451991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}