Pol Schumacher, Mirjam Minor, Eric Schulte-Zurhausen
{"title":"Extracting and enriching workflows from text","authors":"Pol Schumacher, Mirjam Minor, Eric Schulte-Zurhausen","doi":"10.1109/IRI.2013.6642484","DOIUrl":null,"url":null,"abstract":"This paper is on a workflow extraction framework which allows to derive a formal representation based on workflows from textual descriptions of instructions, for instance, of aircraft repair procedures from a maintenance manual. The framework applies a pipes-and-filters architecture and uses NLP (Natural Language Processing) tools to perform information extraction steps automatically. In detail, the paper presents on the step of anaphora resolution to enrich the workflow extracted so far. We introduce a lexical approach and two further approaches based on a set of association rules which are created during a statistical analysis of a corpus of workflows. The results of the approaches are compared to each other. For the evaluation, we use 37 workflows which have been created by a human expert.","PeriodicalId":418492,"journal":{"name":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","volume":"163 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 14th International Conference on Information Reuse & Integration (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2013.6642484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31
Abstract
This paper is on a workflow extraction framework which allows to derive a formal representation based on workflows from textual descriptions of instructions, for instance, of aircraft repair procedures from a maintenance manual. The framework applies a pipes-and-filters architecture and uses NLP (Natural Language Processing) tools to perform information extraction steps automatically. In detail, the paper presents on the step of anaphora resolution to enrich the workflow extracted so far. We introduce a lexical approach and two further approaches based on a set of association rules which are created during a statistical analysis of a corpus of workflows. The results of the approaches are compared to each other. For the evaluation, we use 37 workflows which have been created by a human expert.