Automated Handling of Anaphoric Ambiguity in Requirements: A Multi-solution Study

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) Pub Date : 2022-05-01 DOI:10.1145/3510003.3510157

Saad Ezzini, Sallam Abualhaija, Chetan Arora, M. Sabetzadeh

{"title":"Automated Handling of Anaphoric Ambiguity in Requirements: A Multi-solution Study","authors":"Saad Ezzini, Sallam Abualhaija, Chetan Arora, M. Sabetzadeh","doi":"10.1145/3510003.3510157","DOIUrl":null,"url":null,"abstract":"Ambiguity is a pervasive issue in natural-language requirements. A common source of ambiguity in requirements is when a pronoun is anaphoric. In requirements engineering, anaphoric ambiguity occurs when a pronoun can plausibly refer to different entities and thus be interpreted differently by different readers. In this paper, we develop an accurate and practical automated approach for handling anaphoric ambiguity in requirements, addressing both ambiguity detection and anaphora interpretation. In view of the multiple competing natural language processing (NLP) and machine learning (ML) technologies that one can utilize, we simultaneously pursue six alternative solutions, empirically assessing each using a col-lection of ≈1,350 industrial requirements. The alternative solution strategies that we consider are natural choices induced by the existing technologies; these choices frequently arise in other automation tasks involving natural-language requirements. A side-by-side em-pirical examination of these choices helps develop insights about the usefulness of different state-of-the-art NLP and ML technologies for addressing requirements engineering problems. For the ambigu-ity detection task, we observe that supervised ML outperforms both a large-scale language model, SpanBERT (a variant of BERT), as well as a solution assembled from off-the-shelf NLP coreference re-solvers. In contrast, for anaphora interpretation, SpanBERT yields the most accurate solution. In our evaluation, (1) the best solution for anaphoric ambiguity detection has an average precision of ≈60% and a recall of 100%, and (2) the best solution for anaphora interpretation (resolution) has an average success rate of ≈98%.","PeriodicalId":202896,"journal":{"name":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","volume":"19 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510003.3510157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

Abstract

Ambiguity is a pervasive issue in natural-language requirements. A common source of ambiguity in requirements is when a pronoun is anaphoric. In requirements engineering, anaphoric ambiguity occurs when a pronoun can plausibly refer to different entities and thus be interpreted differently by different readers. In this paper, we develop an accurate and practical automated approach for handling anaphoric ambiguity in requirements, addressing both ambiguity detection and anaphora interpretation. In view of the multiple competing natural language processing (NLP) and machine learning (ML) technologies that one can utilize, we simultaneously pursue six alternative solutions, empirically assessing each using a col-lection of ≈1,350 industrial requirements. The alternative solution strategies that we consider are natural choices induced by the existing technologies; these choices frequently arise in other automation tasks involving natural-language requirements. A side-by-side em-pirical examination of these choices helps develop insights about the usefulness of different state-of-the-art NLP and ML technologies for addressing requirements engineering problems. For the ambigu-ity detection task, we observe that supervised ML outperforms both a large-scale language model, SpanBERT (a variant of BERT), as well as a solution assembled from off-the-shelf NLP coreference re-solvers. In contrast, for anaphora interpretation, SpanBERT yields the most accurate solution. In our evaluation, (1) the best solution for anaphoric ambiguity detection has an average precision of ≈60% and a recall of 100%, and (2) the best solution for anaphora interpretation (resolution) has an average success rate of ≈98%.

查看原文本刊更多论文

需求中回指歧义的自动处理:一个多解决方案的研究

歧义是自然语言需求中普遍存在的问题。需求中歧义的一个常见来源是代词的回指。在需求工程中，当一个代词可以合理地指代不同的实体，从而被不同的读者以不同的方式解释时，就会出现回指歧义。在本文中，我们开发了一种准确实用的自动化方法来处理需求中的回指歧义，同时解决了歧义检测和回指解释。鉴于可以利用的多种相互竞争的自然语言处理(NLP)和机器学习(ML)技术，我们同时追求六种替代解决方案，使用≈1,350个工业需求的集合对每种解决方案进行经验评估。我们考虑的备选解决方案策略是由现有技术诱导的自然选择;这些选择经常出现在涉及自然语言需求的其他自动化任务中。对这些选择进行并行的实证检查有助于深入了解不同的最先进的NLP和ML技术对解决需求工程问题的有用性。对于歧义检测任务，我们观察到监督ML优于大规模语言模型SpanBERT (BERT的一种变体)以及由现成的NLP共同参考重新求解器组装的解决方案。相比之下，对于回指解释，SpanBERT给出了最准确的解决方案。在我们的评估中，(1)回指歧义检测的最佳解决方案平均精度≈60%，召回率为100%，(2)回指解释(分辨率)的最佳解决方案平均成功率≈98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量