Semantic Frame Embeddings for Detecting Relations between Software Requirements

Waad Alhoshan, R. Batista-Navarro, Liping Zhao
{"title":"Semantic Frame Embeddings for Detecting Relations between Software Requirements","authors":"Waad Alhoshan, R. Batista-Navarro, Liping Zhao","doi":"10.18653/v1/W19-0606","DOIUrl":null,"url":null,"abstract":"The early phases of requirements engineering (RE) deal with a vast amount of software requirements (i.e., requirements that define characteristics of software systems), which are typically expressed in natural language. Analysing such unstructured requirements, usually obtained from users’ inputs, is considered a challenging task due to the inherent ambiguity and inconsistency of natural language. To support such a task, methods based on natural language processing (NLP) can be employed. One of the more recent advances in NLP is the use of word embeddings for capturing contextual information, which can then be applied in word analogy tasks. In this paper, we describe a new resource, i.e., embedding-based representations of semantic frames in FrameNet, which was developed to support the detection of relations between software requirements. Our embeddings, which encapsulate contextual information at the semantic frame level, were trained on a large corpus of requirements (i.e., a collection of more than three million mobile application reviews). The similarity between these frame embeddings is then used as a basis for detecting semantic relatedness between software requirements. Compared with existing resources underpinned by word-level embeddings alone, and frame embeddings built upon pre-trained vectors, our proposed frame embeddings obtained better performance against judgements of an RE expert. These encouraging results demonstrate the strong potential of the resource in supporting RE analysis tasks (e.g., traceability), which we plan to investigate as part of our future work.","PeriodicalId":171876,"journal":{"name":"Proceedings of the 13th International Conference on Computational Semantics - Student Papers","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th International Conference on Computational Semantics - Student Papers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/W19-0606","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

The early phases of requirements engineering (RE) deal with a vast amount of software requirements (i.e., requirements that define characteristics of software systems), which are typically expressed in natural language. Analysing such unstructured requirements, usually obtained from users’ inputs, is considered a challenging task due to the inherent ambiguity and inconsistency of natural language. To support such a task, methods based on natural language processing (NLP) can be employed. One of the more recent advances in NLP is the use of word embeddings for capturing contextual information, which can then be applied in word analogy tasks. In this paper, we describe a new resource, i.e., embedding-based representations of semantic frames in FrameNet, which was developed to support the detection of relations between software requirements. Our embeddings, which encapsulate contextual information at the semantic frame level, were trained on a large corpus of requirements (i.e., a collection of more than three million mobile application reviews). The similarity between these frame embeddings is then used as a basis for detecting semantic relatedness between software requirements. Compared with existing resources underpinned by word-level embeddings alone, and frame embeddings built upon pre-trained vectors, our proposed frame embeddings obtained better performance against judgements of an RE expert. These encouraging results demonstrate the strong potential of the resource in supporting RE analysis tasks (e.g., traceability), which we plan to investigate as part of our future work.
用于检测软件需求之间关系的语义框架嵌入
需求工程(RE)的早期阶段处理大量的软件需求(例如,定义软件系统特征的需求),这些需求通常用自然语言表示。由于自然语言固有的模糊性和不一致性,分析这种通常从用户输入中获得的非结构化需求被认为是一项具有挑战性的任务。为了支持这样的任务,可以采用基于自然语言处理(NLP)的方法。NLP的最新进展之一是使用词嵌入来捕获上下文信息,然后可以将其应用于词类比任务。在本文中,我们描述了一种新的资源,即框架中基于嵌入的语义框架表示,它是为了支持软件需求之间关系的检测而开发的。我们的嵌入在语义框架级别封装上下文信息,是在大量需求语料库(即超过300万个移动应用程序评论的集合)上进行训练的。然后,这些框架嵌入之间的相似性被用作检测软件需求之间语义相关性的基础。与单独基于词级嵌入的现有资源和基于预训练向量的框架嵌入相比,我们提出的框架嵌入在可重构专家的判断下获得了更好的性能。这些令人鼓舞的结果证明了该资源在支持RE分析任务(例如,可追溯性)方面的强大潜力,我们计划将其作为我们未来工作的一部分进行调查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信