Paper Title

Automated Generating Natural Language Requirements based on Domain Ontology

Authors

Zhao, Ziyan; Zhang, Li; Gao, Xiaoyun; Lian, Xiaoli; Lv, Heyang; Shi, Lin

Abstract

Software requirements specifications are critical to the whole software life cycle. Today, writing them still depends primarily on human effort. Although many studies have sought to speed up the process by proposing advanced elicitation and analysis techniques, it remains a time-consuming and error-prone task that must take domain knowledge and business information into account. In this paper, we propose an approach named ReqGen, which provides recommendations by automatically generating natural language requirements specifications from given keywords. ReqGen consists of three key steps. First, keyword-oriented knowledge is selected from a domain ontology and injected into the base Unified pre-trained Language Model (UniLM) for domain fine-tuning. Second, a copy mechanism is integrated to ensure that the keywords occur in the generated statements. Finally, a requirement-syntax-constrained decoding step is designed to narrow the semantic and syntactic distance between the candidate and reference specifications. Experiments on two public datasets from different groups and domains show that ReqGen outperforms six popular natural language generation approaches with respect to the hard constraint of keyword (phrase) inclusion, BLEU, ROUGE, and syntax compliance. We believe that ReqGen can improve the efficiency and intelligence of specifying software requirements.
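To make the keyword (phrase) inclusion constraint concrete, here is a minimal Python sketch. It is an illustration only, not the paper's implementation: the function names `keyword_inclusion_rate` and `pick_candidate` are hypothetical, and plain case-insensitive substring matching is an assumption, since the abstract does not say how inclusion is measured.

```python
def keyword_inclusion_rate(statement: str, keywords: list[str]) -> float:
    """Fraction of keywords (or phrases) found in the statement.

    Assumption: case-insensitive substring matching after collapsing
    whitespace. This is a simplification, so "log" also matches "login".
    """
    if not keywords:
        return 1.0  # nothing to include, so the constraint holds trivially
    text = " ".join(statement.lower().split())
    hits = sum(1 for kw in keywords if " ".join(kw.lower().split()) in text)
    return hits / len(keywords)


def pick_candidate(candidates: list[str], keywords: list[str]) -> str:
    """Return the candidate covering the most keywords (earliest wins ties)."""
    return max(candidates, key=lambda c: keyword_inclusion_rate(c, keywords))
```

A rate of 1.0 means every given keyword appears in the generated statement. A real evaluation would likely need token-level or lemma-level matching rather than raw substrings.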
