Improving Requirements Completeness: Automated Assistance through Large Language Models

Luitel, Dipeeka; Hassani, Shabnam; Sabetzadeh, Mehrdad

Computer Science > Software Engineering

arXiv:2308.03784 (cs)

[Submitted on 3 Aug 2023 (v1), last revised 14 Feb 2024 (this version, v2)]

Title:Improving Requirements Completeness: Automated Assistance through Large Language Models

Authors:Dipeeka Luitel, Shabnam Hassani, Mehrdad Sabetzadeh

View PDF HTML (experimental)

Abstract:Natural language (NL) is arguably the most prevalent medium for expressing systems and software requirements. Detecting incompleteness in NL requirements is a major challenge. One approach to identify incompleteness is to compare requirements with external sources. Given the rise of large language models (LLMs), an interesting question arises: Are LLMs useful external sources of knowledge for detecting potential incompleteness in NL requirements? This article explores this question by utilizing BERT. Specifically, we employ BERT's masked language model (MLM) to generate contextualized predictions for filling masked slots in requirements. To simulate incompleteness, we withhold content from the requirements and assess BERT's ability to predict terminology that is present in the withheld content but absent in the disclosed content. BERT can produce multiple predictions per mask. Our first contribution is determining the optimal number of predictions per mask, striking a balance between effectively identifying omissions in requirements and mitigating noise present in the predictions. Our second contribution involves designing a machine learning-based filter to post-process BERT's predictions and further reduce noise. We conduct an empirical evaluation using 40 requirements specifications from the PURE dataset. Our findings indicate that: (1) BERT's predictions effectively highlight terminology that is missing from requirements, (2) BERT outperforms simpler baselines in identifying relevant yet missing terminology, and (3) our filter significantly reduces noise in the predictions, enhancing BERT's effectiveness as a tool for completeness checking of requirements.

Comments:	This article has been accepted at the Requirements Engineering Journal (REJ), REFSQ'23 Special Issue. arXiv admin note: text overlap with arXiv:2302.04792
Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2308.03784 [cs.SE]
	(or arXiv:2308.03784v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2308.03784

Submission history

From: Mehrdad Sabetzadeh [view email]
[v1] Thu, 3 Aug 2023 19:49:18 UTC (3,677 KB)
[v2] Wed, 14 Feb 2024 19:58:49 UTC (3,458 KB)

Computer Science > Software Engineering

Title:Improving Requirements Completeness: Automated Assistance through Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Improving Requirements Completeness: Automated Assistance through Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators