Large language models (LLMs) for requirements engineering (RE): A systematic literature review

2025 · arXiv 2509.11446

11 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

LLM-Assisted Empirical Software Engineering: Systematic Literature Review and Research Agenda

cs.SE · 2026-04-29 · unverdicted · novelty 7.0

A systematic review of 50 studies identifies 69 LLM-assisted tasks in empirical software engineering, concentrated in data processing and analysis with gaps in human-centered integration and reproducibility reporting.

R2Code: A Self-Reflective LLM Framework for Requirements-to-Code Traceability

cs.SE · 2026-04-24 · unverdicted · novelty 7.0

R2Code improves requirement-to-code traceability with a bidirectional alignment network, self-reflective consistency verification, and dynamic context-adaptive retrieval, yielding a 7.4% average F1 gain and up to 41.7% lower token use on five datasets.

BT-APE: A Computationally Light Backtracking Approach to Automatic Prompt Engineering for Requirements Classification

cs.SE · 2026-07-01 · unverdicted · novelty 6.0

BT-APE automates prompt engineering for requirements classification using backtracking search and dynamic examples, matching PE2 accuracy while using 72% fewer tokens and 66% less time than that baseline.

Characterizing Datasets for LLM-based Requirements Engineering: A Systematic Mapping Study

cs.SE · 2025-10-21 · unverdicted · novelty 6.0

A systematic mapping study of 45 LLM-based RE papers identifies and characterizes 62 public datasets, revealing imbalances in open-science practices, elicitation support, and socio-technical diversity.

Cluster-Aware Dual-Level Test Specification Generation for Large-Scale Automotive Software Requirements

cs.SE · 2026-06-15 · unverdicted · novelty 5.0

A clustering-based pipeline generates individual and integration-level test specifications from thousands of automotive requirements by grouping embeddings, summarizing clusters, and applying LLM calls with bounded context and standards grounding.

User Reviews as a Source for Usability Requirements: A Precursor Study on Using Large Language Models

cs.SE · 2026-05-12 · conditional · novelty 5.0

LLMs can detect usability content in user reviews with F-scores comparable to humans, though performance depends strongly on prompt design.

Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations

cs.SE · 2026-04-24 · conditional · novelty 5.0

An LLM pipeline with generation-critic feedback reaches 61% accuracy on low-level goal extraction from requirements documents and outperforms standalone few-shot prompting, yet remains best suited as an accelerator for manual work.

Towards an Agentic LLM-based Approach to Requirement Formalization from Unstructured Specifications

cs.SE · 2026-04-20 · unverdicted · novelty 5.0

An agentic LLM pipeline extracts and translates unstructured requirements into syntactically and semantically aligned formal properties, achieving 77.8% accuracy across three scenarios.

Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case

cs.CE · 2026-03-20 · accept · novelty 5.0

Design-OS is a specification-driven five-stage framework for engineering system design that maintains traceability from intent to implementation and supports human-AI collaboration, demonstrated on rotary inverted pendulum control cases.

An empirical study of LoRA-based fine-tuning of large language models for automated test case generation

cs.SE · 2026-04-08 · unverdicted · novelty 4.0

LoRA fine-tuning enables open-source LLMs such as Ministral-8B to generate requirement-based test cases at a level comparable to pre-tuned proprietary GPT-4.1 models.

LLM-Driven Cost-Effective Requirements Change Impact Analysis

cs.SE · 2025-10-31 · unverdicted · novelty 4.0

ProReFiCIA uses LLMs with tailored prompts to identify impacted requirements, achieving 85.7% recall on unseen industrial data while requiring review of only 3% of requirements, rising to 95.7% recall with RAG at 3.6% review cost.
