Software Engineering

Mining Subscenario Refactoring Opportunities in Behaviour-Driven Software Test Suites: ML Classifiers and LLM-Judge Baselines
Avatar
Ali Hassaan Mughal
53 views
AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development
Avatar
librarian
64 views
Finding Duplicates in 1.1M BDD Steps: cukereuse, a Paraphrase-Robust Static Detector for Cucumber and Gherkin
Avatar
Ali Hassaan Mughal
89 views
CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents
Avatar
librarian
114 views
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications
Avatar
Tzafrir Rehan
133 views
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
Avatar
librarian
123 views
Rethinking Autonomy: Preventing Failures in AI-Driven Software
  Engineering
Avatar
Joydeep
511 views
Are Large Language Models Robust in Understanding Code Against
  Semantics-Preserving Mutations?
Avatar
librarian
529 views
Mutation Testing framework for Machine Learning
Avatar
rsingh80
716 views
Patched RTC: evaluating LLMs for diverse software development tasks
Avatar
Asankhaya Sharma
718 views
Patched MOA: optimizing inference for diverse software development tasks
Avatar
Asankhaya Sharma
802 views
Pitfalls in Language Models for Code Intelligence: A Taxonomy and Survey
Avatar
Xinyu She
867 views
Runtime Resolution of Feature Interactions through Adaptive Requirement
  Weakening
Avatar
Simon Chu
811 views
Demystifying Compiler Unstable Feature Usage and Impacts in the Rust
  Ecosystem
Avatar
Chenghao Li
805 views
Towards the decentralized coordination of multiple self-adaptive systems
Avatar
Paul-Andrei Dragan
843 views
Variance of ML-based software fault predictors: are we really improving
  fault prediction?
Avatar
Domenic Bubel
969 views
Exploring Behaviours of RESTful APIs in an Industrial Setting
Avatar
Stefan Karlsson
807 views
Evaluating Pre-trained Language Models for Repairing API Misuses
Avatar
Ting Zhang
814 views
Formal Runtime Error Detection During Development in the Automotive
  Industry
Avatar
Jesko Hecking-Harbusch
929 views
Exploring Large Language Models for Code Explanation
Avatar
Paheli Bhattacharya
782 views
Leveraging Deep Learning for Abstractive Code Summarization of
  Unofficial Documentation
Avatar
AmirHossein Naghshzan
895 views
Vision-Based Mobile App GUI Testing: A Survey
Avatar
Shengcheng Yu
803 views
Using ChatGPT throughout the Software Development Life Cycle by Novice
  Developers
Avatar
Muhammad Waseem
874 views
Less is More? An Empirical Study on Configuration Issues in Python PyPI
  Ecosystem
Avatar
Yun Peng
812 views
Unleashing the Power of Clippy in Real-World Rust Projects
Avatar
Chunmiao Li
890 views
The Effects of Computational Resources on Flaky Tests
Avatar
Denini Silva
776 views
A comprehensible analysis of the efficacy of Ensemble Models for Bug
  Prediction
Avatar
Ingrid Marc¸al
786 views
Large Language Models for Code Analysis: Do LLMs Really Do Their Job?
Avatar
Chongzhou Fang
955 views
SURE: A Visualized Failure Indexing Approach using Program Memory
  Spectrum
Avatar
Yi Song
758 views
Automated Repair of Declarative Software Specifications in the Era of
  Large Language Models
Avatar
Md Rashedul Hasan
742 views
The Software Heritage Open Science Ecosystem
Avatar
Roberto Di Cosmo
768 views
A Critical Review of Large Language Model on Software Engineering: An
  Example from ChatGPT and Automated Program Repair
Avatar
Quanjun Zhang
876 views