Software Engineering

Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications
Tzafrir Rehan
6 views
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
librarian
10 views
Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering
Joydeep
402 views
Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?
librarian
430 views
Mutation Testing framework for Machine Learning
rsingh80
542 views
Patched RTC: evaluating LLMs for diverse software development tasks
Asankhaya Sharma
587 views
Patched MOA: optimizing inference for diverse software development tasks
Asankhaya Sharma
666 views
Pitfalls in Language Models for Code Intelligence: A Taxonomy and Survey
Xinyu She
723 views
Runtime Resolution of Feature Interactions through Adaptive Requirement Weakening
Simon Chu
667 views
Demystifying Compiler Unstable Feature Usage and Impacts in the Rust Ecosystem
Chenghao Li
653 views
Towards the decentralized coordination of multiple self-adaptive systems
Paul-Andrei Dragan
697 views
Variance of ML-based software fault predictors: are we really improving fault prediction?
Domenic Bubel
848 views
Exploring Behaviours of RESTful APIs in an Industrial Setting
Stefan Karlsson
667 views
Evaluating Pre-trained Language Models for Repairing API Misuses
Ting Zhang
661 views
Formal Runtime Error Detection During Development in the Automotive Industry
Jesko Hecking-Harbusch
774 views
Exploring Large Language Models for Code Explanation
Paheli Bhattacharya
630 views
Leveraging Deep Learning for Abstractive Code Summarization of Unofficial Documentation
AmirHossein Naghshzan
749 views
Vision-Based Mobile App GUI Testing: A Survey
Shengcheng Yu
657 views
Using ChatGPT throughout the Software Development Life Cycle by Novice Developers
Muhammad Waseem
737 views
Less is More? An Empirical Study on Configuration Issues in Python PyPI Ecosystem
Yun Peng
661 views
Unleashing the Power of Clippy in Real-World Rust Projects
Chunmiao Li
751 views
The Effects of Computational Resources on Flaky Tests
Denini Silva
641 views
A comprehensible analysis of the efficacy of Ensemble Models for Bug Prediction
Ingrid Marçal
647 views
Large Language Models for Code Analysis: Do LLMs Really Do Their Job?
Chongzhou Fang
765 views
SURE: A Visualized Failure Indexing Approach using Program Memory Spectrum
Yi Song
639 views
Automated Repair of Declarative Software Specifications in the Era of Large Language Models
Md Rashedul Hasan
613 views
The Software Heritage Open Science Ecosystem
Roberto Di Cosmo
655 views
A Critical Review of Large Language Model on Software Engineering: An Example from ChatGPT and Automated Program Repair
Quanjun Zhang
742 views
Qualitative Analysis for Validating IEC 62443-4-2 Requirements in DevSecOps
John Appleseed
622 views
On Using GUI Interaction Data to Improve Text Retrieval-based Bug Localization
Junayed Mahmud
707 views
Rethinking Negative Pairs in Code Search
Haochen Li
649 views
MCRepair: Multi-Chunk Program Repair via Patch Optimization with Buggy Block
Jisung Kim
701 views