Devon Hjelm in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Agentic AI in the Software Development Lifecycle: Architecture, Empirical Evidence, and the Reshaping of Software Engineering

Happy Bhati · 2026

The arrival of large language models (LLMs) capable of multi-step reasoning, tool use, and long-horizon planning has produced a qualitative shift in software engineering. Where earlier code-completion…

Read Paper →

Computer Science Preprint PDF DOI

Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows

Syed Muhammad Ashhar Shah, Sehrish Habib, Muizz Hussain, Maryam Abdul Ghafoor, Abdul Ali Bangash · 2026

Continuous Integration and Deployment (CI/CD) workflows are central to modern software delivery, yet the reliability of agentic AI bots operating within these workflows remain underexplored. Using pul…

Read Paper →

Computer Science Preprint PDF DOI

Strategies in Sabotage Games: Temporal and Epistemic Perspectives

Nina Gierasimczuk, Katrine B.P. Thoft · 2026

Sabotage games are played on a dynamic graph, in which one agent, called a runner, attempts to reach a goal state, while being obstructed by a demon who at each round removes an edge from the graph. S…

Read Paper →

Computer Science Preprint PDF DOI

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Razvan Mihai Popescu, David Gros, Andrei Botocan, Rahul Pandita, Prem Devanbu, Maliheh Izadi · 2026

The rise of large language models for code has reshaped software development. Autonomous coding agents, able to create branches, open pull requests, and perform code reviews, now actively contribute t…

Read Paper →

Computer Science Preprint PDF DOI

Code Review Agent Benchmark

Yuntong Zhang, Zhiyuan Pan, Imam Nur Bani Yusuf, Haifeng Ruan, Ridwan Shariffdeen, Abhik Roychoudhury · 2026

Software engineering agents have shown significant promise in writing code. As AI agents permeate code writing, and generate huge volumes of code automatically -- the matter of code quality comes fron…

Read Paper →

Computer Science Preprint PDF DOI

AIDev: Studying AI Coding Agents on GitHub

Hao Li, Haoxiang Zhang, Ahmed E. Hassan · 2026

AI coding agents are rapidly transforming software engineering by performing tasks such as feature development, debugging, and testing. Despite their growing impact, the research community lacks a com…

Read Paper →

Computer Science Preprint PDF DOI

Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance

Giovanni Pinna, Jingzhi Gong, David Williams, Federica Sarro · 2026

The rapid adoption of AI-powered coding assistants is transforming software development practices, yet systematic comparisons of their effectiveness across different task types and over time remain li…

Read Paper →

Computer Science Preprint PDF DOI

Why Agentic-PRs Get Rejected: A Comparative Study of Coding Agents

Sota Nakashima, Yuta Ishimoto, Masanari Kondo, Shane Mclntosh, Yasutaka Kamei · 2026

Agentic coding -- software development workflows in which autonomous coding agents plan, implement, and submit code changes with minimal human involvement -- is rapidly gaining traction. Prior work ha…

Read Paper →

Computer Science Preprint PDF DOI

Why Are AI Agent Involved Pull Requests (Fix-Related) Remain Unmerged? An Empirical Study

Khairul Alam, Saikat Mondal, Banani Roy · 2026

Autonomous coding agents (e.g., OpenAI Codex, Devin, GitHub Copilot) are increasingly used to generate fix-related pull requests (PRs) in real world software repositories. However, their practical eff…

Read Paper →

Computer Science Preprint PDF DOI

How do Agents Refactor: An Empirical Study

Lukas Ottenhof, Daniel Penner, Abram Hindle, Thibaud Lutellier · 2026

Software development agents such as Claude Code, GitHub Copilot, Cursor Agent, Devin, and OpenAI Codex are being increasingly integrated into developer workflows. While prior work has evaluated agent …

Read Paper →

Computer Science Preprint PDF DOI

HELM: A Human-Centered Evaluation Framework for LLM-Powered Recommender Systems

Sushant Mehta · 2026

The integration of Large Language Models (LLMs) into recommendation systems has introduced unprecedented capabilities for natural language understanding, explanation generation, and conversational int…

Read Paper →

Computer Science Preprint PDF DOI

When AI Agents Touch CI/CD Configurations: Frequency and Success

Taher A. Ghaleb · 2026

AI agents are increasingly used in software development, yet their interaction with CI/CD configurations is not well studied. We analyze 8,031 agentic pull requests (PRs) from 1,605 GitHub repositorie…

Read Paper →

Computer Science Preprint PDF DOI

Fingerprinting AI Coding Agents on GitHub

Taher A. Ghaleb · 2026

AI coding agents are reshaping software development through both autonomous and human-mediated pull requests (PRs). When developers use AI agents to generate code under their own accounts, code author…

Read Paper →

Computer Science Preprint PDF DOI

On Angels and Demons: Strategic (De)Construction of Dynamic Models

Davide Catta, Rustam Galimullin, Munyque Mittelmann · 2026

In recent years, there has been growing interest in logics that formalise strategic reasoning about agents capable of modifying the structure of a given model. This line of research has been motivated…

Read Paper →

Computer Science Preprint PDF DOI

Efficient Multi-Model Orchestration for Self-Hosted Large Language Models

Bhanu Prakash Vangala, Tanu Malik · 2025

Self-hosting large language models (LLMs) is increasingly appealing for organizations seeking privacy, cost control, and customization. Yet deploying and maintaining in-house models poses challenges i…

Read Paper →

Computer Science Preprint PDF DOI

Reliablocks: Developing Reliability Scores for Optimistic Rollups

Souradeep Das, Ethan Lam, Varun Vaidya, Sanjay Amirthraj · 2025

Introducing Reliablocks, an on-chain reliability index for non-finalized blocks in Optimistic Rollups. This was built during the EigenLayer Infinite Hackathon at the Infinite Hacker House at DevCon 20…

Read Paper →

Computer Science Preprint PDF DOI

Serverless GPU Architecture for Enterprise HR Analytics: A Production-Scale BDaaS Implementation

Guilin Zhang, Wulan Guo, Ziqi Tan, Srinivas Vippagunta, Suchitra Raman, Shreeshankar Chatterjee, Ju Lin, Shang Liu, Mary Schladenhauffen, Jeffrey Luo, Hailong Jiang · 2025

Industrial and government organizations increasingly depend on data-driven analytics for workforce, finance, and regulated decision processes, where timeliness, cost efficiency, and compliance are cri…

Read Paper →

Computer Science Preprint PDF DOI

CXLAimPod: CXL Memory is all you need in AI era

Yiwei Yang, Yusheng Zheng, Yiqi Chen, Zheng Liang, Kexin Chu, Zhe Zhou, Andi Quinn, Wei Zhang · 2025

The proliferation of data-intensive applications, ranging from large language models to key-value stores, increasingly stresses memory systems with mixed read-write access patterns. Traditional half-d…

Read Paper →

Computer Science Preprint PDF DOI

The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering

Hao Li, Haoxiang Zhang, Ahmed E. Hassan · 2025

The future of software engineering--SE 3.0--is unfolding with the rise of AI teammates: autonomous, goal-driven systems collaborating with human developers. Among these, autonomous coding agents are e…

Read Paper →

Computer Science Preprint PDF DOI

Plug. Play. Persist. Inside a Ready-to-Go Havoc C2 Infrastructure

Alessio Di Santo · 2025

This analysis focuses on a single Azure-hosted Virtual Machine at 52.230.23.114 that the adversary converted into an all-in-one delivery, staging and Command-and-Control node. The host advertises an o…

Read Paper →

Browse Research Papers

Agentic AI in the Software Development Lifecycle: Architecture, Empirical Evidence, and the Reshaping of Software Engineering

Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows

Strategies in Sabotage Games: Temporal and Epistemic Perspectives

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Code Review Agent Benchmark

AIDev: Studying AI Coding Agents on GitHub

Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance

Why Agentic-PRs Get Rejected: A Comparative Study of Coding Agents

Why Are AI Agent Involved Pull Requests (Fix-Related) Remain Unmerged? An Empirical Study

How do Agents Refactor: An Empirical Study

HELM: A Human-Centered Evaluation Framework for LLM-Powered Recommender Systems

When AI Agents Touch CI/CD Configurations: Frequency and Success

Fingerprinting AI Coding Agents on GitHub

On Angels and Demons: Strategic (De)Construction of Dynamic Models

Efficient Multi-Model Orchestration for Self-Hosted Large Language Models

Reliablocks: Developing Reliability Scores for Optimistic Rollups

Serverless GPU Architecture for Enterprise HR Analytics: A Production-Scale BDaaS Implementation

CXLAimPod: CXL Memory is all you need in AI era

The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering

Plug. Play. Persist. Inside a Ready-to-Go Havoc C2 Infrastructure

Browse by Category

Research Type

Publish Your Research