Gerard Briscoe — Research Repository

AI & Data Science Preprint

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

Feiyu Wu, Xu Zheng, Zhuocheng Wang, Yi ming Dai, Hui Li · 2026

Large language models (LLMs) make reward design in reinforcement learning substantially more scalable, but generated rewards are not automatically reliable training objectives. Existing work has focus…

AI & Data Science Preprint

Calibrating Attribution Proxies for Reward Allocation in Participatory Weather Sensing

Mark C. Ballandies, Michael T. C. Chiu, Claudio J. Tessone · 2026

Large-scale IoT weather sensing networks require incentive mechanisms to sustain participation, yet determining how much value individual data contributions bring to the network remains an open proble…

Computer Science Preprint

The Grand Software Supply Chain of AI Systems

Carmine Cesarano, Martin Monperrus · 2026

AI systems rest on software with low integrity mechanisms, leaving AI systems exposed across every stage from data acquisition to final inference. This paper makes the AI supply chain a first-class ob…

Computer Science Preprint

SST-Guard: Detecting and Characterizing Server-Side Google Analytics in the Wild

Muhammad Jazlan, Alexander Gamero-Garrido, Zubair Shafiq, Yash Vekaria · 2026

As web browsers increasingly restrict client-side tracking, the web tracking ecosystem is shifting from client-side to server-side tracking (SST). In SST, the browser sends tracking requests to an int…

AI & Data Science Preprint

Debiasing Reward Models via Causally Motivated Inference-Time Intervention

Kazutoshi Shinoda, Kosuke Nishida, Kyosuke Nishida · 2026

Reward models (RMs) play a central role in aligning large language models (LLMs) with human preferences. However, RMs are often sensitive to spurious features such as response length. Existing inferen…

AI & Data Science Preprint

From Coarse to Fine: Benchmarking and Reward Modeling for Writing-Centric Generation Tasks

Qingyu Ren, Tianjun Pan, Xingzhou Chen, Xuhong Wang · 2026

Large language models have achieved remarkable progress in text generation but still struggle with generative writing tasks. In terms of evaluation, existing benchmarks evaluate writing reward models …

Computer Science Preprint

Now's the Time: Computer Science Must Evolve to Emphasize Software and Systems Engineering with Artificial Intelligence (AI)

Chandra N. Sekharan, George K. Thiruvathukal · 2026

Computer science (CS) education needs to evolve to support software and artificial intelligence (AI) systems engineering, and it needs to happen now -- precisely because the core intellectual contribu…

AI & Data Science Preprint

How to Guide Your Flow: Few-Step Alignment via Flow Map Reward Guidance

Jerry Y. Huang, Justin Lin, Sheel Shah, Kartik Nair, Nicholas M. Boffi · 2026

In generative modeling, we often wish to produce samples that maximize a user-specified reward such as aesthetic quality or alignment with human preferences, a problem known as guidance. Despite their…

AI & Data Science Preprint

AdvDMD: Adversarial Reward Meets DMD For High-Quality Few-Step Generation

Xu Wang, Zexian Li, Litong Gong, Tiezheng Ge, Zhijie Deng · 2026

Diffusion models offer superior generation quality at the expense of extensive sampling steps. Distillation methods, with Distribution Matching Distillation (DMD) as a popular example, can mitigate …

AI & Data Science Preprint

Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking

Disha Singha · 2026

Reinforcement learning (RL) systems typically optimize scalar reward functions that assume precise and reliable evaluation of outcomes. However, real-world objectives--especially those derived from hu…

AI & Data Science Preprint

reward-lens: A Mechanistic Interpretability Library for Reward Models

Mohammed Suhail B Nadaf · 2026

Every RLHF-trained language model is shaped by a reward model, yet the mechanistic interpretability toolkit -- logit lens, direct logit attribution, activation patching, sparse autoencoders -- was bui…

Engineering Preprint

Robust Accent Identification via Voice Conversion and Non-Timbral Embeddings

Rayane Bakari, Olivier Le Blouch, Nicolas Gengembre, Nicholas Evans · 2026

Automatic accent identification (AID) remains a challenging task due to the complex variability of accents, the entanglement of accent cues with speaker traits, and the scarcity of reliable accentlabe…

Computer Science Preprint

R$^3$-SQL: Ranking Reward and Resampling for Text-to-SQL

Hojae Han, Yeonseok Jeong, Seung-won Hwang, Zhewei Yao, Yuxiong He · 2026

Modern Text-to-SQL systems generate multiple candidate SQL queries and rank them to judge a final prediction. However, existing methods face two limitations. First, they often score functionally equiv…

AI & Data Science Preprint

Zero Shot Coordination for Sparse Reward Tasks with Diverse Reward Shapings

Keenan Powell, Peihong Yu, Pratap Tokekar · 2026

Many Multi-Agent Reinforcement Learning (MARL) agents fail to adapt properly to cooperating with agents trained with the same objectives but different seeds, algorithms, or other training differences.…

AI & Data Science Preprint

Improving Vision-language Models with Perception-centric Process Reward Models

Yingqian Min, Kun Zhou, Yifan Li, Yuhuan Wu, Han Peng, Yifan Du, Wayne Xin Zhao, Min Yang, Ji-Rong Wen · 2026

Recent advancements in reinforcement learning with verifiable rewards (RLVR) have significantly improved the complex reasoning ability of vision-language models (VLMs). However, its outcome-level supe…

AI & Data Science Preprint

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

Ying-Tu Chen, Wei Hung, Bing-Shu Wu, Zhang-Wei Hong, Ping-Chun Hsieh · 2026

Many sequential decision-making tasks involve optimizing multiple conflicting objectives, requiring policies that adapt to different user preferences. In multi-objective reinforcement learning (MORL),…

AI & Data Science Preprint

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

Zhisong Qiu, Shuofei Qiao, Kewei Xu, Yuqi Zhu, Lun Du, Ningyu Zhang, Huajun Chen · 2026

Process Reward Models (PRMs) have achieved remarkable success in augmenting the reasoning capabilities of Large Language Models (LLMs) within static domains such as mathematics. However, their potenti…

Biology & Life Sciences Preprint

DNA melting: intra base-pair dynamics and a vector generalization of the Peyrard-Bishop-Dauxois model

Nikos Theodorakopoulos · 2026

The Peyrard-Bishop-Dauxois (PBD) model of DNA denaturation, although successful in the description of melting profiles, fails to predict melting entropies, unzipping forces and dynamical properties, e…

AI & Data Science Preprint

Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation

Lichen Li, Hengguang Zhou, Yijun Liang, Tianyi Zhou, Cho-Jui Hsieh · 2026

Reward hacking in code generation, where models exploit evaluation loopholes to obtain full reward without correctly solving the tasks, poses a critical challenge for Reinforcement Learning (RL) and t…

AI & Data Science Preprint

K-Score: Kalman Filter as a Principled Alternative to Reward Normalization in Reinforcement Learning

Zixuan Xia, Quanxi Li · 2026

We propose a simple yet effective alternative to reward normalization in policy gradient reinforcement learning by integrating a 1D Kalman filter for online reward estimation. Instead of relying on fi…
