Bob Carpenter — Preprint — Research Repository

AI & Data Science Preprint PDF DOI

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin · 2026

The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…

Read Paper →

Physics Preprint PDF DOI

Blazar flares from plasma blobs crossing the broad-line region

Sebastien Le Bihan, Anton Dmytriiev, Andreas Zech · 2026

The blazar 3C 279 is well known for its rapid and large-amplitude variability. On 20 December 2013, the source exhibited an orphan {\gamma}-ray flare characterized by a flux-doubling timescale of a fe…

Read Paper →

Computer Science Preprint PDF DOI

The Nesting Bird Box Problem is ER-complete: Sharp Hardness Results for the Hidden Set Problem

Lucas Meijer, Till Miltzow, Johanna Ockenfels, Milos Stojakovic · 2026

In the (Nesting) Bird Box Problem we are given a polygonal domain P and a number k and we want to know if there is a set B of k points inside P such that no two points in B can see each other. The und…

Read Paper →

Mathematics Preprint PDF DOI

A Leakage Bound for Confidence Sets after Black-Box Selection

Sayantan Banerjee · 2026

In many analyses the object reported at the end is not fixed in advance, but is chosen after a preliminary search over variables, subgroups, transformations, models or contrasts. Classical selective-i…

Read Paper →

AI & Data Science Preprint PDF DOI

From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Serhii Zabolotnii, Viktoriia Holinko, Olha Antonenko · 2026

Trust in clinical artificial intelligence (AI) cannot be reduced to model accuracy, fluency of generation, or overall positive user impression. In medicine, trust must be engineered as a measurable sy…

Read Paper →

Mathematics Preprint PDF DOI

Exact formula for the 2-marginal second moment function of the multidimensional symmetric Markov random flight

Alexander D. Kolesnik · 2026

We consider the symmetric Markov random flight $\bold X(t), \; t>0,$ in the Euclidean space $\Bbb R^m, \; m\ge 3$, performed by a particle that moves in $\Bbb R^m$ with constant finite speed and chang…

Read Paper →

AI & Data Science Preprint PDF DOI

Improving Diversity in Black-box Few-shot Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Kien Do, Sunil Gupta · 2026

Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods re…

Read Paper →

AI & Data Science Preprint PDF DOI

Diverse Image Priors for Black-box Data-free Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Trung Le, Kien Do, Sunil Gupta · 2026

Knowledge distillation (KD) represents a vital mechanism to transfer expertise from complex teacher networks to efficient student models. However, in decentralized or secure AI ecosystems, privacy reg…

Read Paper →

Computer Science Preprint PDF DOI

Using Large Language Models for Black-Box Testing of FMU-Based Simulations

Abdullah Mughees, Gaadha Sudheerbabu, Tanwir Ahmad, Dragos Truscan, Mikael Manng{aa}rd, Kristian Klemets · 2026

We propose a human in the loop approach for black-box testing of Functional Mock-up Units (FMUs) using Large Language Models (LLMs). The goal is to reduce the manual effort in defining test scenarios …

Read Paper →

AI & Data Science Preprint PDF DOI

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance

Jon-Paul Cacioli · 2026

Detecting sandbagging--the deliberate underperformance on capability evaluations--is an open problem in AI safety. We tested whether symptom validity testing (SVT) logic from clinical malingering dete…

Read Paper →

Mathematics Preprint PDF DOI

A polynomial-time solvable class of sparse box-constrained polynomial optimization problems

Aida Khajavirad · 2026

We study the problem of minimizing a multivariate polynomial function over the unit hypercube. By representing the polynomial through a hypergraph and exploiting its sparsity structure, we establish a…

Read Paper →

AI & Data Science Preprint PDF DOI

Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity

Bojie Li · 2026

Closed-source frontier labs do not disclose parameter counts, and the standard alternative -- inference economics -- carries $2\times$+ uncertainty from hardware, batching, and serving-stack assumptio…

Read Paper →

Physics Preprint PDF DOI

Dualistic operational characterization of device-dependent correlation sets via convex analysis in the $(2,m,2)$ Bell scenario

Ryosuke Nogami, Jaeha Lee · 2026

We analyze device-dependent correlation sets generated by fixed local dichotomic measurements for two-qubit systems in the $(2,m,2)$ Bell scenario. We consider three fundamental state spaces for the c…

Read Paper →

Physics Preprint PDF DOI

Thermal instability in coronal loops: linking eigenvalue spectra to time-dependent evolution

Adrian Kelly, Rony Keppens, Jordi De Jonghe · 2026

Cool, dense condensations such as coronal rain and prominences suggest that coronal plasma can undergo runaway radiative cooling. Connecting this behaviour to linear thermal modes requires us to fully…

Read Paper →

AI & Data Science Preprint PDF DOI

Instance Awareness of Multi-class Semantic Segmentation Loss Functions

Soumya Snigdha Kundu, Florian Kofler, Marina Ivory, Hendrik Moller, Jonathan Shapey, Tom Vercauteren · 2026

Instance-sensitive losses for semantic segmentation such as blob loss and CC loss were designed to address instance imbalance, ensuring small lesions generate the same gradient as large ones, but oper…

Read Paper →

Computer Science Preprint PDF DOI

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

Taeyoon Kim, Kyumin Kim, Enrique Molina-Gimenez, Pedro Garcia-Lopez, Kyungyong Lee · 2026

Cloud users aim to minimize cost while maximizing performance by selecting the most suitable instance types for their workloads. To reduce expenses, spot instances have been widely adopted due to thei…

Read Paper →

Physics Preprint PDF DOI

Partial solvability induced by dark states in a box trap with decentered two-body interaction

Hossein Abedi, Nathan L. Harshman, Peter Schmelcher · 2026

We consider a generalization of the two-body contact interaction for nonrelativistic particles confined to a one-dimensional box, in which the interaction is decentered, i.e., the particles interact o…

Read Paper →

AI & Data Science Preprint PDF DOI

Agentic Adversarial Rewriting Exposes Architectural Vulnerabilities in Black-Box NLP Pipelines

Mazal Bethany, Kim-Kwang Raymond Choo, Nishant Vishwamitra, Peyman Najafirad · 2026

Multi-component natural language processing (NLP) pipelines are increasingly deployed for high-stakes decisions, yet no existing adversarial method can test their robustness under realistic conditions…

Read Paper →

Computer Science Preprint PDF DOI

Protecting the Trace: A Principled Black-Box Approach Against Distillation Attacks

Max Hartman, Vidhata Jayaraman, Moulik Choraria, Lav R. Varshney · 2026

Frontier models push the boundaries of what is learnable at extreme computational costs, yet distillation via sampling reasoning traces exposes closed-source frontier models to adversarial third parti…

Read Paper →

AI & Data Science Preprint PDF DOI

Operational Feature Fingerprints of Graph Datasets via a White-Box Signal-Subspace Probe

Yuchen Xiong, Swee Keong Yeap, Zhen Hong Ban · 2026

Graph neural networks achieve strong node-classification accuracy, but learned message passing entangles ego attributes, neighborhood smoothing, high-pass graph differences, class geometry, and classi…

Read Paper →

Browse Research Papers

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Blazar flares from plasma blobs crossing the broad-line region

The Nesting Bird Box Problem is ER-complete: Sharp Hardness Results for the Hidden Set Problem

A Leakage Bound for Confidence Sets after Black-Box Selection

From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Exact formula for the 2-marginal second moment function of the multidimensional symmetric Markov random flight

Improving Diversity in Black-box Few-shot Knowledge Distillation

Diverse Image Priors for Black-box Data-free Knowledge Distillation

Using Large Language Models for Black-Box Testing of FMU-Based Simulations

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance

A polynomial-time solvable class of sparse box-constrained polynomial optimization problems

Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity

Dualistic operational characterization of device-dependent correlation sets via convex analysis in the $(2,m,2)$ Bell scenario

Thermal instability in coronal loops: linking eigenvalue spectra to time-dependent evolution

Instance Awareness of Multi-class Semantic Segmentation Loss Functions

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

Partial solvability induced by dark states in a box trap with decentered two-body interaction

Agentic Adversarial Rewriting Exposes Architectural Vulnerabilities in Black-Box NLP Pipelines

Protecting the Trace: A Principled Black-Box Approach Against Distillation Attacks

Operational Feature Fingerprints of Graph Datasets via a White-Box Signal-Subspace Probe

Browse by Category

Research Type

Publish Your Research