Kurt Keutzer — Research Repository

AI & Data Science Preprint PDF DOI

Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift

Haiyang Zhao · 2026

Visual model-based reinforcement learning (MBRL) agents can perform well on the training distribution, but often break down once the test environment shifts. In visual MBRL, recognizing that a shift h…

Read Paper →

Physics Preprint PDF DOI

Schroedinger's Equation at 100: The Wave Picture That Helped and Possibly Hurt

Caslav Brukner · 2026

Schroedinger's equation gave early quantum theory a visual language that looked like physics again: a wave evolving by a linear differential equation. This essay argues that the same success also seed…

Read Paper →

Engineering Preprint PDF DOI

Dual-Polarized Massive MIMO Based on Precoding for Vehicle-To-Ground Communication in Urban Rail Transit

Zhengyuan Wu, Junhui Zhao, Qingmiao Zhang, Ming Zhang · 2026

The development of intelligent and diversified ser vices in urban rail transit (URT) has resulted in an increasing de mand for high-rate communication between vehicles and ground equipment. However, e…

Read Paper →

Physics Preprint PDF DOI

Renormalization-group improved Schwarzschild black hole: shadow, ringdown, and strong cosmic censorship

Ahmad Al-Badawi, Faizuddin Ahmed, Izzet Sakall{i} · 2026

We study a renormalization-group (RG) improved Schwarzschild-like black hole (BH) whose lapse interpolates between a classical Schwarzschild exterior and a quantum-smoothed interior governed by a cuto…

Read Paper →

AI & Data Science Preprint PDF DOI

When Corrective Hints Hurt: Prompt Design in Reasoner-Guided Repair of LLM Overcaution on Entailed Negations under OWL~2~DL

Yijiashun Qi, Xiang Xu, Yuxuan Li · 2026

We report a reproducible error pattern in GPT-5.4 on OWL~2~DL compliance queries: the model frequently answers ``unknown'' when the reasoner-entailed answer is ``no'' under \emph{FunctionalProperty} c…

Read Paper →

AI & Data Science Preprint PDF DOI

When Policies Cannot Be Retrained: A Unified Closed-Form View of Post-Training Steering in Offline Reinforcement Learning

Elias Hossain, Mohammad Jahid Ibna Basher, Ivan Garibay, Ozlem Garibay, Niloofar Yousefi · 2026

Offline reinforcement learning (RL) can learn effective policies from fixed datasets, but deployment objectives may change after training, and in many applications the trained actor cannot be retraine…

Read Paper →

AI & Data Science Preprint PDF DOI

Correction and Corruption: A Two-Rate View of Error Flow in LLM Protocols

Fernando Reitich · 2026

Large language models are increasingly deployed as protocols: structured multi-call procedures that spend additional computation to transform a baseline answer into a final one. These protocols are ev…

Read Paper →

AI & Data Science Preprint PDF DOI

GraSP: Graph-Structured Skill Compositions for LLM Agents

Tianle Xia, Lingxiang Hu, Yiding Sun, Ming Xu, Lan Xu, Siying Wang, Wei Xu, Jie Jiang · 2026

Skill ecosystems for LLM agents have matured rapidly, yet recent benchmarks show that providing agents with more skills does not monotonically improve performance -- focused sets of 2-3 skills outperf…

Read Paper →

AI & Data Science Preprint PDF DOI

BARD: Bridging AutoRegressive and Diffusion Vision-Language Models Via Highly Efficient Progressive Block Merging and Stage-Wise Distillation

Baoyou Chen, Hanchen Xia, Peng Tu, Haojun Shi, Liwei Zhang, Weihao Yuan, Siyu Zhu · 2026

Autoregressive vision-language models (VLMs) deliver strong multimodal capability, but their token-by-token decoding imposes a fundamental inference bottleneck. Diffusion VLMs offer a more parallel de…

Read Paper →

AI & Data Science Preprint PDF DOI

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Linhao Yu, Tianmeng Yang, Siyu Ding, Renren Jin, Naibin Gu, Xiangzhao Hao, Shuaiyi Nie, Deyi Xiong, Weichong Yin, Yu Sun, Hua Wu · 2026

RLVR improves reasoning in large language models, but its effectiveness is often limited by severe reward sparsity on hard problems. Recent hint-based RL methods mitigate sparsity by injecting partial…

Read Paper →

AI & Data Science Preprint PDF DOI

Who Handles Orientation? Investigating Invariance in Feature Matching

David Nordstrom, Johan Edstedt, Fredrik Kahl, Georg Bokman · 2026

Finding matching keypoints between images is a core problem in 3D computer vision. However, modern matchers struggle with large in-plane rotations. A straightforward mitigation is to learn rotation in…

Read Paper →

AI & Data Science Preprint PDF DOI

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

Xing Zhang, Guanghui Wang, Yanwei Cui, Wei Qiu, Ziyuan Li, Bing Zhu, Peiyang He · 2026

Developers increasingly guide AI coding agents through natural language instruction files (e.g., CLAUDE.md, .cursorrules), yet no controlled study has measured whether these rules actually improve age…

Read Paper →

AI & Data Science Preprint PDF DOI

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation

Sandro Andric · 2026

Large language models are increasingly used as agents in social, economic, and policy simulations. A common assumption is that stronger reasoning should improve simulation fidelity. We argue that this…

Read Paper →

AI & Data Science Preprint PDF DOI

Turing or Cantor: That is the Question

Eugene Eberbach · 2026

Alan Turing is considered as a founder of current computer science together with Kurt Godel, Alonzo Church and John von Neumann. In this paper multiple new research results are presented. It is demons…

Read Paper →

AI & Data Science Preprint PDF DOI

$p1$: Better Prompt Optimization with Fewer Prompts

Zhaolin Gao, Yu (Sid) Wang, Bo Liu, Thorsten Joachims, Kiante Brantley, Wen Sun · 2026

Prompt optimization improves language models without updating their weights by searching for a better system prompt, but its effectiveness varies widely across tasks. We study what makes a task amenab…

Read Paper →

AI & Data Science Preprint PDF DOI

Shortcut Learning in Glomerular AI: Adversarial Penalties Hurt, Entropy Helps

Mohammad Daouk, Jan Ulrich Becker, Neeraja Kambham, Anthony Chang, Hien Van Nguyen, Chandra Mohan · 2026

Stain variability is a pervasive source of distribution shift and potential shortcut learning in renal pathology AI. We ask whether lupus nephritis glomerular lesion classifiers exploit stain as a sho…

Read Paper →

AI & Data Science Preprint PDF DOI

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Qihan Ren, Peng Wang, Ruikun Cai, Shuai Shao, Dadi Guo, Yuejin Xie, Yafu Li, Quanshi Zhang, Xia Hu, Jing Shao, Dongrui Liu · 2026

A prevailing narrative in LLM post-training holds that supervised finetuning (SFT) memorizes while reinforcement learning (RL) generalizes. We revisit this claim for reasoning SFT with long chain-of-t…

Read Paper →

AI & Data Science Preprint PDF DOI

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

Bryan Cheng, Jasper Zhang · 2026

We present the first systematic study of when target context helps molecular property prediction, evaluating context conditioning across 10 diverse protein families, 4 fusion architectures, data regim…

Read Paper →

AI & Data Science Preprint PDF DOI

Conformal Margin Risk Minimization: An Envelope Framework for Robust Learning under Label Noise

Yuanjie Shi, Peihong Li, Zijian Zhang, Janardhan Rao Doppa, Yan Yan · 2026

Most methods for learning with noisy labels require privileged knowledge such as noise transition matrices, clean subsets or pretrained feature extractors, resources typically unavailable when robustn…

Read Paper →

Physics Preprint PDF DOI

Testing parity with composite-field spectra of BOSS and DESI luminous red galaxies

Zucheng Gao, Marina S. Cagliari, Azadeh Moradinezhad Dizgah, Zvonimir Vlah · 2026

Detection of parity violation on cosmological scales would have profound implications for fundamental physics. Motivated in part by recent measurements of parity-odd four-point correlation functions i…

Read Paper →

Browse Research Papers

Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift

Schroedinger's Equation at 100: The Wave Picture That Helped and Possibly Hurt

Dual-Polarized Massive MIMO Based on Precoding for Vehicle-To-Ground Communication in Urban Rail Transit

Renormalization-group improved Schwarzschild black hole: shadow, ringdown, and strong cosmic censorship

When Corrective Hints Hurt: Prompt Design in Reasoner-Guided Repair of LLM Overcaution on Entailed Negations under OWL~2~DL

When Policies Cannot Be Retrained: A Unified Closed-Form View of Post-Training Steering in Offline Reinforcement Learning

Correction and Corruption: A Two-Rate View of Error Flow in LLM Protocols

GraSP: Graph-Structured Skill Compositions for LLM Agents

BARD: Bridging AutoRegressive and Diffusion Vision-Language Models Via Highly Efficient Progressive Block Merging and Stage-Wise Distillation

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Who Handles Orientation? Investigating Invariance in Feature Matching

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

When Reasoning Models Hurt Behavioral Simulation: A Solver-Sampler Mismatch in Multi-Agent LLM Negotiation

Turing or Cantor: That is the Question

$p1$: Better Prompt Optimization with Fewer Prompts

Shortcut Learning in Glomerular AI: Adversarial Penalties Hurt, Entropy Helps

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

Conformal Margin Risk Minimization: An Envelope Framework for Robust Learning under Label Noise

Testing parity with composite-field spectra of BOSS and DESI luminous red galaxies

Browse by Category

Research Type

Publish Your Research