Hartmut Kaiser — Research Repository

Computer Science Preprint PDF DOI

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Chenxin Li, Zhengyang Tang, Huangxin Lin, Yunlong Lin, Shijue Huang, Shengyuan Liu, Bowen Ye, Rang Li, Lei Li, Benyou Wang, Yixuan Yuan · 2026

LLM agents are expected to complete end-to-end units of work across software tools, business services, and local workspaces. Yet many agent benchmarks freeze a curated task set at release time and gra…

Read Paper →

Physics Preprint PDF DOI

Some Properties and Uses of the Species Scale

Luis E. Ibanez · 2026

The 'Species Scale' has proved to be an important concept when studying consistent effective actions in Quantum Gravity. This is a short summary of my contribution to the Corfu Summer Institute in Sep…

Read Paper →

Mathematics Preprint PDF DOI

A Systematic Review of Recent Advancements in PINN Augmented Deep Learning and Mathematical Modeling for Efficient Portfolio Management

Bahadur Yadav, Sanjay Kumar Mohanty · 2026

In finance, portfolio management is a traditional yet difficult problem that has drawn attention from practitioners and researchers for many years. However, there are still difficult technological pro…

Read Paper →

AI & Data Science Preprint PDF DOI

Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift

Haiyang Zhao · 2026

Visual model-based reinforcement learning (MBRL) agents can perform well on the training distribution, but often break down once the test environment shifts. In visual MBRL, recognizing that a shift h…

Read Paper →

Mathematics Preprint PDF DOI

Locally rigid implies globally rigid in Kahler geometry

Mu-Lin Li · 2026

In this paper, we study the rigidity properties of compact Kahler manifolds. Given a smooth family of compact Kahler manifolds X over the unit disk, we show that all the fibers are mutually isomorphic…

Read Paper →

AI & Data Science Preprint PDF DOI

Towards interpretable AI with quantum annealing feature selection

Francesco Aldo Venturelli, Emanuele Costa, Sikha O K, Bruno Julia-Diaz, Miguel A. Gonzalez Ballester, Alba Cervera-Lierta · 2026

Deep learning models are used in critical applications, in which mistakes can have serious consequences. Therefore, it is crucial to understand how and why models generate predictions. This understand…

Read Paper →

AI & Data Science Preprint PDF DOI

Sample-efficient Neuro-symbolic Proximal Policy Optimization

Simone Murari, Celeste Veronese, Daniele Meli · 2026

Deep Reinforcement Learning (DRL) algorithms often require a large amount of data and struggle in sparse-reward domains with long planning horizons and multiple sub-goals. In this paper, we propose a …

Read Paper →

AI & Data Science Preprint PDF DOI

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Hanqing Yang, Qiang Zhou, Yongchao Du, Sashuai Zhou, Zhibin Wang, Jun Song, Tiezheng Ge, Cheng Yu, Bo Zheng · 2026

Recent image editing models have achieved strong visual fidelity but often struggle with tasks requiring complex reasoning. To investigate and enhance the reasoning-grounded planning for image editing…

Read Paper →

AI & Data Science Preprint PDF DOI

Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate

John Seon Keun Yi, Aaron Mueller, Dokyun Lee · 2026

Multi-agent debate has been shown to improve reasoning in large language models (LLMs). However, it is compute-intensive, requiring generation of long transcripts before answering questions. To addres…

Read Paper →

AI & Data Science Preprint PDF DOI

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

Qiliang Liang, Hansi Wang, Zhong Liang, Yang Liu · 2026

LLM agents increasingly rely on reusable skills, capability packages that combine instructions, control flow, constraints, and tool calls. In most current agent systems, however, skills are still repr…

Read Paper →

AI & Data Science Preprint PDF DOI

V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think

Bingda Tang, Yuhui Zhang, Xiaohan Wang, Jiayuan Mao, Ludwig Schmidt, Serena Yeung-Levy · 2026

Aligning denoising generative models with human preferences or verifiable rewards remains a key challenge. While policy-gradient online reinforcement learning (RL) offers a principled post-training fr…

Read Paper →

Computer Science Preprint PDF DOI

Fast Core Identification

Irene Aldridge · 2026

This paper examines the computational complexity of the \emph{Core Identification Problem} (CIP) in one-sided matching markets governed by the Top Trading Cycles (TTC) algorithm. The central contribut…

Read Paper →

AI & Data Science Preprint PDF DOI

Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models

Felix Herron, Solange Rossato, Alexandre Allauzen, Francois Portet · 2026

Modern automatic speech recognition (ASR) systems have been observed to function better for certain speaker groups (SGs) than others, despite recent gains in overall performance. One potential impedim…

Read Paper →

Physics Preprint PDF DOI

Kahler decoupling for Kerr perturbations

Stephen R. Green, Kirill Krasnov, Adam Shaw · 2026

The Euclidean Kerr metric is conformal, in two distinct ways, to a Kahler metric, with conformal factors determined by the repeated eigenvalue of the two chiral halves of the Weyl curvature. A Lorentz…

Read Paper →

Computer Science Preprint PDF DOI

Enhancing a gamified tool for UML modeling education

Giacomo Garaccione, Riccardo Coppola, Luca Ardito · 2026

Unified Modeling Language (UML) Use Case and Class Diagrams are fundamental modeling notations in Software Engineering (SE) education due to their importance for requirements and model-based engineeri…

Read Paper →

Mathematics Preprint PDF DOI

Kneser Graphs of Triangulations are Hamiltonian

Anton Molnar, Cosmin Pohoata, Michael Zheng · 2026

For every $n \geq 5$, we show that the Kneser graph of triangulations of a convex $n$-gon contains a Hamiltonian cycle.…

Read Paper →

Biology & Life Sciences Preprint PDF DOI

ProDock: From multi-target consensus docking into database-backed storage

Tieu-Long Phan, Lai Hoang Son Le, Thanh-An Pham, Nhu-Ngoc Nguyen Song, Tuyet-Minh Phan, Tuyen Ngoc Truong · 2026

Protein--ligand docking is widely used in structure-based discovery, but routine studies often fail at the workflow level rather than at the scoring level. Receptor cleaning, ligand preparation, file …

Read Paper →

Computer Science Preprint PDF DOI

Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development

Magnus Palmblad, Jared M. Ragland, Benjamin A. Neely · 2026

The capabilities of AI-assisted coding are progressing at breakneck speed. Chat-based vibe coding has evolved into fully fledged AI-assisted, agentic software development using agent scaffolds where t…

Read Paper →

AI & Data Science Preprint PDF DOI

Causal Discovery in Multivariate Extremes via Tail Asymmetry

Mengran Li, Daniela Castro-Camilo · 2026

Causal discovery in multivariate extremes is challenging because extreme observations are sparse, dependent, and often affected by latent common shocks. Existing approaches focus on undirected extrema…

Read Paper →

AI & Data Science Preprint PDF DOI

Deep kernel video approximation for unsupervised action segmentation

Silvia L. Pintea, Jouke Dijkstra · 2026

This work focuses on per-video unsupervised action segmentation, which is of interest to applications where storing large datasets is either not possible, or nor permitted. We propose to segment video…

Read Paper →

Browse Research Papers

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Some Properties and Uses of the Species Scale

A Systematic Review of Recent Advancements in PINN Augmented Deep Learning and Mathematical Modeling for Efficient Portfolio Management

Detecting is Easy, Adapting is Hard: Local Expert Growth for Visual Model-Based Reinforcement Learning under Distribution Shift

Locally rigid implies globally rigid in Kahler geometry

Towards interpretable AI with quantum annealing feature selection

Sample-efficient Neuro-symbolic Proximal Policy Optimization

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think

Fast Core Identification

Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models

Kahler decoupling for Kerr perturbations

Enhancing a gamified tool for UML modeling education

Kneser Graphs of Triangulations are Hamiltonian

ProDock: From multi-target consensus docking into database-backed storage

Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development

Causal Discovery in Multivariate Extremes via Tail Asymmetry

Deep kernel video approximation for unsupervised action segmentation

Browse by Category

Research Type

Publish Your Research