Dominik Helm — Research Repository

Computer Science Preprint

Unsafe and Unused? A History of Utility Code in Mature Open Source Projects

Brandon Keller, Kaitlin Yandik, Angela Ngo, Andy Meneely · 2026

Filenames are a concise means of conveying information about source code to fellow developers. One such convention is util. Commonly understood to stand for "utility", filenames with the letters util …


Computer Science Preprint

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

Prashant Kulkarni · 2026

Multi-turn prompt injection follows a known attack path -- trust-building, pivoting, escalation -- but text-level defenses miss covert attacks in which individual turns appear benign. We show this attack pa…


AI & Data Science Preprint

What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design

Ivan Bercovich · 2026

Terminal-agent benchmarks have become a primary signal for measuring the coding and system-administration capabilities of large language models. As the market for evaluation environments grows, so doe…


AI & Data Science Preprint

RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses

Feiyu Wu, Xu Zheng, Zhuocheng Wang, Yi ming Dai, Hui Li · 2026

Large language models (LLMs) make reward design in reinforcement learning substantially more scalable, but generated rewards are not automatically reliable training objectives. Existing work has focus…


AI & Data Science Preprint

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking

Qing Lyu, Jeremy Hudson, Mohammad Kawas, Yuming Jiang, Chenyu You, Christopher T Whitlow · 2026

Individualized Alzheimer's disease (AD) progression prediction requires models that use irregular visits, account for censoring, avoid diagnostic leakage, and provide calibrated horizon risks. We prop…


Engineering Preprint

Dreaming Across Towns: Semantic Rollout and Town-Adversarial Regularization for Zero-Shot Held-Out-Town Fixed-Route Driving in CARLA

Feeza Khan Khanzada, Jaerock Kwon · 2026

Learned driving agents often degrade when deployed in unseen environments. This paper studies a deliberately bounded instance of that problem in the CARLA simulator: zero-shot transfer of a closed-loo…


AI & Data Science Preprint

D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery

Hanane Nour Moussa, Yifei Li, Zhuoyang Li, Yankai Yang, Cheng Tang, Tianshu Zhang, Nesreen K. Ahmed, Ali Payani, Ziru Chen, Huan Sun · 2026

Despite recent progress in language models and agents for scientific data-driven discovery, further advancing their capabilities is held back by the absence of verifiable environments representing rea…


AI & Data Science Preprint

Simulating clinical interventions with a generative multimodal model of human physiology

Guy Lutsker, Gal Sapir, Jordi Merino, Smadar Shilo, Anastasia Godneva, Eli Meirom, Shie Mannor, Hagai Rossman, Gal Chechik, Eran Segal · 2026

Understanding how human health changes over time, and why responses to interventions vary between individuals, remains a central challenge in medicine. Here we present HealthFormer, a decoder-only tra…


AI & Data Science Preprint

ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era

Mohit Dubey, Open Gigantic · 2026

Every document format in existence was designed for a human reader moving linearly through text. Autonomous LLM agents do not read - they retrieve. This fundamental mismatch forces agents to inject en…


Physics Preprint

On Linear and Non-Linear Mechanics of Cyanobacterial Colonies

Yuri Z. Sinzato, Annemieke M. Drost, Dedmer B. Van de Waal, Robert Uittenbogaard, Petra M. Visser, Jef Huisman, Maziyar Jalaal · 2026

Toxic cyanobacterial blooms are a growing environmental concern that affects freshwater ecosystems, drinking water supplies, and public health. The cyanobacterium Microcystis is among the most importa…


AI & Data Science Preprint

Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and Why

Sascha Xu, Jilles Vreeken · 2026

We study the problem of understanding where two populations differ within a feature space, which we formalize in the concept of a differential subgroup: a subset of individuals from both populations w…


Sociology & Anthropology Preprint

Crowd Dynamics in Historical Perspective: Reframing the Amritsar Massacre through Agent-Based Modelling and Social Psychology

Mohcine Chraibi, Krisztina Konya, Ezel Usten · 2026

Crowds have long held a paradoxical place in the human imagination, feared for their destructive potential yet essential for collective expression. This tension was tragically manifested in the 1919 J…


Computer Science Preprint

LLM-as-a-Judge for Human-AI Co-Creation: A Reliability-Aware Evaluation Framework for Coding

Md Faizul Ibne Amin, Yutaka Watanobe, Daniel M. Muepu, Haruto Suzuki, Kenta Nanaumi, Md Mostafizer Rahman · 2026

LLMs are increasingly employed both as judges for evaluating open-ended outputs and as co-creation partners in AI-assisted programming; yet rigorous evaluation in human-AI co-creation settings remains…


AI & Data Science Preprint

ANCORA: Learning to Question via Manifold-Anchored Self-Play for Verifiable Reasoning

Chengcao Yang, Jun Chen · 2026

We propose a paradigm shift from learning to answer to learning to question: can a language model generate verifiable problems, solve them, and turn the resulting feedback into self-improvement withou…


Mathematics Preprint

A Systematic Review of Recent Advancements in PINN Augmented Deep Learning and Mathematical Modeling for Efficient Portfolio Management

Bahadur Yadav, Sanjay Kumar Mohanty · 2026

In finance, portfolio management is a traditional yet difficult problem that has drawn attention from practitioners and researchers for many years. However, there are still difficult technological pro…


AI & Data Science Preprint

Learning from a single labeled face and a stream of unlabeled data

Branislav Kveton, Michal Valko · 2026

Face recognition from a single image per person is a challenging problem because the training sample is extremely small. We consider a variation of this problem. In our problem, we recognize only one …


Physics Preprint

Topological antiqued mechanical toy

Hirofumi Wada, Hayato Mizobata, Shuto Ueno, Taiju Yoneda · 2026

Jacob's ladder -- a classic children's toy -- is a simple mechanical frame comprising rigid blocks connected by strings that shows curious unidirectional flipping waves. Nonetheless, its physica…


Computer Science Preprint

lpviz: Interactive Linear Programming Visualization

Evan Grand, Michael Klamkin · 2026

This paper presents lpviz, a browser-based visualization tool for linear programming. lpviz is deeply interactive, offering an intuitive interface where users can directly draw and edit the feasible r…


Engineering Preprint

BUT System Description for CHiME-9 MCoRec Challenge

Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukas Burget · 2026

Multi-talker automatic speech recognition (ASR) in conversational recordings remains an open problem, particularly in scenarios with a large portion of overlapping speech where identifying and transcrib…


AI & Data Science Preprint

Leading Across the Spectrum of Human-AI Relationships: A Conceptual Framework for Increasingly Heterogeneous Teams

Alejandro R. Jadad · 2026

What shapes a consequential decision when human and artificial intelligence work on it together? The answer is becoming harder to see. A decision may look human-led after AI has set the frame, or appe…

