Gerhard Bauch — Research Repository

AI & Data Science Preprint PDF DOI

Representation Fr\'echet Loss for Visual Generation

Jiawei Yang, Zhengyang Geng, Xuan Ju, Yonglong Tian, Yue Wang · 2026

We show that Fr\'echet Distance (FD), long considered impractical as a training objective, can in fact be effectively optimized in the representation space. Our idea is simple: decouple the population…

Read Paper →

Mathematics Preprint PDF DOI

Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

Max Lovig · 2026

In modern parametric model training, full-batch gradient descent (and its variants) suffers due to progressively stronger biasing towards the exact realization of training data; this drives the system…

Read Paper →

Mathematics Preprint PDF DOI

Structural properties of Bia{\l}ynicki-Birula decompositions

Teddy Gonzales, Chayim Lowen · 2026

We investigate several aspects of the Bialynicki-Birula decomposition of a smooth complete $\mathbb{G}_m$-variety with finite fixed locus. Our results include novel characterizations of when the Bialy…

Read Paper →

Mathematics Preprint PDF DOI

The Bernstein-von Mises theorem for Bayesian one-pass online learning

Jeyong Lee, Junhyeok Choi, Dongguen Kim, Minwoo Chae · 2026

Bayesian online learning provides a coherent framework for sequential inference. However, its theoretical understanding remains limited, particularly in the one-pass setting. Existing theoretical guar…

Read Paper →

AI & Data Science Preprint PDF DOI

A Short Note on Batch-efficient Divide-and-Conquer Algorithm for EigenDecomposition

Yue Song · 2026

EigenDecomposition (ED) is at the heart of many computer vision algorithms and applications. One crucial bottleneck limiting its usage is the expensive computation cost, particularly for a mini-batch …

Read Paper →

AI & Data Science Preprint PDF DOI

YOSE: You Only Select Essential Tokens for Efficient DiT-based Video Object Removal

Chenyang Wu, Lina Lei, Fan Li, Chun-Le Guo, Dehong Kong, Xinran Qin, Zhixin Wang, Ming-Ming Cheng, Chongyi Li · 2026

Recent advances in Diffusion Transformer (DiT)-based video generation technologies have shown impressive results for video object removal. However, these methods still suffer from substantial inferenc…

Read Paper →

AI & Data Science Preprint PDF DOI

Analytical Correction for Subsampling Bias in Drifting Models

Jiaru Zhang, Zeyun Deng, Juanwu Lu, Ziran Wang, Ruqi Zhang · 2026

Drifting models are capable one-step generative models trained to follow a drifting field. The field combines attractive and repulsive softmax-weighted centroids over the data and current-generator di…

Read Paper →

Mathematics Preprint PDF DOI

Reliability-based Topology Optimization using Large Deviation Theory

Maryam Maghazeh, Ayyappan Unnikrishna Pillai, Mohammad Masiur Rahaman, Subhayan De · 2026

Reliability-based topology optimization (RBTO) requires repeated estimation of small failure probabilities and their gradients, making conventional nested Monte Carlo approaches computationally prohib…

Read Paper →

Mathematics Preprint PDF DOI

Deep Policy Iteration for High-Dimensional Mean-Field Games with Regenerative Reformulation

Shuixin Fang, Shupeng Wang, Zhen Wu, Hui Zhang, Tao Zhou · 2026

This paper develops a deep policy iteration method for high-dimensional finite-horizon mean-field games. We reformulate the game as a regenerative problem with deterministic cycles, which allows polic…

Read Paper →

Computer Science Preprint PDF DOI

COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training

Akhmed Sakip, Erland Hilman Fuadi, Omar Sayedelahl, Zonghang Li, Jianshu She, Alham Fikri Aji, Steve Liu, Eric Xing, Qirong Ho · 2026

Training large language models requires jointly configuring two interdependent aspects of the system: the global batch size, which governs statistical efficiency, and the 3D parallelism strategy, whic…

Read Paper →

Mathematics Preprint PDF DOI

A stellated tetrahedron that is probably not Rupert

Tony Zeng · 2026

A convex polyhedron is Rupert if a hole can be cut into it (making its genus $1$) such that an identical copy of the polyhedron can pass through the hole. Resolving a conjecture of Jerrard-Wetzel-Yuan…

Read Paper →

Computer Science Preprint PDF DOI

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch

Hanna Foerster, Ilia Shumailov, Cheng Zhang, Yiren Zhao, Jamie Hayes, Robert Mullins · 2026

Dynamic quantization emerged as a practical approach to increase the utilization and efficiency of the machine learning serving flow. Unlike static quantization, which applies quantization offline, dy…

Read Paper →

Computer Science Preprint PDF DOI

Differentially Private Contrastive Learning via Bounding Group-level Contribution

Kecen Li, Chen Gong, Zinan Lin, Tianhao Wang, Xiaokui Xiao · 2026

Differentially private (DP) contrastive learning aims to learn general-purpose representations from sensitive data, alleviating the privacy leakage concerns of organizations deploying or sharing embed…

Read Paper →

Engineering Preprint PDF DOI

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

Jaskirat Sudan, Hashim Ali, Surya Subramani, Hafiz Malik · 2026

Supervised contrastive learning (SupCon) is widely used to shape representations, but has seen limited targeted study for audio deepfake detection. Existing work typically combines contrastive terms w…

Read Paper →

AI & Data Science Preprint PDF DOI

RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts

Vyom Sharma, Debajyoti Datta · 2026

The optimal kernel configuration for Mixture-of-Experts (MoE) inference depends on both batch size and the expert routing distribution, yet production systems dispatch from batch size alone, leaving 1…

Read Paper →

Mathematics Preprint PDF DOI

A structure theorem for sets with doubling $4+\delta$

Yifan Jing, Akshat Mudgal · 2026

We prove a structural result for sets of integers with doubling at most $4 + \delta$, with $\delta>0$ sufficiently small. This generalises earlier work of Eberhard--Green--Manners which dealt with set…

Read Paper →

Computer Science Preprint PDF DOI

Hands-on PDC in Undergraduate Computing Education

Hala ElAarag, Anas Gamal Aly · 2026

Parallel and Distributed Computing (PDC) is a critical yet conceptually challenging area of the undergraduate computer science curriculum. While students often encounter these concepts in theory, few …

Read Paper →

Physics Preprint PDF DOI

NTL-amplified cryogenic light detectors with optically transparent electrodes

Matteo Biassoni, Andrea Nava, Oscar Azzolini, Mattia Beretta, Tommaso Bradanini, Chiara Brofferio, Paolo Carniti, Simone Copello, Mourad El Idrissi, Marco Faverzani, Elena Ferri, Massimo Girola, Luca Gironi, Claudio Gotti, Leonard Imbert, Giorgio Keppel, Nicola Manenti, Ilaria Molinari, Irene Nutini, Maura Pavan, Daniele Peracchi, Gianluigi Pessina, Sonja Schneidewind, Davide Trotta · 2026

The Neganov-Trofimov-Luke (NTL) effect is used by experiments based on cryogenic detectors to boost the sensitivity of light-sensitive devices down to a few optical photons. In this work we introduce …

Read Paper →

Computer Science Preprint PDF DOI

NVLLM: A 3D NAND-Centric Architecture Enabling Edge on-Device LLM Inference

Mingbo Hao, Changwei Yan, Haoyu Cui, Zhihao Yan, Yizhi Ding, Zhangrui Qian, Weiwei Shan · 2026

The rapid growth of LLMs demands high-throughput, memory-capacity-intensive inference on resource-constrained edge devices, where single-batch decoding remains fundamentally memory-bound. Existing out…

Read Paper →

AI & Data Science Preprint PDF DOI

Mini-Batch Class Composition Bias in Link Prediction

Kieran Maguire, Srinandan Dasmahapatra · 2026

Prior work on node classification has shown that Graph Neural Networks (GNNs) can learn representations that transfer across graphs, when underlying graph properties are shared. For a fixed graph, one…

Read Paper →

Browse Research Papers

Representation Fr\'echet Loss for Visual Generation

Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

Structural properties of Bia{\l}ynicki-Birula decompositions

The Bernstein-von Mises theorem for Bayesian one-pass online learning

A Short Note on Batch-efficient Divide-and-Conquer Algorithm for EigenDecomposition

YOSE: You Only Select Essential Tokens for Efficient DiT-based Video Object Removal

Analytical Correction for Subsampling Bias in Drifting Models

Reliability-based Topology Optimization using Large Deviation Theory

Deep Policy Iteration for High-Dimensional Mean-Field Games with Regenerative Reformulation

COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training

A stellated tetrahedron that is probably not Rupert

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch

Differentially Private Contrastive Learning via Bounding Group-level Contribution

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts

A structure theorem for sets with doubling $4+\delta$

Hands-on PDC in Undergraduate Computing Education

NTL-amplified cryogenic light detectors with optically transparent electrodes

NVLLM: A 3D NAND-Centric Architecture Enabling Edge on-Device LLM Inference

Mini-Batch Class Composition Bias in Link Prediction

Browse by Category

Research Type

Publish Your Research