P. Kumar in AI & Data Science — Research Repository

AI & Data Science Preprint PDF DOI

Exponential families from a single KL identity

Marc Dymetman · 2026

Exponential families encompass the distributions central to modern machine learning -- softmax, Gaussians, and Boltzmann distributions -- and underlie the theory of variational inference, entropy-regu…

ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era

Mohit Dubey, Open Gigantic · 2026

Every document format in existence was designed for a human reader moving linearly through text. Autonomous LLM agents do not read; they retrieve. This fundamental mismatch forces agents to inject en…

The TEA Nets framework combines AI and cognitive network science to model targets, events and actors in text

Sebastiano Franchini, Alexis Carrillo, Edoardo Sebastiano De Duro, Riccardo Improta, Ali Aghazadeh Ardebili, Massimo Stella · 2026

We introduce Target-Event-Agent Networks (TEA Nets) as a computational framework to extract subjects ("Agents"), verbs ("Events"), and objects ("Targets") from texts. Grounded in cognitive network …

PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations

Yang Zhang, Jiangyuan Zhao, Chenyou Fan, Fangzheng Yan, Tian Li, Haitong Tang, Sen Fu, Xuan'er Wu, Qizhen Weng, Weinan Zhang, Xiu Li, Chi Zhang, Chenjia Bai, Xuelong Li · 2026

Vision-Language-Action (VLA) models advance robotic control via strong visual-linguistic priors. However, existing VLAs predominantly frame pretraining as supervised behavior cloning, overlooking the …

Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation

Jon-Paul Cacioli · 2026

We adapted the Reliable Change Index (RCI; Jacobson and Truax, 1991) from clinical psychology to item-level LLM version comparison on 2,000 MMLU-Pro items (K=10 samples at T=0.7). Two within-family pa…

Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

Yuan Sun · 2026

As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi…

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

Juergen Dietrich · 2026

Democratic discourse analysis systems increasingly rely on multi-agent LLM pipelines in which distinct evaluator models are assigned adversarial roles to generate structured, multi-perspective assessm…

Selective Augmentation: Improving Universal Automatic Phonetic Transcription via G2P Bootstrapping

Tobias Bystrich, Julia M. Pritzen, Christoph A. Schmidt, Claudia Wich-Reif · 2026

In the field of universal automatic phonetic transcription (APT), clean and diverse training transcriptions are required. However, such high-quality data is limited. We propose the bootstrapping appro…

Learning Rate Transfer in Normalized Transformers

Boris Shigida, Boris Hanin, Andrey Gromov · 2026

The Normalized Transformer, or nGPT (arXiv:2410.01131), achieves impressive training speedups and does not require weight decay or learning rate warmup. However, despite having hyperparameters that exp…

A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound

Francesco Orabona · 2026

In Orabona and Pál [2016], we introduced the shifted KT potentials to remove the $\ln \ln T$ factor in the bound for parameter-free learning with experts. In this short technical note, I show that this …

KAYRA: A Microservice Architecture for AI-Assisted Karyotyping with Cloud and On-Premise Deployment

Attila Pinter, Javier Rico, Attila Repai, Jalal Al-Afandi, Adrienn Eva Borsy, Andras Kozma, Hajnalka Andrikovics, Gyorgy Cserey · 2026

We present KAYRA, an end-to-end karyotyping system that operates inside the operational constraints of a clinical cytogenetic laboratory. KAYRA is architected as a containerized microservice pipeline …

Nonparametric Testing and Variable Selection for ARCH-m(X) Model

Adriano Zanin Zambom, Qing Wang · 2026

We introduce the ARCH-m(X) model, a semiparametric extension of the ARCH-X framework in which the effect of a multivariate exogenous covariate vector X on the conditional variance is modeled through a…

Random Cloud: Finding Minimal Neural Architectures Without Training

Javier Gil Blazquez · 2026

I propose the Random Cloud method, a training-free approach to neural architecture search that discovers minimal feedforward network topologies through stochastic exploration and progressive st…

Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation

Ariel Sela · 2026

Multi-agent deliberation systems using large language models (LLMs) are increasingly proposed for policy simulation, yet they suffer from artificial consensus: evaluator agents converge on the same op…

Layer-wise Lipschitz-Product Control for Deep Kolmogorov-Arnold Network Representations of Compositionally Structured Functions

Aleksander Tankman · 2026

We prove that any continuous function f from [0,1]^n to R representable by a finite computation tree with N internal nodes and compositional sparsity s = O(1) admits a deep Kolmogorov-Arnold Network (…

CORAL: Adaptive Retrieval Loop for Culturally-Aligned Multilingual RAG

Nayeon Lee, Jiwoo Song, Byeongcheol Kang · 2026

Multilingual retrieval-augmented generation (mRAG) is often implemented within a fixed retrieval space, typically via query or document translation or multilingual embedding vector representations. Ho…

Prior-Aligned Data Cleaning for Tabular Foundation Models

Laure Berti-Equille · 2026

Tabular Foundation Models (TFMs) achieve state-of-the-art zero-shot accuracy on small tabular datasets by meta-learning over synthetic data-generating processes -- making them highly attractive for pr…

Semantic Layers for Reliable LLM-Powered Data Analytics: A Paired Benchmark of Accuracy and Hallucination Across Three Frontier Models

Michael Rumiantsau, Ivan Fokeev · 2026

LLMs deployed for natural-language querying of analytical databases suffer from two intertwined failures (incorrect answers and confident hallucinations), both rooted in the same cause: the model is …

One Perturbation, Two Failure Modes: Probing VLM Safety via Embedding-Guided Typographic Perturbations

Ravikumar Balakrishnan, Sanket Mendapara · 2026

Typographic prompt injection exploits vision language models' (VLMs) ability to read text rendered in images, posing a growing threat as VLMs power autonomous agents. Prior work typically focuses on max…

Theoretical guarantees for stochastic gradient sampling methods via Gaussian convolution inequalities

Daniel Paulin, Peter A. Whalley · 2026

We derive first-order (in the stepsize) bounds on the bias in Wasserstein distances of the invariant measure of stochastic gradient kinetic Langevin dynamics with minimal assumptions on the stochastic…
