Vikash Mishra — Research Repository

Computer Science Preprint PDF DOI

Toward a Characterization of Simulation Between Arithmetic Theories

Hunter Monroe · 2026

We study when a sound arithmetic theory $\mathcal S{\supseteq}S^1_2$ with polynomial-time decidable axioms efficiently proves the bounded consistency statements $Con_{\mathcal S{+}\phi}(n)$ for a true…

Read Paper →

AI & Data Science Preprint PDF DOI

Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs

Naomi Esposito, Anthony Tricarico, Luisa Porzio, Ali Aghazadeh Ardebili, Massimo Stella · 2026

To enhance LLMs' impact on math education, we need data on their mathematical prowess and biases across prompts. To fill this gap, we introduce MEDS (Math Education Digital Shadows) as a dataset mappi…

Read Paper →

AI & Data Science Preprint PDF DOI

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

Juergen Dietrich · 2026

Democratic discourse analysis systems increasingly rely on multi-agent LLM pipelines in which distinct evaluator models are assigned adversarial roles to generate structured, multi-perspective assessm…

Read Paper →

AI & Data Science Preprint PDF DOI

MoRFI: Monotonic Sparse Autoencoder Feature Identification

Dimitris Dimakopoulos, Shay B. Cohen, Ioannis Konstas · 2026

Large language models (LLMs) acquire most of their factual knowledge during the pre-training stage, through next token prediction. Subsequent stages of post-training often introduce new facts outwith …

Read Paper →

Computer Science Preprint PDF DOI

Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel

Yiqi Liu, Noelle Crawford, Michael Wang, Jilong Xue, Jian Huang · 2026

To overcome the well-known memory bottleneck of AI chips, 3D stacked architectures that employ advanced packaging technology with high-density through-silicon vias (TSVs) pins have proven to be a prom…

Read Paper →

AI & Data Science Preprint PDF DOI

reward-lens: A Mechanistic Interpretability Library for Reward Models

Mohammed Suhail B Nadaf · 2026

Every RLHF-trained language model is shaped by a reward model, yet the mechanistic interpretability toolkit -- logit lens, direct logit attribution, activation patching, sparse autoencoders -- was bui…

Read Paper →

AI & Data Science Preprint PDF DOI

An Investigation of Linguistic Biases in LLM-Based Recommendations

Nitin Venkateswaran, Jason Ang, Deep Adhikari, Tarun Krishna Dasari · 2026

We investigate linguistic biases in LLM-based restaurant and product recommendations given prompts varying across Southern American English (AE), Indian English (IE), and Code-Switched Hindi-English d…

Read Paper →

AI & Data Science Preprint PDF DOI

One Perturbation, Two Failure Modes: Probing VLM Safety via Embedding-Guided Typographic Perturbations

Ravikumar Balakrishnan, Sanket Mendapara · 2026

Typographic prompt injection exploits vision language models' (VLMs) ability to read text rendered in images, posing a growing threat as VLMs power autonomous agents. Prior work typically focus on max…

Read Paper →

AI & Data Science Preprint PDF DOI

Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels

Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma · 2026

Post-Training Quantization (PTQ) compresses large language models to low bit-widths using a small calibration set, and its quality depends strongly on which samples are chosen. We identify a failure m…

Read Paper →

AI & Data Science Preprint PDF DOI

Architecture Determines Observability in Transformers

Thomas Carmichael · 2026

Autoregressive transformers make confident errors, but activation monitoring can catch them only if the model preserves an internal signal that output confidence does not expose. This preservation is …

Read Paper →

AI & Data Science Preprint PDF DOI

Resource-Lean Lexicon Induction for German Dialects

Robert Litschko, Barbara Plank, Diego Frassinelli · 2026

Automatic induction of high-quality dictionaries is essential for building lexical resources, yet low-resource languages and dialects pose several challenges: limited access to annotators, high degree…

Read Paper →

AI & Data Science Preprint PDF DOI

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

Shuaizhi Cheng, Xiang Shi, Mingwei Li · 2026

Hypernetwork-based methods such as Doc-to-LoRA internalize a document into an LLM's weights in a single forward pass, but they fail systematically on conflicts: when the document contradicts pretraini…

Read Paper →

AI & Data Science Preprint PDF DOI

Supernodes and Halos: Loss-Critical Hubs in LLM Feed-Forward Layers

Audrey Cherilyn, Houman Safaai · 2026

We study the organization of channel-level importance in transformer feed-forward networks (FFNs). Using a Fisher-style loss proxy (LP) based on activation-gradient second moments, we show that loss s…

Read Paper →

Computer Science Preprint PDF DOI

An Empirical Evaluation of Locally Deployed LLMs for Bug Detection in Python Code

Jelena Ilic Vulicevic · 2026

Large language models (LLMs) have demonstrated strong performance on a wide range of software engineering tasks, including code generation and analysis. However, most prior work relies on cloud-based …

Read Paper →

AI & Data Science Preprint PDF DOI

Dharma, Data and Deception: An LLM-Powered Rhetorical Analysis of Cow-Urine Health Claims on YouTube

Sheza Munir, Ratna Kandala, Anamta Khan, Deepti, Joyojeet Pal · 2026

Health misinformation remains one of the most pressing challenges on social media, particularly when cultural traditions intersect with scientific-sounding claims. These dynamics are not only global b…

Read Paper →

Physics Preprint PDF DOI

VLTI-GRAVITY measurements of cool evolved stars: II. Pulsation properties and mass-loss process of the Mira star R Car and the red supergiant VX Sgr

D. Jadlovsky, M. Wittkowski, A. Chiavassa, K. Kravchenko, B. Freytag, S. Hofner, J. Krticka, C. Paladini, G. Rau, M. Broz, T. Granzer, M. Weber · 2026

The mass-loss process of red supergiant (RSG) and asymptotic giant branch (AGB) stars and its relation to variability is poorly constrained. We study two evolved stars, the Mira-type AGB star R Car an…

Read Paper →

AI & Data Science Preprint PDF DOI

Serialisation Strategy Matters: How FHIR Data Format Affects LLM Medication Reconciliation

Sanjoy Pator · 2026

Medication reconciliation at clinical handoffs is a high-stakes, error-prone process. Large language models are increasingly proposed to assist with this task using FHIR-structured patient records, bu…

Read Paper →

AI & Data Science Preprint PDF DOI

Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales

Samuel Salfati · 2026

We present a systematic empirical study of transformer compression through over 40 experiments on GPT-2 (124M parameters) and Mistral 7B (7.24B parameters). Our analysis covers spectral compression, b…

Read Paper →

Computer Science Preprint PDF DOI

Omission Constraints Decay While Commission Constraints Persist in Long-Context LLM Agents

Yeran Gamage · 2026

LLM agents deployed in production operate under operator-defined behavioral policies (system-prompt instructions such as prohibitions on credential disclosure, data exfiltration, and unauthorized outp…

Read Paper →

AI & Data Science Preprint PDF DOI

Can Continual Pre-training Bridge the Performance Gap between General-purpose and Specialized Language Models in the Medical Domain?

Niclas Doll, Jasper Schulze Buschhoff, Shalaka Satheesh, Hammam Abdelwahab, Hector Allende-Cid, Katrin Klug · 2026

This paper narrows the performance gap between small, specialized models and significantly larger general-purpose models through domain adaptation via continual pre-training and merging. We address th…

Read Paper →

Browse Research Papers

Toward a Characterization of Simulation Between Arithmetic Theories

Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

MoRFI: Monotonic Sparse Autoencoder Feature Identification

Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel

reward-lens: A Mechanistic Interpretability Library for Reward Models

An Investigation of Linguistic Biases in LLM-Based Recommendations

One Perturbation, Two Failure Modes: Probing VLM Safety via Embedding-Guided Typographic Perturbations

Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels

Architecture Determines Observability in Transformers

Resource-Lean Lexicon Induction for German Dialects

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

Supernodes and Halos: Loss-Critical Hubs in LLM Feed-Forward Layers

An Empirical Evaluation of Locally Deployed LLMs for Bug Detection in Python Code

Dharma, Data and Deception: An LLM-Powered Rhetorical Analysis of Cow-Urine Health Claims on YouTube

VLTI-GRAVITY measurements of cool evolved stars: II. Pulsation properties and mass-loss process of the Mira star R Car and the red supergiant VX Sgr

Serialisation Strategy Matters: How FHIR Data Format Affects LLM Medication Reconciliation

Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales

Omission Constraints Decay While Commission Constraints Persist in Long-Context LLM Agents

Can Continual Pre-training Bridge the Performance Gap between General-purpose and Specialized Language Models in the Medical Domain?

Browse by Category

Research Type

Publish Your Research