11,586+ open-access research outputs.
The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…
The blazar 3C 279 is well known for its rapid and large-amplitude variability. On 20 December 2013, the source exhibited an orphan {\gamma}-ray flare characterized by a flux-doubling timescale of a fe…
In the (Nesting) Bird Box Problem we are given a polygonal domain P and a number k and we want to know if there is a set B of k points inside P such that no two points in B can see each other. The und…
In many analyses the object reported at the end is not fixed in advance, but is chosen after a preliminary search over variables, subgroups, transformations, models or contrasts. Classical selective-i…
Trust in clinical artificial intelligence (AI) cannot be reduced to model accuracy, fluency of generation, or overall positive user impression. In medicine, trust must be engineered as a measurable sy…
We consider the symmetric Markov random flight $\bold X(t), \; t>0,$ in the Euclidean space $\Bbb R^m, \; m\ge 3$, performed by a particle that moves in $\Bbb R^m$ with constant finite speed and chang…
Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods re…
Knowledge distillation (KD) represents a vital mechanism to transfer expertise from complex teacher networks to efficient student models. However, in decentralized or secure AI ecosystems, privacy reg…
We propose a human in the loop approach for black-box testing of Functional Mock-up Units (FMUs) using Large Language Models (LLMs). The goal is to reduce the manual effort in defining test scenarios …
Detecting sandbagging--the deliberate underperformance on capability evaluations--is an open problem in AI safety. We tested whether symptom validity testing (SVT) logic from clinical malingering dete…
We study the problem of minimizing a multivariate polynomial function over the unit hypercube. By representing the polynomial through a hypergraph and exploiting its sparsity structure, we establish a…
Closed-source frontier labs do not disclose parameter counts, and the standard alternative -- inference economics -- carries $2\times$+ uncertainty from hardware, batching, and serving-stack assumptio…
We analyze device-dependent correlation sets generated by fixed local dichotomic measurements for two-qubit systems in the $(2,m,2)$ Bell scenario. We consider three fundamental state spaces for the c…
Cool, dense condensations such as coronal rain and prominences suggest that coronal plasma can undergo runaway radiative cooling. Connecting this behaviour to linear thermal modes requires us to fully…
Instance-sensitive losses for semantic segmentation such as blob loss and CC loss were designed to address instance imbalance, ensuring small lesions generate the same gradient as large ones, but oper…
Cloud users aim to minimize cost while maximizing performance by selecting the most suitable instance types for their workloads. To reduce expenses, spot instances have been widely adopted due to thei…
We consider a generalization of the two-body contact interaction for nonrelativistic particles confined to a one-dimensional box, in which the interaction is decentered, i.e., the particles interact o…
Multi-component natural language processing (NLP) pipelines are increasingly deployed for high-stakes decisions, yet no existing adversarial method can test their robustness under realistic conditions…
Frontier models push the boundaries of what is learnable at extreme computational costs, yet distillation via sampling reasoning traces exposes closed-source frontier models to adversarial third parti…
Graph neural networks achieve strong node-classification accuracy, but learned message passing entangles ego attributes, neighborhood smoothing, high-pass graph differences, class geometry, and classi…
Free open-access publishing with Google Scholar indexing.
Submission Guide →