4,943+ open-access research outputs.
Due to the scarcity of large-scale in-the-wild triplet data and the improper use of masks, the performance of video virtual try-on models remains limited. In this paper, we first introduce **TripVVT-1…
Mixture-of-Experts (MoE) architectures in Large Language Models (LLMs) have significantly reduced inference costs through sparse activation. However, this sparse activation paradigm also introduces ne…
We introduce equivariant localization as a method for computing the action of probe branes in supergravity backgrounds. We apply it to supersymmetric probe D3-branes in type IIB supersymmetric spaceti…
We introduce an explicit class of tempered Laurent polynomials in the sense of Villegas and Doran--Kerr in $n \leqslant 4$ variables including all Landau--Ginzburg models for smooth Fano threefolds wi…
Extreme Mass Ratio Inspirals (EMRIs) are among the key targe sources for the space-based gravitational wave (GW) detectors. The waveforms of the EMRIs are highly sensitive to the types of the central …
ECLAIRs is a hard X-ray coded-mask telescope onboard the SVOM space mission, designed to detect and localize high-energy transients, in particular gamma-ray bursts. Operating over the 4-150 keV energy…
Large diffusion transformers (DiTs) follow global editing instructions well but consistently leak local edits into unrelated regions, because joint-attention architectures offer no explicit channel te…
Analog circuit design relies heavily on reusing existing intellectual property (IP), yet searching across heterogeneous representations such as SPICE netlists, schematics, and functional descriptions …
Coordinated movement and self-organisation of active self-driven agents is common in nature and is seen across different scales, from herds of animals to collective motion in bacteria. Often, these sy…
While 3D Gaussian Splatting (3DGS) achieves real-time photorealistic rendering, its performance degrades significantly when training images contain transient objects that violate multi-view consistenc…
6D object pose estimation in cluttered scenes remains challenging due to severe occlusion and sensor noise. We propose MAPRPose, a two-stage framework that leverages mask-aware correspondences for pos…
Modern distributed systems produce massive, heterogeneous logs essential for reliability, security, and anomaly detection. Converting these free-form messages into structured templates (log parsing) i…
The standard cosmological model is challenged by an ever-growing collection of observations, which invites (and stimulates) inquiry into possible additions and/or alterations. One such alteration come…
World models derived from large-scale video generative pre-training have emerged as a promising paradigm for generalist robot policy learning. However, standard approaches often focus on high-fidelity…
Masked diffusion language models such as LLaDA2.1 rely on Token-to-Token (T2T) editing to correct their own generation errors: whenever a different token crosses a confidence threshold, the committed …
Millimeter-wave (mmWave) technology is a crucial enabler for next-generation networks because it offers substantially greater available bandwidth. mmWave multiple-input multiple-output (MIMO) systems …
The ensemble Kalman filter (EnKF) is widely used for nonlinear and high-dimensional state estimation because it replaces complex covariance propagation with simple ensemble statistics. However, conven…
3D editing refers to the ability to apply local or global modifications to 3D assets. Effective 3D editing requires maintaining semantic consistency by performing localized changes according to prompt…
Style transfer aims to render a content image with the visual characteristics of a reference style while preserving its underlying semantic layout and structural geometry. While recent diffusion-based…
Visual Foundation Models (VFMs) such as the Segment Anything Model (SAM) have significantly advanced broad use of image segmentation. However, SAM and its variants necessitate substantial manual effor…
Free open-access publishing with Google Scholar indexing.
Submission Guide →