17,620+ open-access research outputs.
This paper examines how different types of large language model (LLM) agents perform on scientific visualization (SciVis) tasks, where users generate visualization workflows from natural-language inst…
Despite the rapid progress of large vision-language models (LVLMs), fine-grained, state-conditioned GUI interaction remains challenging. Current evaluations offer limited coverage, imprecise target-st…
Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone canno…
In heterogeneous network systems such as ecological and social networks, structural stability depends on how connectivity changes under node removal, as different removal sequences can trigger distinc…
While GUI agents have shown impressive capabilities in common computer-use tasks such as OSWorld, current benchmarks mainly focus on isolated and single-application tasks. This overlooks a critical re…
We study the celestial three-gluon amplitude in a dilaton background through the Mellin-Liouville formulation proposed by Stieberger, Taylor and Zhu (STZ). The original map contains an ambiguity in th…
The hubness problem, in which hub embeddings are close to many unrelated examples, occurs often in high-dimensional embedding spaces and may pose a practical threat for purposes such as information re…
Conventional push-broom hyperspectral imaging suffers from slow acquisition speeds, precluding real-time object detection; in contrast, snapshot spectral imaging enables instantaneous hyperspectral im…
We define a torus $U \subset T = (\mathbb{C}^\times)^K$ which acts on the $\Delta$-Springer varieties $Y_{n,\lambda,s}$ defined by Griffin-Levinson-Woo and give a Borel-style presentation for the equi…
This paper develops the numerical inverse scattering transform (NIST) framework for the coupled modified Korteweg-de Vries (mKdV) equation based on its associated Riemann-Hilbert problem. The coupled …
Understanding human actions is critical for advancing behavior analysis in human-robot interaction. Particularly in tasks that demand quick and proactive feedback, robots must recognize human actions …
Modeling 4D human-object interaction (HOI) is a compelling challenge in computer vision and an essential technology powering virtual and mixed-reality applications. While existing works have achieved …
We report the detection of linear polarization in the radio afterglow of GRB 260310A, representing the first centimeter-wavelength polarization detection of a gamma-ray burst (GRB) afterglow and the f…
This paper investigates the continuous-time counterpart of the Q-function for entropy-regularized mean-field control (MFC) with controlled common noise, coined as q-function by Jia and Zhou (2023) in …
Hyperspectral image super-resolution is essential for enhancing the spatial fidelity of HSI data, yet existing deep learning methods often struggle with substantial spectral redundancy and the limited…
Hyperspectral image (HSI) and SAR/LiDAR data offer complementary spectral and structural information for land-cover classification. However, their effective fusion remains challenging due to two major…
Computer-use agents provide a promising path toward general software automation because they can interact directly with arbitrary graphical user interfaces instead of relying on brittle, application-s…
We study properties of the following four classes of operators on the Fock space in $\mathbb C^n:$ 1) weakly localized operators; 2) sufficiently localized operators in the sense of Xia and Zheng; 3) …
We present a detailed study of radio-detected dwarf galaxies (with stellar masses less than 3 billion solar masses) to characterize extreme star formation and search for (variable) radio AGNs. Our sam…
In this paper, we prove several rigidity results for complete noncompact manifolds with nonnegative intermediate curvatures. We show that when either $3\leq n\leq 5$, $1\leq m\leq n-1$, or $6\leq n\le…
Free open-access publishing with Google Scholar indexing.
Submission Guide →