7,788+ open-access research outputs.
The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). Hโฆ
Diffuse-interface (phase-field) models are widely used to describe multiphase mixtures and their interfacial dynamics. In multiphase settings, however, the constitutive closure should remain meaningfuโฆ
Despite the rapid progress of large vision-language models (LVLMs), fine-grained, state-conditioned GUI interaction remains challenging. Current evaluations offer limited coverage, imprecise target-stโฆ
We study square-base Calderbank--Shor--Steane (CSS) hypergraph-product codes as a finite-length class for regular high-girth quantum low-density parity-check (LDPC) design. For base matrices of small โฆ
Generative AI is being increasingly integrated into web search for the convenience it provides users. In this work, we aim to understand how generative AI disrupts web search by retrieving and presentโฆ
We present WaferSAGE, a framework for wafer defect visual question answering using small vision-language models. To address data scarcity in semiconductor manufacturing, we propose a three-stage synthโฆ
Recent progress in multimodal large language models (MLLMs) has brought AI capabilities from static offline data processing to real-time streaming interaction, yet they still remain far from human-levโฆ
We investigate the asymmetric freezing of a liquid droplet sliding on an inclined cold surface using numerical simulations based on the lubrication approximation. The combined effects of gravity, capiโฆ
As Competency-Based Education (CBE) is gaining traction around the world, the shift from marks-based assessment to qualitative competency mapping is a manual challenge for educators. This paper tackleโฆ
To usher in the next round of client AI innovation, there is an urgent need to enable efficient, lossless inference of high-accuracy large language models (LLMs) and vision language models (VLMs), joiโฆ
Reinforcement learning (RL) has become a critical paradigm for LLM post-training, yet the rollout phase -- accounting for 50--80% of total step time -- is bottlenecked by skewed generation: long-taileโฆ
Current approaches to lifelong personalization operationalize relevance through semantic proximity, causing them to miss essential user information from topically unrelated interactions. To address thโฆ
Accurate nutrient estimation from unstructured recipe text is an important yet challenging problem in dietary monitoring, due to ambiguous ingredient terminology and highly variable quantity expressioโฆ
The rapid growth of LLMs demands high-throughput, memory-capacity-intensive inference on resource-constrained edge devices, where single-batch decoding remains fundamentally memory-bound. Existing outโฆ
Objective: This study aims to investigate the influence of organ architecture (specifically the distinction between serial and parallel tissue) on the protective FLASH effect when organs are irradiateโฆ
Recommendation system has gained a large popularity for a variety of personalized suggestion tasks, but the ever-increasing number of user data makes real-time processing of recommendation systems difโฆ
Forward iteration of holomorphic self-maps generalizes the iteration of a single function in a natural way. This framework arises in complex dynamics, for instance in the study of wandering domains anโฆ
Chondrules are thought to have formed during transient flash-heating events in dust-enriched regions of the solar protoplanetary disk. Although laboratory studies have characterized the oxygen isotopiโฆ
We propose a contextual cavity/circuit QED analogue and extension of the Stern-Gerlach experiment, where the pseudo-spin of a two-state `atomic' transition plays the role of the ``spin'', while the reโฆ
Proton therapy exploits the finite range of charged particles in tissue to achieve dose distributions no photon based modality can replicate. Yet the modality reaches fewer than 1 percent of patients โฆ
Free open-access publishing with Google Scholar indexing.
Submission Guide โ