Jia Wang — Research Repository

Engineering Preprint PDF DOI

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models

Hao Chen, Jiaming Liu, Zhonghao Yan, Nuowei Han, Renrui Zhang, Chenyang Gu, Jialin Gao, Ziyu Guo, Siyuan Qian, Yinxi Wang, Peng Jia, Chi-Wing Fu, Shanghang Zhang, Pheng-Ann Heng · 2026

Vision-Language-Action (VLA) models have increasingly incorporated reasoning mechanisms for complex robotic manipulation. However, existing approaches share a critical limitation: whether employing ex…

Read Paper →

Computer Science Preprint PDF DOI

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

Yanting Wang, Chenlong Yin, Ying Chen, Jinyuan Jia · 2026

Long-context large language models (LLMs)-for example, Gemini-3.1-Pro and Qwen-3.5-are widely used to empower many real-world applications, such as retrieval-augmented generation, autonomous agents, a…

Read Paper →

AI & Data Science Preprint PDF DOI

Global Optimality for Constrained Exploration via Penalty Regularization

Florian Wolf, Ilyas Fatkhullin, Niao He · 2026

Efficient exploration is a central problem in reinforcement learning and is often formalized as maximizing the entropy of the state-action occupancy measure. While unconstrained maximum-entropy explor…

Read Paper →

AI & Data Science Preprint PDF DOI

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin · 2026

The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…

Read Paper →

AI & Data Science Preprint PDF DOI

Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

Junqi Gao, Dazhi Zhang, Zhichang Guo, Biqing Qi, Yi Ran, Wangmeng Zuo · 2026

Model merging has attracted attention as an effective path toward multi-task adaptation by integrating knowledge from multiple task-specific models. Among existing approaches, dynamic merging mitigate…

Read Paper →

Engineering Preprint PDF DOI

Hierarchical Control for Continuous-time Systems via General Approximate Alternating Simulation Relations

Zhiyuan Huang, Shuo Li, Murat Arcak, Majid Zamani, Bingzhuo Zhong · 2026

This paper introduces a general approximate alternating simulation relation (\emph{$\varepsilon$-gAAS relation}) for continuous-time systems, which relaxes existing simulation relations to tolerate la…

Read Paper →

Computer Science Preprint PDF DOI

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

Muhammad Ihsan Al Hafiz, Artur Podobas · 2026

Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchron…

Read Paper →

Physics Preprint PDF DOI

Multimode grating couplers via foundry-compliant inverse design

Hao Li, Nazar Pyvovar, Zhaowei Dai, Owen D. Miller · 2026

We apply a systematic inverse design approach to discover foundry-compliant, multilayer grating couplers that can efficiently couple a number of independent waves from free space to on-chip propagatin…

Read Paper →

Physics Preprint PDF DOI

Branch-Resolved Characterization of Feed-Forward Error in Dynamic Teleportation via Classical Choi Shadows

Mason Edwards, Prabhat Mishra · 2026

Mid-circuit measurement and classical feed-forward are essential primitives for dynamic-circuit teleportation on superconducting quantum processors. However, the error associated with measurement-cond…

Read Paper →

AI & Data Science Preprint PDF DOI

Faster 3D Gaussian Splatting Convergence via Structure-Aware Densification

Linjie Lyu, Ayush Tewari, Jianchun Chen, Thomas Leimkuhler, Christian Theobalt · 2026

3D Gaussian Splatting has emerged as a powerful scene representation for real-time novel-view synthesis. However, its standard adaptive density control relies on screen-space positional gradients, whi…

Read Paper →

Physics Preprint PDF DOI

Learning quantum disentanglement scheduling from reduced states via modular hybrid policies

Y.-X. Xiao, J.-Z. Han, Z. Zheng, Z.-H. Zhang, M. Xue, J. Li, X. Lv · 2026

Quantum control with restricted state access is central to near-term quantum devices, where full wave-function information is unavailable. We study this problem through multiqubit disentanglement sche…

Read Paper →

Physics Preprint PDF DOI

Applications of 1.4 GHz diagnostics to Type Ia Supernova host galaxies

S. Ramaiya, M. J. Jarvis, M. Vincenzi, M. Sullivan, I. H. Whittam · 2026

Type Ia supernova (SN Ia) standardisation parameters exhibit evidence for systematic variation across the host galaxy star-formation rate - stellar mass (SFR$-M_\star$) plane, motivating the incorpora…

Read Paper →

Computer Science Preprint PDF DOI

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Ugo Dal Lago, Guido Fiorillo, Paolo Pistone · 2026

The problem of determining whether a probabilistic program terminates almost surely (i.e.~with probability one) is undecidable, and actually $\Pi^0_2$-complete. For this reason, a growing literature h…

Read Paper →

Computer Science Preprint PDF DOI

Distributed Santa Claus via Global Rounding

Tijn de Vos, Leo Wennmann, Malte Baumecker, Yannic Maus, Florian Schager · 2026

In this paper, we consider the Santa Claus problem in the CONGEST model. This NP-hard problem can be modeled as a bipartite graph of children and gifts where an edge indicates that a child desires a g…

Read Paper →

AI & Data Science Preprint PDF DOI

Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity Reconstruction

Shipeng Liu, Liang Zhao, Dengfeng Chen, Zhanping Song · 2026

Tunnel inspection requires outputs that can support defect localization, measurement, severity grading, and engineering documentation. Existing training-free foundation-model pipelines usually stop at…

Read Paper →

Mathematics Preprint PDF DOI

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

Armin Gie{ss}ler, Felix Thommes, Soren Hohmann · 2026

This paper studies data-driven approaches to the continuous-time linear quadratic regulator (LQR) problem based on two existing parameterizations, namely a closed-loop (CL) parameterization from behav…

Read Paper →

AI & Data Science Preprint PDF DOI

From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction

Alex Petrov, Alexander Gusak, Denis Mukha, Dima Korolev · 2026

Persistent AI memory is often reduced to a retrieval problem: store prior interactions as text, embed them, and ask the model to recover relevant context later. This design is useful for thematic reca…

Read Paper →

Mathematics Preprint PDF DOI

Time-dependent Robin heat equation via Markovian switching

Fausto Colantoni · 2026

This paper investigates the heat equation on a bounded domain with a Robin boundary condition, where the reactivity parameter (or killing rate) is modeled as a continuous-time Markov chain. We analyze…

Read Paper →

Physics Preprint PDF DOI

Probing mass inflation in polymerized vacuum regular black holes via colliding null shells

Hongguang Liu, Ioannis Soranidis · 2026

We derive a class of inner-extremal regular black hole solutions characterized by a degenerate inner horizon. These geometries arise as polymerized vacuum configurations inspired by loop quantum gravi…

Read Paper →

Mathematics Preprint PDF DOI

Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

Max Lovig · 2026

In modern parametric model training, full-batch gradient descent (and its variants) suffers due to progressively stronger biasing towards the exact realization of training data; this drives the system…

Read Paper →

Browse Research Papers

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

Global Optimality for Constrained Exploration via Penalty Regularization

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

Hierarchical Control for Continuous-time Systems via General Approximate Alternating Simulation Relations

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

Multimode grating couplers via foundry-compliant inverse design

Branch-Resolved Characterization of Feed-Forward Error in Dynamic Teleportation via Classical Choi Shadows

Faster 3D Gaussian Splatting Convergence via Structure-Aware Densification

Learning quantum disentanglement scheduling from reduced states via modular hybrid policies

Applications of 1.4 GHz diagnostics to Type Ia Supernova host galaxies

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Distributed Santa Claus via Global Rounding

Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity Reconstruction

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction

Time-dependent Robin heat equation via Markovian switching

Probing mass inflation in polymerized vacuum regular black holes via colliding null shells

Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

Browse by Category

Research Type

Publish Your Research