Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

VP-VLA: Visual Prompting as an Interface for Vision-Language-Action Models

Zixuan Wang, Yuxin Chen, Yuqi Liu, Jinhui Ye, Pengguang Chen, Changsheng Lu, Shu Liu, Jiaya Jia · 2026

Vision-Language-Action (VLA) models typically map visual observations and linguistic instructions directly to robotic control signals. This "black-box" mapping forces a single forward pass to simultan…

IGV-RRT: Prior-Real-Time Observation Fusion for Active Object Search in Changing Environments

Wei Zhang, Ping Gong, Yujie Wang, Minghui Bai, Rongfeng Ye, Yinchuan Wang, Yachao Wang, Leilei Yao, Teng Chen, Chen Sun, Chaoqun Wang · 2026

Object Goal Navigation (ObjectNav) in temporally changing indoor environments is challenging because object relocation can invalidate historical scene knowledge. To address this issue, we propose a pr…

Can a Robot Walk the Robotic Dog: Triple-Zero Collaborative Navigation for Heterogeneous Multi-Agent Systems

Yaxuan Wang, Yifan Xiang, Ke Li, Xun Zhang, BoWen Ye, Zhuochen Fan, Fei Wei, Tong Yang · 2026

We present Triple Zero Path Planning (TZPP), a collaborative framework for heterogeneous multi-robot systems that requires zero training, zero prior knowledge, and zero simulation. TZPP employs a coor…

Simple Trajectory Smoothing for UAV Reference Path Planning Based on Decoupling, Spatial Modeling and Linear Programming

Mogens Plessen · 2026

A method for trajectory smoothing for UAV reference path planning is presented. It is derived based on the dynamics of a Dubins airplane model, and involves a decoupling step, spatial modeling and lin…

SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems

Weizhe Xu, Mengyu Liu, Fanxin Kong · 2026

Large Language Models (LLMs), deep learning architectures with typically over 10 billion parameters, have recently begun to be integrated into various cyber-physical systems (CPS) such as robotics, in…

DyGeoVLN: Infusing Dynamic Geometry Foundation Model into Vision-Language Navigation

Xiangchen Liu, Hanghan Zheng, Jeil Jeong, Minsung Yoon, Lin Zhao, Zhide Zhong, Haoang Li, Sung-Eui Yoon · 2026

Vision-language Navigation (VLN) requires an agent to understand visual observations and language instructions to navigate in unseen environments. Most existing approaches rely on static scene assumpt…

Dynamic Control Barrier Function Regulation with Vision-Language Models for Safe, Adaptive, and Realtime Visual Navigation

Jeffrey Chen, Rohan Chandra · 2026

Robots operating in dynamic, unstructured environments must balance safety and efficiency under potentially limited sensing. While control barrier functions (CBFs) provide principled collision avoidan…

Approximate Dynamic Programming for Degradation-aware Market Participation of Battery Energy Storage Systems: Bridging Market and Degradation Timescales

Flemming Holtorf, Sungho Shin · 2026

We present an approximate dynamic programming framework for designing degradation-aware market participation policies for battery energy storage systems. The approach employs a tailored value function…

Exploiting Self-Sustainable Information-Bearing RIS in Underlay CR-NOMA Networks

Zeyang Sun, Shuai Han, Chenyu Wu, Sai Xu, Yuanwei Liu · 2026

Information-bearing reconfigurable intelligent surfaces (IB-RIS) provide a promising solution to self-sustainable and green communications by harvesting ambient radio frequency energy while embedding …

Koopman-Based Linear MPC for Safe Control using Control Barrier Functions

Shuo Liu, Liang Wu, Dawei Zhang, Jan Drgona, Calin A. Belta · 2026

This paper proposes a Koopman-based linear model predictive control (LMPC) framework for safety-critical control of nonlinear discrete-time systems. Existing MPC formulations based on discrete-time co…

Cortical Policy: A Dual-Stream View Transformer for Robotic Manipulation

Xuening Zhang, Qi Lv, Xiang Deng, Miao Zhang, Xingbo Liu, Liqiang Nie · 2026

View transformers process multi-view observations to predict actions and have shown impressive performance in robotic manipulation. Existing methods typically extract static visual representations in …

Swim2Real: VLM-Guided System Identification for Sim-to-Real Transfer

Kevin Qiu, Kyle Walker, Mike Y. Michelis, Marek Cygan, Josie Hughes · 2026

We present Swim2Real, a pipeline that calibrates a 16-parameter robotic fish simulator from swimming videos using vision-language model (VLM) feedback, requiring no hand-designed search stages. Calibr…

ROI-Driven Foveated Attention for Unified Egocentric Representations in Vision-Language-Action Systems

Xinhai Sun, Xiang Shi, Menglin Zou, Wenlong Huang · 2026

The development of embodied AI systems is increasingly constrained by the availability and structure of physical interaction data. Despite recent advances in vision-language-action (VLA) models, curre…

E-SocialNav: Efficient Socially Compliant Navigation with Language Models

Ling Xiao, Daeun Song, Xuesu Xiao, Toshihiko Yamasaki · 2026

Language models (LMs) are increasingly applied to robotic navigation; however, existing benchmarks primarily emphasize navigation success rates while paying limited attention to social compliance. Mor…

StageCraft: Execution Aware Mitigation of Distractor and Obstruction Failures in VLA Models

Kartikay Milind Pangaonkar, Prabin Rath, Omkar Patil, Nakul Gopalan · 2026

Large scale pre-training on text and image data along with diverse robot demonstrations has helped Vision Language Action models (VLAs) to generalize to novel tasks, objects and scenes. However, these…

OmniCodec: Low Frame Rate Universal Audio Codec with Semantic-Acoustic Disentanglement

Jingbin Hu, Haoyu Zhang, Dake Guo, Qirui Zhan, Wenhao Li, Huakang Chen, Guobin Ma, Hanke Xie, Chengyou Wang, Pengyuan Xie, Chuan Xie, Qiang Zhang, Lei Xie · 2026

Large Language Models (LLMs) have advanced audio generation through discrete representation learning. However, most existing neural codecs focus on speech and emphasize reconstruction fidelity, overlo…

Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models

Zhilong Zhang, Haoxiang Ren, Yihao Sun, Yifei Sheng, Haonan Wang, Haoxin Lin, Zhichao Wu, Pierre-Luc Bacon, Yang Yu · 2026

Vision-Language-Action (VLA) models show strong generalization for robotic control, but finetuning them with reinforcement learning (RL) is constrained by the high cost and safety risks of real-world …

LASER: Level-Based Asynchronous Scheduling and Execution Regime for Spatiotemporally Constrained Multi-Robot Timber Manufacturing

Zhenxiang Huang, Lior Skoury, Tim Stark, Aaron Wagner, Hans Jakob Wagner, Thomas Wortmann, Achim Menges · 2026

Automating large-scale manufacturing in domains like timber construction requires multi-robot systems to manage tightly coupled spatiotemporal constraints, such as collision avoidance and process-driv…

Performance Guarantees for Data-Driven Sequential Decision-Making

Bowen Li, Edwin K. P. Chong, Ali Pezeshki · 2026

The solutions to many sequential decision-making problems are characterized by dynamic programming and Bellman's principle of optimality. However, due to the inherent complexity of solving Bellman's e…

Memory Over Maps: 3D Object Localization Without Reconstruction

Rui Zhou, Xander Yap, Jianwen Cao, Allison Lau, Boyang Sun, Marc Pollefeys · 2026

Target localization is a prerequisite for embodied tasks such as navigation and manipulation. Conventional approaches rely on constructing explicit 3D scene representations to enable target localizati…
