Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

Zihao Zheng, Zhihao Mao, Sicheng Tian, Maoliang Li, Jiayu Chen, Xinhao Sun, Zhaobo Zhang, Xuanzhe Liu, Donggang Cao, Hong Mei, Xiang Chen · 2026

Vision-Language-Action (VLA) Models have become the mainstream solution for robot control, but suffer from slow inference speeds. Speculative Decoding (SD) is a promising acceleration method which can…

Read Paper →

Engineering Preprint PDF DOI

KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition

Gaoge Han, Zhengqing Gao, Ziwen Li, Jiaxin Huang, Shaoli Huang, Fakhri Karray, Mingming Gong, Tongliang Liu · 2026

In this paper, we introduce a novel kinematics-rich vision-language-action (VLA) task, in which language commands densely encode diverse kinematic attributes (such as direction, trajectory, orientatio…

Read Paper →

Engineering Preprint PDF DOI

Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

Tharun Sethuraman, Subham Agrawal, Nils Dengler, Jorge de Heuvel, Teena Hassan, Maren Bennewitz · 2026

Robots operating in human-shared environments must not only achieve task-level navigation objectives such as safety and efficiency, but also adapt their behavior to human preferences. However, as huma…

Read Paper →

Engineering Preprint PDF DOI

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

Tianfu Li, Wenbo Chen, Haoxuan Xu, Xinhu Zheng, Haoang Li · 2026

In Vision-and-Language Navigation (VLN), an agent is required to plan a path to the target specified by the language instruction, using its visual observations. Consequently, prevailing VLN methods pr…

Read Paper →

Engineering Preprint PDF DOI

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

Kehan Chen, Yan Huang, Dong An, Jiawei He, Yifei Su, Jing Liu, Nianfeng Liu, Liang Wang · 2026

Existing Vision-Language Navigation (VLN) task requires agents to follow verbose instructions, ignoring some potentially useful global spatial priors, limiting their capability to reason about spatial…

Read Paper →

Engineering Preprint PDF DOI

OmniVLN: Omnidirectional 3D Perception and Token-Efficient LLM Reasoning for Visual-Language Navigation across Air and Ground Platforms

Zhongyuang Liu, Min He, Shaonan Yu, Xinhang Xu, Muqing Cao, Jianping Li, Jianfei Yang, Lihua Xie · 2026

Language-guided embodied navigation requires an agent to interpret object-referential instructions, search across multiple rooms, localize the referenced target, and execute reliable motion toward it.…

Read Paper →

Engineering Preprint PDF DOI

FastLoop: Parallel Loop Closing with GPU-Acceleration in Visual SLAM

Soudabeh Mohammadhashemi, Shishir Gopinath, Kimia Khabiri, Parsa Hosseininejad, Karthik Dantu, Steven Y. Ko · 2026

Visual SLAM systems combine visual tracking with global loop closure to maintain a consistent map and accurate localization. Loop closure is a computationally expensive process as we need to search ac…

Read Paper →

Engineering Preprint PDF DOI

Shielded Reinforcement Learning Under Dynamic Temporal Logic Constraints

Sad{i}k Bera Yuksel, Ali Tevfik Buyukkocak, Derya Aksaray · 2026

Reinforcement Learning (RL) has shown promise in various robotics applications, yet its deployment on real systems is still limited due to safety and operational constraints. The safe RL field has gai…

Read Paper →

Engineering Preprint PDF DOI

ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K

Kaixuan Wang, Tianxing Chen, Jiawei Liu, Honghao Su, Shaolong Zhu, Minxuan Wang, Zixuan Li, Yue Chen, Huan-ang Gao, Yusen Qin, Jiawei Wang, Qixuan Zhang, Lan Xu, Jingyi Yu, Yao Mu, Ping Luo · 2026

Learning in simulation provides a useful foundation for scaling robotic manipulation capabilities. However, this paradigm often suffers from a lack of data-generation-ready digital assets, in both sca…

Read Paper →

Engineering Preprint PDF DOI

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation

Abhay Deshpande, Maya Guru, Rose Hendrix, Snehal Jauhri, Ainaz Eftekhar, Rohun Tripathi, Max Argus, Jordi Salvador, Haoquan Fang, Matthew Wallingford, Wilbert Pumacay, Yejin Kim, Quinn Pfeifer, Ying-Chun Lee, Piper Wolters, Omar Rayyan, Mingtong Zhang, Jiafei Duan, Karen Farley, Winson Han, Eli Vanderbilt, Dieter Fox, Ali Farhadi, Georgia Chalvatzaki, Dhruv Shah, Ranjay Krishna · 2026

A prevailing view in robot learning is that simulation alone is not enough; effective sim-to-real transfer is widely believed to require at least some real-world data collection or task-specific fine-…

Read Paper →

Engineering Preprint PDF DOI

DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models

Emily Yue-Ting Jia, Weiduo Yuan, Tianheng Shi, Vitor Guizilini, Jiageng Mao, Yue Wang · 2026

Robotic manipulation requires sophisticated commonsense reasoning, a capability naturally possessed by large-scale Vision-Language Models (VLMs). While VLMs show promise as zero-shot planners, their l…

Read Paper →

Engineering Preprint PDF DOI

Overlapping Covariance Intersection: Fusion with Partial Structural Knowledge of Correlation from Multiple Sources

Leonardo Pedroso, Pedro Batista, W.P.M.H. Heemels · 2026

Emerging large-scale engineering systems rely on distributed fusion for situational awareness, where agents combine noisy local sensor measurements with exchanged information to obtain fused estimates…

Read Paper →

Engineering Preprint PDF DOI

When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making

Jun Liu, Pu Zhao, Zhenglun Kong, Xuan Shen, Peiyan Dong, Fan Yang, Lin Cui, Hao Tang, Geng Yuan, Wei Niu, Wenbin Zhang, Xue Lin, Gaowen Liu, Yanzhi Wang, Dong Huang · 2026

Embodied robotic systems increasingly rely on large language model (LLM)-based agents to support high-level reasoning, planning, and decision-making during interactions with the environment. However, …

Read Paper →

Engineering Preprint PDF DOI

Routing and Control for Marine Oil-Spill Cleanup with a Boom-Towing Vessel Fleet

Snir Carmeli, Adir Morgan, Kiril Solovey · 2026

Marine oil spills damage ecosystems, contaminate coastlines, and disrupt food webs, while imposing substantial economic losses on fisheries and coastal communities. Prior work has demonstrated the fea…

Read Paper →

Engineering Preprint PDF DOI

Scalable Inspection Planning via Flow-based Mixed Integer Linear Programming

Adir Morgan, Kiril Solovey, Oren Salzman · 2026

Inspection planning is concerned with computing the shortest robot path to inspect a given set of points of interest (POIs) using the robot's sensors. This problem arises in a wide range of applicatio…

Read Paper →

Engineering Preprint PDF DOI

Rewarding DINO: Predicting Dense Rewards with Vision Foundation Models

Pierre Krack, Tobias Julg, Wolfram Burgard, Florian Walter · 2026

Well-designed dense reward functions in robot manipulation not only indicate whether a task is completed but also encode progress along the way. Generally, designing dense rewards is challenging and u…

Read Paper →

Engineering Preprint PDF DOI

Enabling Dynamic Tracking in Vision-Language-Action Models via Time-Discrete and Time-Continuous Velocity Feedforward

Johannes Hechtl, Philipp Schmitt, Georg von Wichert, Wolfram Burgard · 2026

While vision-language-action (VLA) models have shown great promise for robot manipulation, their deployment on rigid industrial robots remains challenging due to the inherent trade-off between complia…

Read Paper →

Engineering Preprint PDF DOI

Physics-guided diffusion models for inverse design of disordered metamaterials

Ziyuan Xie, Weipeng Xu, Dazhi Zhao, Wenchang Zhang, Daoyang Dong, Bingbing Xu, Ning Liu, Sheng Mao, Tianju Xue · 2026

Disordered metamaterials are promising for programming physical properties across diverse applications, yet their inverse design remains challenging due to the non-intuitive structure-property relatio…

Read Paper →

Engineering Preprint PDF DOI

PanguMotion: Continuous Driving Motion Forecasting with Pangu Transformers

Quanhao Ren, Yicheng Li, Nan Song · 2026

Motion forecasting is a core task in autonomous driving systems, aiming to accurately predict the future trajectories of surrounding agents to ensure driving safety. Existing methods typically process…

Read Paper →

Engineering Preprint PDF DOI

Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction

Mengyuan Li, Qianfan Lu, Jiachen Tian, Hongjun Hu, Yu Han, Xiao Li, Chao-kai Wen, Shi Jin · 2026

In near-field extremely large-scale multiple-input multiple-output (XL-MIMO) systems, spherical wavefront propagation expands the traditional beam codebook into the joint angular-distance domain, rend…

Read Paper →

Browse Research Papers

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition

Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

OmniVLN: Omnidirectional 3D Perception and Token-Efficient LLM Reasoning for Visual-Language Navigation across Air and Ground Platforms

FastLoop: Parallel Loop Closing with GPU-Acceleration in Visual SLAM

Shielded Reinforcement Learning Under Dynamic Temporal Logic Constraints

ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation

DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models

Overlapping Covariance Intersection: Fusion with Partial Structural Knowledge of Correlation from Multiple Sources

When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making

Routing and Control for Marine Oil-Spill Cleanup with a Boom-Towing Vessel Fleet

Scalable Inspection Planning via Flow-based Mixed Integer Linear Programming

Rewarding DINO: Predicting Dense Rewards with Vision Foundation Models

Enabling Dynamic Tracking in Vision-Language-Action Models via Time-Discrete and Time-Continuous Velocity Feedforward

Physics-guided diffusion models for inverse design of disordered metamaterials

PanguMotion: Continuous Driving Motion Forecasting with Pangu Transformers

Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction

Browse by Category

Research Type

Publish Your Research