Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

Michael Ziegltrum, Jianhao Jiao, Tianhu Peng, Chengxu Zhou, Dimitrios Kanoulas · 2026

Robotic parkour provides a compelling benchmark for advancing locomotion over highly challenging terrain, including large discontinuities such as elevated steps. Recent approaches have demonstrated im…

Read Paper →

Engineering Preprint PDF DOI

AeroBridge-TTA: Test-Time Adaptive Language-Conditioned Control for UAVs

Lingxue Lyu · 2026

Language-guided unmanned aerial vehicles (UAVs) often fail not from bad reasoning or perception, but from execution mismatch: the gap between a planned trajectory and the controller's ability to tra…

Read Paper →

Engineering Preprint PDF DOI

Quantitative Verification of Finite-Time Constrained Occupation Measures for Continuous-time Stochastic Systems

Bai Xue, C.-H. Luke Ong · 2026

This paper addresses the quantitative verification of finite-time constrained occupation time for stochastic continuous-time systems governed by stochastic differential equations (SDEs). Unlike classi…

Read Paper →

Engineering Preprint PDF DOI

DAG-STL: A Hierarchical Framework for Zero-Shot Trajectory Planning under Signal Temporal Logic Specifications

Ruijia Liu, Ancheng Hou, Xiao Yu, Xiang Yin · 2026

Signal Temporal Logic (STL) is a powerful language for specifying temporally structured robotic tasks. Planning executable trajectories under STL constraints remains difficult when system dynamics and…

Read Paper →

Engineering Preprint PDF DOI

MFMDQwen: Multilingual Financial Misinformation Detection Based on Large Language Model

Zhiwei Liu, Yuyan Wang, Yuechen Jiang, Yupeng Cao, Tianlei Zhu, Xiaorui Guo, Zhiyang Deng, Zhiyuan Yao, Xiao-Yang Liu, Jimin Huang, Sophia Ananiadou · 2026

Financial misinformation poses significant threats to financial market stability and individuals' investment decisions. The multilingual environment and the inherent complexity of financial informatio…

Read Paper →

Engineering Preprint PDF DOI

EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents

Paolo Riva, Leonardo Gargani, Matteo Frosi, Matteo Matteucci · 2026

As the world of agentic artificial intelligence applied to robotics evolves, the need for agents capable of building and retrieving memories and observations efficiently is increasing. Robots operatin…

Read Paper →

Engineering Preprint PDF DOI

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

Yuan Xie, Jiaqi Song, Guang Qiu, Xianliang Wang, Kai Qiao, Junfeng Yuan, Shengqing Liu, Yi Zhang, Bowen Chen, Ming Lei, Jie Gao, Jie Wu · 2026

Integrating large language models (LLMs) into automatic speech recognition (ASR) has become a mainstream paradigm in recent years. Although existing LLM-based ASR models demonstrate impressive perform…

Read Paper →

Engineering Preprint PDF DOI

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models

Haiweng Xu, Sipeng Zheng, Hao Luo, Wanpeng Zhang, Ziheng Xi, Zongqing Lu · 2026

Recent Vision-Language-Action (VLA) models report impressive success rates on standard robotic benchmarks, fueling optimism about general-purpose physical intelligence. However, recent evidence sugges…

Read Paper →

Engineering Preprint PDF DOI

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Huakang Chen, Jingbin Hu, Liumeng Xue, Qirui Zhan, Wenhao Li, Guobin Ma, Hanke Xie, Dake Guo, Linhan Ma, Yuepeng Jiang, Bengu Wu, Pengyuan Xie, Chuan Xie, Qiang Zhang, Lei Xie · 2026

Instruction-following text-to-speech (TTS) has emerged as an important capability for controllable and expressive speech generation, yet its evaluation remains underdeveloped due to limited benchmark …

Read Paper →

Engineering Preprint PDF DOI

SpaceDex: Generalizable Dexterous Grasping in Tiered Workspaces

Wensheng Wang, Chuanjun Guo, Wei Wei, Tong Wu, Ning Tan · 2026

Generalizable grasping with high-degree-of-freedom (DoF) dexterous hands remains challenging in tiered workspaces, where occlusion, narrow clearances, and height-dependent constraints are substantiall…

Read Paper →

Engineering Preprint PDF DOI

ST-$\pi$: Structured SpatioTemporal VLA for Robotic Manipulation

Chuanhao Ma, Hanyu Zhou, Shihan Peng, Yan Li, Tao Gu, Luxin Yan · 2026

Vision-language-action (VLA) models have achieved great success on general robotic tasks, but still face challenges in fine-grained spatiotemporal manipulation. Typically, existing methods mainly embe…

Read Paper →

Engineering Preprint PDF DOI

SYMBOLIZER: Symbolic Model-free Task Planning with VLMs

Sami Azirar, Zlatan Ajanovic, Hermann Blum · 2026

Traditional Task and Motion Planning (TAMP) systems depend on physics models for motion planning and discrete symbolic models for task planning. Although physics model are often available, symbolic mo…

Read Paper →

Engineering Preprint PDF DOI

ReFineVLA: Multimodal Reasoning-Aware Generalist Robotic Policies via Teacher-Guided Fine-Tuning

Tuan Van Vo, Tan Q. Nguyen, Khang Nguyen, Nhat Xuan Tran, Duy H. M. Nguyen, An T. Le, Ngo Anh Vien, Minh Nhat Vu · 2026

Vision-Language-Action (VLA) models have gained much attention from the research community thanks to their strength in translating multimodal observations with linguistic instructions into desired rob…

Read Paper →

Engineering Preprint PDF DOI

AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models

Tingzheng Jia, Kan Guo, Lanping Qian, Yongli Hu, Daxin Tian, Guixian Qu, Chunmian Lin, Baocai Yin, Jiapu Wang · 2026

Precision-critical manipulation requires both global trajectory organization and local execution correction, yet most vision-language-action (VLA) policies generate actions within a single unified spa…

Read Paper →

Engineering Preprint PDF DOI

Trajectory-Based Optimization for Air Traffic Control in the Terminal Maneuvering Area

Yutian Pang, Daniel Delahaye, John-Paul Clarke · 2026

We present a trajectory-based optimization framework for arrival sequencing and scheduling in the terminal maneuvering area (TMA). Unlike node-link scheduling models that reduce trajectories to time-d…

Read Paper →

Engineering Preprint PDF DOI

OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL

Haoxiang Jie, Yaoyuan Yan, Xiangyu Wei, Kailin Wang, Hongjie Yan, Zhiyou Heng, Daocheng Chen · 2026

Visual-Language-Action (VLA) models represent a paradigm shift in embodied AI, yet existing frameworks often struggle with imprecise spatial perception, suboptimal multimodal fusion, and instability i…

Read Paper →

Engineering Preprint PDF DOI

Prosody as Supervision: Bridging the Non-Verbal--Verbal for Multilingual Speech Emotion Recognition

Girish, Mohd Mujtaba Akhtar, Muskaan Singh · 2026

In this work, we introduce a paralinguistic supervision paradigm for low-resource multilingual speech emotion recognition (LRM-SER) that leverages non-verbal vocalizations to exploit prosody-centric e…

Read Paper →

Engineering Preprint PDF DOI

WirelessAgent: A Unified Agent Design for General Wireless Resource Allocation Problem without Current Channel State Information

Ran Yi, Ruopeng Xu, Dongshu Zhao, Zhaoyang Zhang, Baolin Chen, Kai-Kit Wong, Hyundong Shin, Zhaohui Yang · 2026

This paper investigates the agent design for solving the wireless resource allocation problem without sufficient channel state information (CSI), which cannot be effectively solved via conventional me…

Read Paper →

Engineering Preprint PDF DOI

Think before Go: Hierarchical Reasoning for Image-goal Navigation

Pengna Li, Kangyi Wu, Shaoqing Xu, Fang Li, Lin Zhao, Long Chen, Zhi-Xin Yang, Nanning Zheng · 2026

Image-goal navigation steers an agent to a target location specified by an image in unseen environments. Existing methods primarily handle this task by learning an end-to-end navigation policy, which …

Read Paper →

Engineering Preprint PDF DOI

BizCompass: Benchmarking the Reasoning Capabilities of LLMs in Business Knowledge and Applications

Jianing Hao, Yuhe Wu, Yuanjian Xu, Shichang Meng, Shuai Yuan, Wei Zeng, Zixuan Wang, Guang Zhang · 2026

Large language models (LLMs) hold great promise for business applications, yet business analysis remains inherently complex, demanding rigorous reasoning and the integration of diverse knowledge sourc…

Read Paper →

Browse Research Papers

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

AeroBridge-TTA: Test-Time Adaptive Language-Conditioned Control for UAVs

Quantitative Verification of Finite-Time Constrained Occupation Measures for Continuous-time Stochastic Systems

DAG-STL: A Hierarchical Framework for Zero-Shot Trajectory Planning under Signal Temporal Logic Specifications

MFMDQwen: Multilingual Financial Misinformation Detection Based on Large Language Model

EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

SpaceDex: Generalizable Dexterous Grasping in Tiered Workspaces

ST-$\pi$: Structured SpatioTemporal VLA for Robotic Manipulation

SYMBOLIZER: Symbolic Model-free Task Planning with VLMs

ReFineVLA: Multimodal Reasoning-Aware Generalist Robotic Policies via Teacher-Guided Fine-Tuning

AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models

Trajectory-Based Optimization for Air Traffic Control in the Terminal Maneuvering Area

OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL

Prosody as Supervision: Bridging the Non-Verbal--Verbal for Multilingual Speech Emotion Recognition

WirelessAgent: A Unified Agent Design for General Wireless Resource Allocation Problem without Current Channel State Information

Think before Go: Hierarchical Reasoning for Image-goal Navigation

BizCompass: Benchmarking the Reasoning Capabilities of LLMs in Business Knowledge and Applications

Browse by Category

Research Type

Publish Your Research