Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

COLREGs Compliant Collision Avoidance and Grounding Prevention for Autonomous Marine Navigation

Mayur S. Patil, Nataraj Sudharsan, Veneela Ammula, Jude Tomdio, Jin Wang, Michael Kei, Sivakumar Rathinam, Prabhakar R. Pagilla · 2026

Maritime Autonomous Surface Ships (MASS) are increasingly regarded as a promising solution to address crew shortages, improve navigational safety, and improve operational efficiency in the maritime in…

Read Paper →

Engineering Preprint PDF DOI

$\pi$-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

Siting Wang, Xiaofeng Wang, Zheng Zhu, Minnan Pei, Xinyu Cui, Cheng Deng, Jian Zhao, Guan Huang, Haifeng Zhang, Jun Wang · 2026

Flow-based vision-language-action (VLA) models excel in embodied control but suffer from intractable likelihoods during multi-step sampling, hindering online reinforcement learning. We propose \textbf…

Read Paper →

Engineering Preprint PDF DOI

LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers

Fabian Schmidt, Karol Fedurko, Markus Enzweiler, Abhinav Valada · 2026

While multimodal large language models (MLLMs) provide advanced reasoning for autonomous driving, translating their discrete semantic knowledge into continuous trajectories remains a fundamental chall…

Read Paper →

Engineering Preprint PDF DOI

A System-of-Systems Convergence Paradigm for Societal Challenges of the Anthropocene

Megan S. Harris, Mohammad Mahdi Naderi, Ehsanoddin Ghorbanichemazkati, Sina Jangjoo, Emily Lapan, Seyed Amirreza Hosseini, Fabian Schipfer, Stephen Craig, Enayat Moallemi, Inas Khayal, Laura M. Arpan, Tian Tang, John C. Little, Amro M. Farid · 2026

Modern societal challenges, such as climate change, urbanization, and water resource management, demand integrated, multi-discipline, multi-problem approaches to frame and address their complexity. Un…

Read Paper →

Engineering Preprint PDF DOI

Co-designing a Social Robot for Newcomer Children's Cultural and Language Learning

Neil Fernandes, Tehniyat Shahbaz, Emily Davies-Robinson, Yue Hu, Kerstin Dautenhahn · 2026

Newcomer children face barriers in acquiring the host country's language and literacy programs are often constrained by limited staffing, mixed-proficiency cohorts, and short contact time. While Socia…

Read Paper →

Engineering Preprint PDF DOI

A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay

Megan S. Harris, John C. Little, Amro M. Farid · 2026

Regional watersheds are complex systems of systems encompassing hydrology, land-use decision-making, estuarine ecological feedbacks, and overlapping governance jurisdictions. Their effective managemen…

Read Paper →

Engineering Preprint PDF DOI

SSMG-Nav: Enhancing Lifelong Object Navigation with Semantic Skeleton Memory Graph

Haochen Niu, Lantao Zhang, Xingwu Ji, Rendong Ying, Peilin Liu, Fei Wen · 2026

Navigating to out-of-sight targets from human instructions in unfamiliar environments is a core capability for service robots. Despite substantial progress, most approaches underutilize reusable, pers…

Read Paper →

Engineering Preprint PDF DOI

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models

Haoyun Liu, Jianzhuang Zhao, Xinyuan Chang, Tianle Shi, Chuanzhang Meng, Jiayuan Tan, Feng Xiong, Tong Lin, Dongjie Huo, Mu Xu, SongLin Dong, Zhiheng Ma, Yihong Gong, Sheng Zhong · 2026

Despite the rapid progress of Vision-Language-Action (VLA) models, the prevailing paradigm of predicting discrete waypoints remains fundamentally misaligned with the intrinsic continuity of physical m…

Read Paper →

Engineering Preprint PDF DOI

KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

Zihao Zheng, Zhihao Mao, Maoliang Li, Jiayu Chen, Xinhao Sun, Zhaobo Zhang, Donggang Cao, Hong Mei, Xiang Chen · 2026

Vision-Language-Action (VLA) models build a token-domain robot control paradigm, yet suffer from low speed. Speculative Decoding (SD) is an optimization strategy that can boost inference speed. Two ke…

Read Paper →

Engineering Preprint PDF DOI

Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation

Yi Gu, Yanqing Liu, Chen Yang, Sheng Zhao · 2026

Text-to-audio (T2A) generation has advanced considerably in recent years, yet existing methods continue to face challenges in accurately rendering complex text prompts, particularly those involving in…

Read Paper →

Engineering Preprint PDF DOI

FATE: Closed-Loop Feasibility-Aware Task Generation with Active Repair for Physically Grounded Robotic Curricula

Bingchuan Wei, Bingqi Huang, Jingheng Ma, Zeyu zhang, Sen Cui · 2026

Recent breakthroughs in generative simulation have harnessed Large Language Models (LLMs) to generate diverse robotic task curricula, yet these open-loop paradigms frequently produce linguistically co…

Read Paper →

Engineering Preprint PDF DOI

SFCo-Nav: Efficient Zero-Shot Visual Language Navigation via Collaboration of Slow LLM and Fast Attributed Graph Alignment

Chaoran Xiong, Litao Wei, Xinhao Hu, Kehui Ma, Ziyi Xia, Zixin Jiang, Zhen Sun, Ling Pei · 2026

Recent advances in large vision-language models (VLMs) and large language models (LLMs) have enabled zero-shot approaches to visual language navigation (VLN), where an agent follows natural language i…

Read Paper →

Engineering Preprint PDF DOI

Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec

Yanzhou Ren, Noboru Harada, Daiki Takeuchi, Siyu Chen, Wei Liu, Xiao Zhang, Liyuan Zhang, Takehiro Moriya, Shoji Makino · 2026

Neural audio codec (NAC) is essential for reconstructing high-quality speech signals and generating discrete representations for downstream speech language models. However, ensuring accurate semantic …

Read Paper →

Engineering Preprint PDF DOI

ROSER: Few-Shot Robotic Sequence Retrieval for Scalable Robot Learning

Zillur Rahman, Eddison Pham, Alejandro Daniel Noel, Cristian Meo · 2026

A critical bottleneck in robot learning is the scarcity of task-labeled, segmented training data, despite the abundance of large-scale robotic datasets recorded as long, continuous interaction logs. E…

Read Paper →

Engineering Preprint PDF DOI

Mean-Flow based One-Step Vision-Language-Action

Yang Chen, Xiaoguang Ma, Bin Zhao · 2026

Recent advances in FlowMatching-based Vision-Language-Action (VLA) frameworks have demonstrated remarkable advantages in generating high-frequency action chunks, particularly for highly dexterous robo…

Read Paper →

Engineering Preprint PDF DOI

Non-Markovian Long-Horizon Robot Manipulation via Keyframe Chaining

Yipeng Chen, Wentao Tan, Lei Zhu, Fengling Li, Jingjing Li, Guoli Yang, Heng Tao Shen · 2026

Existing Vision-Language-Action (VLA) models often struggle to generalize to long-horizon tasks due to their heavy reliance on immediate observations. While recent studies incorporate retrieval mechan…

Read Paper →

Engineering Preprint PDF DOI

Jailbreaking Embodied LLMs via Action-level Manipulation

Xinyu Huang, Qiang Yang, Leming Shen, Zijing Ma, Yuanqing Zheng · 2026

Embodied Large Language Models (LLMs) enable AI agents to interact with the physical world through natural language instructions and actions. However, beyond the language-level risks inherent to LLMs …

Read Paper →

Engineering Preprint PDF DOI

Inter-Speaker Relative Cues for Two-Stage Text-Guided Target Speech Extraction

Wang Dai, Archontis Politis, Tuomas Virtanen · 2026

This paper investigates the use of relative cues for text-based target speech extraction (TSE). We first provide a theoretical justification for relative cues from the perspectives of human perception…

Read Paper →

Engineering Preprint PDF DOI

Certifiable Estimation with Factor Graphs

Zhexin Xu, Nikolas R. Sanderson, Hanna Jiamei Zhang, David M. Rosen · 2026

Factor graphs provide a convenient modular modeling language that enables practitioners to design and deploy high-performance robotic state estimation systems by composing simple, reusable building bl…

Read Paper →

Engineering Preprint PDF DOI

From Dialogue to Execution: Mixture-of-Agents Assisted Interactive Planning for Behavior Tree-Based Long-Horizon Robot Execution

Kanata Suzuki, Kazuki Hori, Haruka Miyoshi, Shuhei Kurita, Tetsuya Ogata · 2026

Interactive task planning with large language models (LLMs) enables robots to generate high-level action plans from natural language instructions. However, in long-horizon tasks, such approaches often…

Read Paper →

Browse Research Papers

COLREGs Compliant Collision Avoidance and Grounding Prevention for Autonomous Marine Navigation

$\pi$-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

LAD-Drive: Bridging Language and Trajectory with Action-Aware Diffusion Transformers

A System-of-Systems Convergence Paradigm for Societal Challenges of the Anthropocene

Co-designing a Social Robot for Newcomer Children's Cultural and Language Learning

A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay

SSMG-Nav: Enhancing Lifelong Object Navigation with Semantic Skeleton Memory Graph

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models

KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation

FATE: Closed-Loop Feasibility-Aware Task Generation with Active Repair for Physically Grounded Robotic Curricula

SFCo-Nav: Efficient Zero-Shot Visual Language Navigation via Collaboration of Slow LLM and Fast Attributed Graph Alignment

Entropy-Guided GRVQ for Ultra-Low Bitrate Neural Speech Codec

ROSER: Few-Shot Robotic Sequence Retrieval for Scalable Robot Learning

Mean-Flow based One-Step Vision-Language-Action

Non-Markovian Long-Horizon Robot Manipulation via Keyframe Chaining

Jailbreaking Embodied LLMs via Action-level Manipulation

Inter-Speaker Relative Cues for Two-Stage Text-Guided Target Speech Extraction

Certifiable Estimation with Factor Graphs

From Dialogue to Execution: Mixture-of-Agents Assisted Interactive Planning for Behavior Tree-Based Long-Horizon Robot Execution

Browse by Category

Research Type

Publish Your Research