Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio

Yuanjian Chen, Yang Xiao, Han Yin, Xubo Liu, Jinjie Huang, Ting Dang · 2026

Large Audio Language Models (LALMs) are increasingly capable of reasoning over audio. However, existing benchmarks provide limited coverage of reasoning in polyphonic audio, where multiple sound event…

SeedPolicy: Horizon Scaling via Self-Evolving Diffusion Policy for Robot Manipulation

Youqiang Gui, Yuxuan Zhou, Shen Cheng, Xinyang Yuan, Haoqiang Fan, Peng Cheng, Shuaicheng Liu · 2026

Imitation Learning (IL) enables robots to acquire manipulation skills from expert demonstrations. Diffusion Policy (DP) models multi-modal expert behaviors but suffers performance degradation as obser…

Direct Contact-Tolerant Motion Planning With Vision Language Models

He Li, Jian Sun, Chengyang Li, Guoliang Li, Qiyu Ruan, Shuai Wang, Chengzhong Xu · 2026

Navigation in cluttered environments often requires robots to tolerate contact with movable or deformable objects to maintain efficiency. Existing contact-tolerant motion planning (CTMP) methods rely …

VPWEM: Non-Markovian Visuomotor Policy with Working and Episodic Memory

Yuheng Lei, Zhixuan Liang, Hongyuan Zhang, Ping Luo · 2026

Imitation learning from human demonstrations has achieved significant success in robotic control, yet most visuomotor policies still condition on single-step observations or short-context histories, m…

Set-Membership Localization via Range Measurements

Giuseppe C. Calafiore · 2026

In this paper we discuss a classical geometrical problem of estimating an unknown point's location in $\mathbb{R}^n$ from several noisy measurements of the Euclidean distances from this point to a set of …

On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

Pradyumna Tambwekar, Andrew Silva, Deepak Gopinath, Jonathan DeCastro, Xiongyi Cui, Guy Rosman · 2026

Embodied foundation models are increasingly performant in real-world domains such as robotics or autonomous driving. These models are often deployed in interactive or assistive settings, where it is i…

LLM-Guided Decentralized Exploration with Self-Organizing Robot Teams

Hiroaki Kawashima, Shun Ikejima, Takeshi Takai, Mikita Miyaguchi, Yasuharu Kunii · 2026

When individual robots have limited sensing capabilities or insufficient fault tolerance, it becomes necessary for multiple robots to form teams during exploration, thereby increasing the collective o…

LEGS-POMDP: Language and Gesture-Guided Object Search in Partially Observable Environments

Ivy Xiao He, Stefanie Tellex, Jason Xinyu Liu · 2026

To assist humans in open-world environments, robots must interpret ambiguous instructions to locate desired objects. Foundation model-based approaches excel at multimodal grounding, but they lack a pr…

Python Bindings for a Large C++ Robotics Library: The Case of OMPL

Weihang Guo, Theodoros Tyrovouzis, Lydia E. Kavraki · 2026

Python bindings are a critical bridge between high-performance C++ libraries and the flexibility of Python, enabling rapid prototyping, reproducible experiments, and integration with simulation and le…

RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies

Yinpei Dai, Hongze Fu, Jayjun Lee, Yuejiang Liu, Haoran Zhang, Jianing Yang, Chelsea Finn, Nima Fazeli, Joyce Chai · 2026

Memory is critical for long-horizon and history-dependent robotic manipulation. Such tasks often involve counting repeated actions or manipulating objects that become temporarily occluded. Recent visi…

From Local Corrections to Generalized Skills: Improving Neuro-Symbolic Policies with MEMO

Benjamin A. Christie, Yinlong Dai, Mohammad Bararjanianbahnamiri, Simon Stepputtis, Dylan P. Losey · 2026

Recent works use a neuro-symbolic framework for general manipulation policies. The advantage of this framework is that, by applying off-the-shelf vision and language models, the robot can break co…

BrainWhisperer: Leveraging Large-Scale ASR Models for Neural Speech Decoding

Tommaso Boccato, Michal Olak, Matteo Ferrante · 2026

Decoding continuous speech from intracortical recordings is a central challenge for brain-computer interfaces (BCIs), with transformative potential for individuals with conditions that impair their ab…

Enhancing Power Systems Transmission Adequacy via Optimal BESS Siting and Sizing using Benders Decomposition with Feasibility Cuts

Ginevra Larroux, Matthieu Jacobs, Keyu Jia, Fabrizio Sossan, Mario Paolone · 2026

This work presents a general framework for the operationally driven optimal siting and sizing of battery energy storage systems in power transmission networks, aimed at enhancing their resource adequa…

GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning

Mingleyang Li, Yuran Wang, Yue Chen, Tianxing Chen, Jiaqi Liang, Zishun Shen, Haoran Lu, Ruihai Wu, Hao Dong · 2026

Garment manipulation has attracted increasing attention due to its critical role in home-assistant robotics. However, the majority of existing garment manipulation works assume an initial state consis…

Lightweight Visual Reasoning for Socially-Aware Robots

Alessio Galatolo, Ronald Cumbal, Alexandros Rouchitsas, Katie Winkle, Didem Gurdur Broo, Ginevra Castellano · 2026

Robots operating in shared human environments must not only navigate, interact, and detect their surroundings but also interpret and respond to dynamic, and often unpredictable, human behaviour…

IROSA: Interactive Robot Skill Adaptation using Natural Language

Markus Knauer, Samuel Bustamante, Thomas Eiband, Alin Albu-Schaffer, Freek Stulp, Joao Silverio · 2026

Foundation models have demonstrated impressive capabilities across diverse domains, while imitation learning provides principled methods for robot skill adaptation from limited data. Combining these a…

MRPoS: Mixed Reality-Based Robot Navigation Interface Using Spatial Pointing and Speech with Large Language Model

Eduardo Iglesius, Masato Kobayashi, Yuki Uranishi · 2026

Recent advancements have made robot navigation more intuitive by transitioning from traditional 2D displays to spatially aware Mixed Reality (MR) systems. However, current MR interfaces often rely on …

SkillVLA: Tackling Combinatorial Diversity in Dual-Arm Manipulation via Skill Reuse

Xuanran Zhai, Zekai Huang, Longyan Wu, Qianyou Zhao, Qiaojun Yu, Jieji Ren, Ce Hao, Harold Soh · 2026

Recent progress in vision-language-action (VLA) models has demonstrated strong potential for dual-arm manipulation, enabling complex behaviors and generalization to unseen environments. However, mains…

Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport

Hao Zhang, Ding Zhao, H. Eric Tseng · 2026

Effective human-robot collaboration (HRC) requires translating high-level intent into contact-stable whole-body motion while continuously adapting to a human partner. Many vision-language-action (VLA)…

Large-Language-Model-Guided State Estimation for Partially Observable Task and Motion Planning

Yoonwoo Kim, Raghav Arora, Roberto Martin-Martin, Peter Stone, Ben Abbatematteo, Yoonchang Sung · 2026

Robot planning in partially observable environments, where not all objects are known or visible, is a challenging problem, as it requires reasoning under uncertainty through partially observable Marko…
