Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

MistyPilot: An Agentic Fast-Slow Thinking LLM Framework for Misty Social Robots

Xiao Wang, Lu Dong, Jingchen Sun, Ifeoma Nwogu, Srirangaraj Setlur, Venu Govindaraju · 2026

With the availability of open APIs in social robots, it has become easier to customize general-purpose tools to meet users' needs. However, interpreting high-level user instructions, selecting and con…

Read Paper →

Engineering Preprint PDF DOI

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

Marcel Torne, Karl Pertsch, Homer Walke, Kyle Vedder, Suraj Nair, Brian Ichter, Allen Z. Ren, Haohuan Wang, Jiaming Tang, Kyle Stachowicz, Karan Dhabalia, Michael Equi, Quan Vuong, Jost Tobias Springenberg, Sergey Levine, Chelsea Finn, Danny Driess · 2026

Conventionally, memory in end-to-end robotic learning involves inputting a sequence of past observations into the learned policy. However, in complex multi-stage real-world tasks, the robot's memory m…

Read Paper →

Engineering Preprint PDF DOI

Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

Vaishak Kumar · 2026

Can a multimodal language model learn to manipulate physical objects by reasoning about its own failures-without gradient updates, demonstrations, or reward engineering? We argue the answer is yes, un…

Read Paper →

Engineering Preprint PDF DOI

Multidisciplinary Design Optimization of a Low-Thrust Asteroid Orbit Insertion Using Electric Propulsion

Yacob Medhin, Tushar Sial, Simone Servadio · 2026

Low-thrust electric propulsion missions are often designed under simplifying assumptions such as constant thrust or fixed specific impulse, neglecting the strong coupling between trajectory dynamics, …

Read Paper →

Engineering Preprint PDF DOI

The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer's Disease

Franziska Braun, Christopher Witzl, Florian Honig, Elmar Noth, Tobias Bocklet, Korbinian Riedhammer · 2026

Early and accessible detection of Alzheimer's disease (AD) remains a major challenge, as current diagnostic methods often rely on costly and invasive biomarkers. Speech and language analysis has emerg…

Read Paper →

Engineering Preprint PDF DOI

EEG-Based Brain-LLM Interface for Human Preference Aligned Generation

Junzi Zhang, Jianing Shen, Weijie Tu, Yi Zhang, Hailin Zhang, Tom Gedeon, Bin Jiang, Yue Yao · 2026

Large language models (LLMs) are becoming an increasingly important component of human--computer interaction, enabling users to coordinate a wide range of intelligent agents through natural language. …

Read Paper →

Engineering Preprint PDF DOI

Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping

William Liang, Sam Wang, Hung-Ju Wang, Osbert Bastani, Yecheng Jason Ma, Dinesh Jayaraman · 2026

The ability to conduct and learn from interaction and experience is a central challenge in robotics, offering a scalable alternative to labor-intensive human demonstrations. However, realizing such "p…

Read Paper →

Engineering Preprint PDF DOI

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Ziyang Gong, Zehang Luo, Anke Tang, Zhe Liu, Shi Fu, Zhi Hou, Ganlin Yang, Weiyun Wang, Xiaofeng Wang, Jianbo Liu, Gen Luo, Haolan Kang, Shuang Luo, Yue Zhou, Yong Luo, Li Shen, Xiaosong Jia, Yao Mu, Xue Yang, Chunxiao Liu, Junchi Yan, Hengshuang Zhao, Dacheng Tao, Xiaogang Wang · 2026

Universal embodied intelligence demands robust generalization across heterogeneous embodiments, such as autonomous driving, robotics, and unmanned aerial vehicles (UAVs). However, existing embodied br…

Read Paper →

Engineering Preprint PDF DOI

From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?

Shinas Shaji, Fabian Huppertz, Alex Mitrevski, Sebastian Houben · 2026

In order to flexibly act in an everyday environment, a robotic agent needs a variety of cognitive capabilities that enable it to reason about plans and perform execution recovery. Large language model…

Read Paper →

Engineering Preprint PDF DOI

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

Shuide Wen, Sungil Seok, Beier Ku, Richee Li, Yubin He, Bowen Qu, Yang Yang, Ping Su, Can Jiao · 2026

We present DLIOS, a Large Language Model (LLM)-augmented real-time multi-modal interactive enhancement overlay system for Douyin (TikTok) live streaming. DLIOS employs a three-layer transparent window…

Read Paper →

Engineering Preprint PDF DOI

MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN

Ling Luo, Qianqian Bai · 2026

Vision-Language Navigation (VLN) aims to empower robots with the ability to perform long-horizon navigation in unfamiliar environments based on complex linguistic instructions. Its success critically …

Read Paper →

Engineering Preprint PDF DOI

CASSR: Continuous A-Star Search through Reachability for real time footstep planning

Jiayi Wang, Steve Tonneau · 2026

Footstep planning involves a challenging combinatorial search. Traditional A* approaches require discretising reachability constraints, while Mixed-Integer Programming (MIP) supports continuous formul…

Read Paper →

Engineering Preprint PDF DOI

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?

Xin Wang, Ge Wanying, Junichi Yamagishi · 2026

Building speech deepfake detection models that are generalizable to unseen attacks remains a challenging problem. Although the field has shifted toward a pre-training and fine-tuning paradigm using sp…

Read Paper →

Engineering Preprint PDF DOI

CoFL: Continuous Flow Fields for Language-Conditioned Navigation

Haokun Liu, Zhaoqi Ma, Yicheng Chen, Masaki Kitagawa, Wentao Zhang, Zicen Xiong, Jinjie Li, Moju Zhao · 2026

Existing language-conditioned navigation systems typically rely on modular pipelines or trajectory generators, but the latter use each scene--instruction annotation mainly to supervise one start-condi…

Read Paper →

Engineering Preprint PDF DOI

Benchmarking Speech Systems for Frontline Health Conversations: The DISPLACE-M Challenge

Dhanya E, Ankita Meena, Manas Nanivadekar, Noumida A, Victor Azad, Ashwini Nagaraj Shenoy, Pratik Roy Chowdhuri, Shobhit Banga, Vanshika Chhabra, Chitralekha Bhat, Shareef babu Kalluri, Srikanth Raj Chetupalli, Deepu Vijayasenan, Sriram Ganapathy · 2026

The DIarization and Speech Processing for LAnguage understanding in Conversational Environments - Medical (DISPLACE-M) challenge introduces a conversational AI benchmark for understanding goal-oriente…

Read Paper →

Engineering Preprint PDF DOI

Agentic Self-Evolutionary Replanning for Embodied Navigation

Guoliang Li, Ruihua Han, Chengyang Li, He Li, Shuai Wang, Wenchao Ding, Hong Zhang, Chengzhong Xu · 2026

Failure is inevitable for embodied navigation in complex environments. To enhance the resilience, replanning (RP) is a viable option, where the robot is allowed to fail, but is capable of adjusting pl…

Read Paper →

Engineering Preprint PDF DOI

IMR-LLM: Industrial Multi-Robot Task Planning and Program Generation using Large Language Models

Xiangyu Su, Juzhan Xu, Oliver van Kaick, Kai Xu, Ruizhen Hu · 2026

In modern industrial production, multiple robots often collaborate to complete complex manufacturing tasks. Large language models (LLMs), with their strong reasoning capabilities, have shown potential…

Read Paper →

Engineering Preprint PDF DOI

cuNRTO: GPU-Accelerated Nonlinear Robust Trajectory Optimization

Jiawei Wang, Arshiya Taj Abdul, Evangelos A. Theodorou · 2026

Robust trajectory optimization enables autonomous systems to operate safely under uncertainty by computing control policies that satisfy the constraints for all bounded disturbances. However, these pr…

Read Paper →

Engineering Preprint PDF DOI

LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics

Justin Williams, Kishor Datta Gupta, Roy George, Mrinmoy Sarkar · 2026

Vision-Language-Action (VLA) models provide a unified framework for perception, language conditioning, and action generation, but many existing systems remain difficult to deploy in embedded robotic s…

Read Paper →

Engineering Preprint PDF DOI

Give me scissors: Collision-Free Dual-Arm Surgical Assistive Robot for Instrument Delivery

Xuejin Luo, Shiquan Sun, Runshi Zhang, Ruizhi Zhang, Junchen Wang · 2026

During surgery, scrub nurses are required to frequently deliver surgical instruments to surgeons, which can lead to physical fatigue and decreased focus. Robotic scrub nurses provide a promising solut…

Read Paper →

Browse Research Papers

MistyPilot: An Agentic Fast-Slow Thinking LLM Framework for Misty Social Robots

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

Act-Observe-Rewrite: Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation

Multidisciplinary Design Optimization of a Low-Thrust Asteroid Orbit Insertion Using Electric Propulsion

The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer's Disease

EEG-Based Brain-LLM Interface for Human Preference Aligned Generation

Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN

CASSR: Continuous A-Star Search through Reachability for real time footstep planning

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?

CoFL: Continuous Flow Fields for Language-Conditioned Navigation

Benchmarking Speech Systems for Frontline Health Conversations: The DISPLACE-M Challenge

Agentic Self-Evolutionary Replanning for Embodied Navigation

IMR-LLM: Industrial Multi-Robot Task Planning and Program Generation using Large Language Models

cuNRTO: GPU-Accelerated Nonlinear Robust Trajectory Optimization

LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics

Give me scissors: Collision-Free Dual-Arm Surgical Assistive Robot for Instrument Delivery

Browse by Category

Research Type

Publish Your Research