Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

AutoSiMP: Autonomous Topology Optimization from Natural Language via LLM-Driven Problem Configuration and Adaptive Solver Control

Shaoliang Yang, Jun Wang, Yunsheng Wang · 2026

We present AutoSiMP, an autonomous pipeline that transforms a natural-language structural problem description into a validated, binary topology without manual configuration. The pipeline comprises fiv…

Read Paper →

Engineering Preprint PDF DOI

ROSClaw: An OpenClaw ROS 2 Framework for Agentic Robot Control and Interaction

Irvin Steve Cardenas, Marcus Anthony Arnett, Natalie Catherine Yeo, Lucky Sah, Jong-Hoon Kim · 2026

Foundation models can endow robots with open-ended reasoning, language understanding, and adaptive planning, yet connecting a model to a physical robot today requires bespoke integration that couples …

Read Paper →

Engineering Preprint PDF DOI

VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation

Zhide Zhong, Haodong Yan, Junfeng Li, Junjie He, Tianran Zhang, Haoang Li · 2026

Although pre-trained Vision-Language-Action (VLA) models exhibit impressive generalization in robotic manipulation, post-training remains crucial to ensure reliable performance during deployment. Howe…

Read Paper →

Engineering Preprint PDF DOI

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Wenyao Zhang, Bozhou Zhang, Zekun Qi, Wenjun Zeng, Xin Jin, Li Zhang · 2026

Vision-language-action (VLA) models have shown great potential in building generalist robots, but still face a dilemma-misalignment of 2D image forecasting and 3D action prediction. Besides, such a vi…

Read Paper →

Engineering Preprint PDF DOI

The Multi-AMR Buffer Storage, Retrieval, and Reshuffling Problem: Exact and Heuristic Approaches

Max Disselnmeyer, Thomas Bomer, Laura Dorr, Bastian Amberg, Anne Meyer · 2026

Buffer zones are essential in production systems to decouple sequential processes. In dense floor storage environments, such as space-constrained brownfield facilities, manual operation is increasingl…

Read Paper →

Engineering Preprint PDF DOI

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

John Bateman, Andy M. Tyrrell, Jihong Zhu · 2026

Programming robots to perform complex tasks is often difficult and time consuming, requiring expert knowledge and skills in robot software and sometimes hardware. Imitation learning is a method for tr…

Read Paper →

Engineering Preprint PDF DOI

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

Zhuo Li, Dianxi Li, Tao Teng, Quentin Rouxel, Zhipeng Dong, Dennis Hong, Darwin Caldwell, Fei Chen · 2026

Developing general-purpose robots capable of autonomously operating in human living environments requires the ability to adapt to continuously evolving task conditions. However, adapting high-dimensio…

Read Paper →

Engineering Preprint PDF DOI

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

Hao Chen, Takuya Kiyokawa, Weiwei Wan, Kensuke Harada · 2026

Task-oriented grasping (TOG) is more challenging than simple object grasping because it requires precise identification of object parts and careful selection of grasping areas to ensure effective and …

Read Paper →

Engineering Preprint PDF DOI

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

Iana Zhura, Yara Mahmoud, Jeffrin Sam, Hung Khang Nguyen, Didar Seyidov, Miguel Altamirano Cabrera, Dzmitry Tsetserukou · 2026

Efficiently predicting motion plans directly from vision remains a fundamental challenge in robotics, where planning typically requires explicit goal specification and task-specific design. Recent vis…

Read Paper →

Engineering Preprint PDF DOI

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

Jiayi Chen, Wenxuan Song, Shuai Chen, Jingbo Wang, Zhijun Li, Haoang Li · 2026

Vision--Language--Action (VLA) models that encode actions using a discrete tokenization scheme are increasingly adopted for robotic manipulation, but existing decoding paradigms remain fundamentally l…

Read Paper →

Engineering Preprint PDF DOI

SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation

Jiwen Zhang, Xiangyu Shi, Siyuan Wang, Zerui Li, Zhongyu Wei, Qi Wu · 2026

Vision-and-Language Navigation (VLN) has recently benefited from Multimodal Large Language Models (MLLMs), enabling zero-shot navigation. While recent exploration-based zero-shot methods have shown pr…

Read Paper →

Engineering Preprint PDF DOI

Experimental study on surveillance video-based indoor occupancy measurement with occupant-centric control

Irfan Qaisar, Kailai Sun, Qingshan Jia, Qianchuan Zhao · 2026

Accurate occupancy information is essential for closed-loop occupant-centric control (OCC) in smart buildings. However, existing vision-based occupancy measurement methods often struggle to provide st…

Read Paper →

Engineering Preprint PDF DOI

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Dianyu Zhong, Tian Xing, Kailai Sun, Xu Yang, Heye Huang, Irfan Qaisar, Tinggang Jia, Shaobo Wang, Qianchuan Zhao · 2026

Heating, ventilation, and air conditioning (HVAC) systems account for a substantial share of building energy consumption. Environmental uncertainty and dynamic occupancy behavior bring challenges in d…

Read Paper →

Engineering Preprint PDF DOI

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

Amirhosein Chahe, Lifeng Zhou · 2026

Navigating to a visually specified goal given natural language instructions remains a fundamental challenge in embodied AI. Existing approaches either rely on reactive policies that struggle with long…

Read Paper →

Engineering Preprint PDF DOI

Adapting Segment Anything Model 3 for Concept-Driven Lesion Segmentation in Medical Images: An Experimental Study

Guoping Xu, Jayaram K. Udupa, Yubing Tong, Xin Long, Ying Zhang, Jie Deng, Weiguo Lu, You Zhang · 2026

Accurate lesion segmentation is essential in medical image analysis, yet most existing methods are designed for specific anatomical sites or imaging modalities, limiting their generalizability. Recent…

Read Paper →

Engineering Preprint PDF DOI

On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins

Lekshmi P, Neha Karanjkar · 2026

LLM-assisted modeling holds the potential to rapidly build executable Digital Twins of complex systems from only coarse descriptions and sensor data. However, resilience to LLM hallucination, human ov…

Read Paper →

Engineering Preprint PDF DOI

Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

Zehao Wang, Huaide Jiang, Shuaiwu Dong, Yuping Wang, Hang Qiu, Jiachen Li · 2026

Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-term intentions. Individuals differ in how they accelerate, brake, merge, yield, and overtake…

Read Paper →

Engineering Preprint PDF DOI

A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

Giulio Pisaneschi, Pierpaolo Serio, Estelle Gerbier, Andrea Dan Ryals, Lorenzo Pollini, Mario G. C. A. Cimino · 2026

This paper presents an experimental platform for studying intentional-state attribution toward a non-humanoid robot. The system combines a simulated robot, realistic task environments, and large langu…

Read Paper →

Engineering Preprint PDF DOI

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

Abdullah Hamdi, Changchun Yang, Xin Gao · 2026

Early screening via colonoscopy is critical for colon cancer prevention, yet developing robust AI systems for this domain is hindered by the lack of densely annotated, long-sequence video datasets. Ex…

Read Paper →

Engineering Preprint PDF DOI

LILAC: Language-Conditioned Object-Centric Optical Flow for Open-Loop Trajectory Generation

Motonari Kambara, Koki Seno, Tomoya Kaichi, Yanan Wang, Komei Sugiura · 2026

We address language-conditioned robotic manipulation using flow-based trajectory generation, which enables training on human and web videos of object manipulation and requires only minimal embodiment-…

Read Paper →

Browse Research Papers

AutoSiMP: Autonomous Topology Optimization from Natural Language via LLM-Driven Problem Configuration and Adaptive Solver Control

ROSClaw: An OpenClaw ROS 2 Framework for Agentic Robot Control and Interaction

VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

The Multi-AMR Buffer Storage, Retrieval, and Reshuffling Problem: Exact and Heuristic Approaches

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation

Experimental study on surveillance video-based indoor occupancy measurement with occupant-centric control

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation

Adapting Segment Anything Model 3 for Concept-Driven Lesion Segmentation in Medical Images: An Experimental Study

On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins

Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

LILAC: Language-Conditioned Object-Centric Optical Flow for Open-Loop Trajectory Generation

Browse by Category

Research Type

Publish Your Research