Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

VLMaterial: Vision-Language Model-Based Camera-Radar Fusion for Physics-Grounded Material Identification

Jiangyou Zhu, He Chen · 2026

Accurate material recognition is a fundamental capability for intelligent perception systems to interact safely and effectively with the physical world. For instance, distinguishing visually similar o…

Engineering Preprint PDF DOI

HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models

Shuiyuan Wang, Zhixian Zhao, Hongfei Xue, Chengyou Wang, Shuai Wang, Hui Bu, Xin Xu, Lei Xie · 2026

Evaluating the emotional intelligence (EI) of audio language models (ALMs) is critical. However, existing benchmarks mostly rely on synthesized speech, are limited to single-turn interactions, and dep…

Engineering Preprint PDF DOI

DA-PTQ: Drift-Aware Post-Training Quantization for Efficient Vision-Language-Action Models

Siyuan Xu, Tianshi Wang, Fengling Li, Lei Zhu, Heng Tao Shen · 2026

Vision-Language-Action models (VLAs) have demonstrated strong potential for embodied AI, yet their deployment on resource-limited robots remains challenging due to high memory and computational demand…

Engineering Preprint PDF DOI

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation

Yiran Qin, Jiahua Ma, Li Kang, Wenzhan Li, Yihang Jiao, Xin Wen, Xiufeng Song, Heng Zhou, Jiwen Yu, Zhenfei Yin, Xihui Liu, Philip Torr, Yilun Du, Ruimao Zhang · 2026

Recent advancements in foundational models, such as large language models and world models, have greatly enhanced the capabilities of robotics, enabling robots to autonomously perform complex tasks. H…

Engineering Preprint PDF DOI

CLASP: Closed-loop Asynchronous Spatial Perception for Open-vocabulary Desktop Object Grasping

Yiran Ling, Wenxuan Li, Siying Dong, Yize Zhang, Xiaoyao Huang, Jing Jiang, Ruonan Li, Jie Liu · 2026

Robot grasping of desktop objects is widely used in intelligent manufacturing, logistics, and agriculture. Although vision-language models (VLMs) show strong potential for robotic manipulation, their de…

Engineering Preprint PDF DOI

Learning to Forget -- Hierarchical Episodic Memory for Lifelong Robot Deployment

Leonard Barmann, Joana Plewnia, Alex Waibel, Tamim Asfour · 2026

Robots must verbalize their past experiences when users ask "Where did you put my keys?" or "Why did the task fail?" Yet maintaining life-long episodic memory (EM) from continuous multimodal perceptio…

Engineering Preprint PDF DOI

Speaker Attributed Automatic Speech Recognition Using Speech Aware LLMS

Hagai Aronowitz, Zvi Kons, Avihu Dekel, George Saon, Ron Hoory · 2026

Speaker-Attributed Automatic Speech Recognition (SAA) enhances traditional ASR systems by incorporating relative speaker identity tags directly into the transcript (e.g., [Speaker 1]:, [Speaker 2]:). …

Engineering Preprint PDF DOI

CLAW: Composable Language-Annotated Whole-body Motion Generation

Jianuo Cao, Yuxin Chen, Masayoshi Tomizuka · 2026

Training language-conditioned whole-body controllers for humanoid robots demands large-scale motion-language datasets. Existing approaches based on motion capture are costly and limited in diversity, …

Engineering Preprint PDF DOI

RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering

Zhuoyu Wu, Wenhui Ou, Pei-Sze Tan, Wenqi Fang, Sailaja Rajanala, Raphael C.-W. Phan · 2026

Retrieving procedure-oriented evidence from materials science papers is difficult because key synthesis details are often scattered across long, context-heavy documents and are not well captured by pa…

Engineering Preprint PDF DOI

Ro-SLM: Onboard Small Language Models for Robot Task Planning and Operation Code Generation

Wenhao Wang, Yanyan Li, Long Jiao, Jiawei Yuan · 2026

Recent advances in large language models (LLMs) provide robots with contextual reasoning abilities to comprehend human instructions. Yet, current LLM-enabled robots typically depend on cloud-based mod…

Engineering Preprint PDF DOI

Optimization Under Uncertainty for Energy Infrastructure Planning: A Synthesis of Methods, Tools, and Open Challenges

Rahman Khorramfar, Aron Brenner, Lara Booth, Ana Rivera, Ruaridh Macdonald, Priya Donti, Saurabh Amin · 2026

Energy infrastructure planning under uncertainty has become increasingly complex as electrification, interdependence between energy carriers, decarbonization, and extreme weather events reshape long-t…

Engineering Preprint PDF DOI

VLN-NF: Feasibility-Aware Vision-and-Language Navigation with False-Premise Instructions

Hung-Ting Su, Ting-Jun Wang, Jia-Fong Yeh, Min Sun, Winston H. Hsu · 2026

Conventional Vision-and-Language Navigation (VLN) benchmarks assume instructions are feasible and the referenced target exists, leaving agents ill-equipped to handle false-premise goals. We introduce …

Engineering Preprint PDF DOI

AnySlot: Goal-Conditioned Vision-Language-Action Policies for Zero-Shot Slot-Level Placement

Zhaofeng Hu, Sifan Zhou, Qinbo Zhang, Rongtao Xu, Qi Su, Ci-Jyun Liang · 2026

Vision-Language-Action (VLA) policies have emerged as a versatile paradigm for generalist robotic manipulation. However, precise object placement under compositional language instructions remains a ma…

Engineering Preprint PDF DOI

LLM-enabled Antenna Partitioning and Beamforming Optimization for Segmented Pinching

Qian Gao, Ruikang Zhong, Hyundong Shin, Yuanwei Liu · 2026

Integrated sensing and communication (ISAC) requires spatial architectures that can flexibly balance data transmission and environment sensing. Segmented pinching antenna-assisted ISAC provides such f…

Engineering Preprint PDF DOI

Graph-Enhanced LLM for SWAN-ISAC

Qian Gao, Ruikang Zhong, Yuanwei Liu · 2026

Segmented pinching antenna assisted integrated sensing and communication (ISAC) systems enable flexible spatial resource utilization by allowing different waveguide segments to be dynamically configur…

Engineering Preprint PDF DOI

Wearable AI in the Era of Large Sensor Models

Yize Cai, Baoshen Guo, Guobin Shen, Zhiqing Hong · 2026

As an effective approach to understanding the human-centric physical world, Wearable Artificial Intelligence (AI), which leverages multimodal wearable sensors to understand human physiology and behavi…

Engineering Preprint PDF DOI

STRONG-VLA: Decoupled Robustness Learning for Vision-Language-Action Models under Multimodal Perturbations

Yuhan Xie, Yuping Yan, Yunqi Zhao, Handing Wang, Yaochu Jin · 2026

Despite their strong performance in embodied tasks, recent Vision-Language-Action (VLA) models remain highly fragile under multimodal perturbations, where visual corruption and linguistic noise jointl…

Engineering Preprint PDF DOI

Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction

Qinjuan Wang, Shan Yang, Yongli Zhu · 2026

This paper introduces an LLM agent that automates power grid static analysis by converting natural language into MATPOWER scripts. The framework utilizes DeepSeek-OCR to build an enhanced vector datab…

Engineering Preprint PDF DOI

GPU-Accelerated Continuous-Time Successive Convexification for Contact-Implicit Legged Locomotion

Samuel C. Buckner, Purnanand Elango · 2026

Contact-implicit trajectory optimization (CITO) enables the automatic discovery of contact sequences, but most methods rely on fine time discretization to capture all contact events accurately, which …

Engineering Preprint PDF DOI

ProGAL-VLA: Grounded Alignment through Prospective Reasoning in Vision-Language-Action Models

Nastaran Darabi, Amit Ranjan Trivedi · 2026

Vision language action (VLA) models enable generalist robotic agents but often exhibit language ignorance, relying on visual shortcuts and remaining insensitive to instruction changes. We present Pros…
