Programming Languages in Engineering — Research Repository

Engineering Preprint PDF DOI

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Zaijing Li, Bing Hu, Rui Shao, Gongwei Chen, Dongmei Jiang, Pengwei Xie, Jianye Hao, Liqiang Nie · 2026

Hierarchical Vision-Language-Action (VLA) models have rapidly become a dominant paradigm for robotic manipulation. It typically comprising a Vision-Language backbone for perception and understanding, …

Read Paper →

Engineering Preprint PDF DOI

Human-to-Robot Interaction: Learning from Video Demonstration for Robot Imitation

Thanh Nguyen Canh, Thanh-Tuan Tran, Haolan Zhang, Ziyan Gao, Nak Young Chong, Xiem HoangVan · 2026

Learning from Demonstration (LfD) offers a promising paradigm for robot skill acquisition. Recent approaches attempt to extract manipulation commands directly from video demonstrations, yet face two c…

Read Paper →

Engineering Preprint PDF DOI

Evaluation and Benchmarking Suite for Financial Large Language Models and Agents

Shengyuan Lin, Kaiwen He, Jaisal Patel, Qinchuan Zhang, Chris Ding, James Tang, Keyi Wang, Yupeng Cao, Yan Wang, Kairong Xiao, Vincent Caldeira, Matt White, Xiao-Yang Liu Yanglet · 2026

Over the past three years, the financial services industry has witnessed Large Language Models (LLMs) and agents transitioning from the exploration stage to readiness and governance stages. Financial …

Read Paper →

Engineering Preprint PDF DOI

TactEx: An Explainable Multimodal Robotic Interaction Framework for Human-Like Touch and Hardness Estimation

Felix Verstraete, Lan Wei, Wen Fan, Dandan Zhang · 2026

Accurate perception of object hardness is essential for safe and dexterous contact-rich robotic manipulation. Here, we present TactEx, an explainable multimodal robotic interaction framework that unif…

Read Paper →

Engineering Preprint PDF DOI

[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic

Kwanghee Choi, Eunjung Yeo, Cheol Jun Cho, David Harwath, David R. Mortensen · 2026

Self-supervised speech models (S3Ms) are known to encode rich phonetic information, yet how this information is structured remains underexplored. We conduct a comprehensive study across 96 languages t…

Read Paper →

Engineering Preprint PDF DOI

Habilis-$\beta$: A Fast-Motion and Long-Lasting On-Device Vision-Language-Action Model

Tommoro Robotics: Jesoon Kang, Taegeon Park, Jisu An, Soo Min Kimm, Jaejoon Kim, Jinu Pahk, Byungju Kim, Junseok Lee, Namheon Baek, Sungwan Ha, Hojun Baek, Eduardo Ayerve Cruz, Wontae Kim, Junghyeon Choi, Yousuk Lee, Joonmo Han, Sunghyun Cho, Sunghyun Kwon, Soyoung Lee, Jun Ki Lee, Seung-Joon Yi, Byoung-Tak Zhang, Theo Taeyeong Kim · 2026

We introduce Habilis-$\beta$, a fast-motion and long-lasting on-device vision-language-action (VLA) model designed for real-world deployment. Current VLA evaluation remains largely confined to single-…

Read Paper →

Engineering Preprint PDF DOI

RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning

Seungku Kim, Suhyeok Jang, Byungjun Yoon, Dongyoung Kim, John Won, Jinwoo Shin · 2026

Synthetic data generated by video generative models has shown promise for robot learning as a scalable pipeline, but it often suffers from inconsistent action quality due to imperfectly generated vide…

Read Paper →

Engineering Preprint PDF DOI

OVerSeeC: Open-Vocabulary Costmap Generation from Satellite Images and Natural Language

Rwik Rana, Jesse Quattrociocchi, Dongmyeong Lee, Christian Ellis, Amanda Adkins, Adam Uccello, Garrett Warnell, Joydeep Biswas · 2026

Aerial imagery provides essential global context for autonomous navigation, enabling route planning at scales inaccessible to onboard sensing. We address the problem of generating global costmaps for …

Read Paper →

Engineering Preprint PDF DOI

Convex Block-Cholesky Approach to Risk-Constrained Low-thrust Trajectory Design under Operational Uncertainty

Kenshiro Oguri, Gregory Lantoine · 2026

Designing robust trajectories under uncertainties is an emerging technology that may represent a key paradigm shift in space mission design. As we pursue more ambitious scientific goals (e.g., multi-m…

Read Paper →

Engineering Preprint PDF DOI

How Fast Can I Run My VLA? Demystifying VLA Inference Performance with VLA-Perf

Wenqi Jiang, Jason Clemons, Karu Sankaralingam, Christos Kozyrakis · 2026

Vision-Language-Action (VLA) models have recently demonstrated impressive capabilities across various embodied AI tasks. While deploying VLA models on real-world robots imposes strict real-time infere…

Read Paper →

Engineering Preprint PDF DOI

Zero-shot Interactive Perception

Venkatesh Sripada, Frank Guerin, Amir Ghalamzan · 2026

Interactive perception (IP) enables robots to extract hidden information in their workspace and execute manipulation plans by physically interacting with objects and altering the state of the environm…

Read Paper →

Engineering Preprint PDF DOI

Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation

Muhammad Tayyab Khan, Lequn Chen, Wenhe Feng, Seung Ki Moon · 2026

Manufacturing automation in process planning, inspection planning, and digital-thread integration depends on a unified specification that binds the geometric features of a 3D CAD model to the geometri…

Read Paper →

Engineering Preprint PDF DOI

SimVLA: A Simple VLA Baseline for Robotic Manipulation

Yuankai Luo, Woping Chen, Tong Liang, Baiqiao Wang, Zhenguo Li · 2026

Vision-Language-Action (VLA) models have emerged as a promising paradigm for general-purpose robotic manipulation, leveraging large-scale pre-training to achieve strong performance. The field has rapi…

Read Paper →

Engineering Preprint PDF DOI

Hybrid System Planning using a Mixed-Integer ADMM Heuristic and Hybrid Zonotopes

Joshua A. Robbins, Andrew F. Thompson, Jonah J. Glunt, Herschel C. Pangborn · 2026

Embedded optimization-based planning for hybrid systems is challenging due to the use of mixed-integer programming, which is computationally intensive and often sensitive to the specific numerical for…

Read Paper →

Engineering Preprint PDF DOI

IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control

Qilong Cheng, Matthew Mackay, Ali Bereyhi · 2026

Robotic camera systems enable dynamic, repeatable motion beyond human capabilities, yet their adoption remains limited by the high cost and operational complexity of industrial-grade platforms. We pre…

Read Paper →

Engineering Preprint PDF DOI

CC-G2PnP: Streaming Grapheme-to-Phoneme and prosody with Conformer-CTC for unsegmented languages

Yuma Shirahata, Ryuichi Yamamoto · 2026

We propose CC-G2PnP, a streaming grapheme-to-phoneme and prosody (G2PnP) model to connect large language model and text-to-speech in a streaming manner. CC-G2PnP is based on Conformer-CTC architecture…

Read Paper →

Engineering Preprint PDF DOI

Environment-Aware Network-Level Design of Generalized Pinching-Antenna Systems--Part II: Geometry-Aware Case

Yanqing Xu, Zhiguo Ding, Xiu Yin Zhang, Trung Q. Duong, Tsung-Hui Chang · 2026

This two-part paper aims to develop an environment-aware network-level design framework for generalized pinching-antenna systems to overcome the limitations of conventional link-level optimization, wh…

Read Paper →

Engineering Preprint PDF DOI

BEMEval-Doc2Schema: Benchmarking Large Language Models for Structured Data Extraction in Building Energy Modeling

Yiyuan Jia, Xiaoqin Fu, Liang Zhang · 2026

Recent advances in foundation models, including large language models (LLMs), have created new opportunities to automate building energy modeling (BEM). However, systematic evaluation has remained cha…

Read Paper →

Engineering Preprint PDF DOI

MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation

Mehrshad Taji, Arad Mahdinezhad Kashani, Iman Ahmadi, AmirHossein Jadidi, Saina Kashani, Babak Khalaj · 2026

Task planning for robotic manipulation with large language models (LLMs) is an emerging area. Prior approaches rely on specialized models, fine tuning, or prompt tuning, and often operate in an open l…

Read Paper →

Engineering Preprint PDF DOI

Low-Thrust Trajectory Optimization for Cubesat Lunar Mission: HORYU-VI

Omer Burak Iskender, Keck Voon Ling, Mengu Cho, Sangkyun Kim, Necmi Cihan Orger · 2026

This paper presents a low-thrust trajectory optimization strategy to achieve a near-circular lunar orbit for a CubeSat injected into a lunar flyby trajectory. The 12U CubeSat HORYU-VI is equipped with…

Read Paper →

Browse Research Papers

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Human-to-Robot Interaction: Learning from Video Demonstration for Robot Imitation

Evaluation and Benchmarking Suite for Financial Large Language Models and Agents

TactEx: An Explainable Multimodal Robotic Interaction Framework for Human-Like Touch and Hardness Estimation

[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic

Habilis-$\beta$: A Fast-Motion and Long-Lasting On-Device Vision-Language-Action Model

RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning

OVerSeeC: Open-Vocabulary Costmap Generation from Satellite Images and Natural Language

Convex Block-Cholesky Approach to Risk-Constrained Low-thrust Trajectory Design under Operational Uncertainty

How Fast Can I Run My VLA? Demystifying VLA Inference Performance with VLA-Perf

Zero-shot Interactive Perception

Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation

SimVLA: A Simple VLA Baseline for Robotic Manipulation

Hybrid System Planning using a Mixed-Integer ADMM Heuristic and Hybrid Zonotopes

IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control

CC-G2PnP: Streaming Grapheme-to-Phoneme and prosody with Conformer-CTC for unsegmented languages

Environment-Aware Network-Level Design of Generalized Pinching-Antenna Systems--Part II: Geometry-Aware Case

BEMEval-Doc2Schema: Benchmarking Large Language Models for Structured Data Extraction in Building Energy Modeling

MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation

Low-Thrust Trajectory Optimization for Cubesat Lunar Mission: HORYU-VI

Browse by Category

Research Type

Publish Your Research