Calvin Tsay in Engineering — Research Repository

Engineering Preprint PDF DOI

How VLAs (Really) Work In Open-World Environments

Amir Rasouli, Yangzheng Wu, Zhiyuan Li, Rui Heng Yang, Xuan Zhao, Charles Eret, Sajjad Pakdamansavoji · 2026

Vision-language-action models (VLAs) have been extensively used in robotics applications, achieving great success in various manipulation problems. More recently, VLAs have been used in long-horizon t…

Read Paper →

Engineering Preprint PDF DOI

DART: Learning-Enhanced Model Predictive Control for Dual-Arm Non-Prehensile Manipulation

Autrio Das, Shreya Bollimuntha, Madala Venkata Renu Jeevesh, Keshab Patra, Tashmoy Ghosh, Nagamanikandan Govindan, Arun Kumar Singh, K Madhava Krishna · 2026

What appears effortless to a human waiter remains a major challenge for robots. Manipulating objects nonprehensilely on a tray is inherently difficult, and the complexity is amplified in dual-arm sett…

Read Paper →

Engineering Preprint PDF DOI

AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models

Tingzheng Jia, Kan Guo, Lanping Qian, Yongli Hu, Daxin Tian, Guixian Qu, Chunmian Lin, Baocai Yin, Jiapu Wang · 2026

Precision-critical manipulation requires both global trajectory organization and local execution correction, yet most vision-language-action (VLA) policies generate actions within a single unified spa…

Read Paper →

Engineering Preprint PDF DOI

Learning to Forget -- Hierarchical Episodic Memory for Lifelong Robot Deployment

Leonard Barmann, Joana Plewnia, Alex Waibel, Tamim Asfour · 2026

Robots must verbalize their past experiences when users ask "Where did you put my keys?" or "Why did the task fail?" Yet maintaining life-long episodic memory (EM) from continuous multimodal perceptio…

Read Paper →

Engineering Preprint PDF DOI

CableTract: A Co-Designed Cable-Driven Field Robot for Low-Compaction, Off-Grid Capable Agriculture

Ozgur Yilmaz · 2026

Conventional field operations spend most of their energy moving the tractor body, not the implement. Yet feasibility studies for novel agricultural vehicles rarely tie mechanics, energy harvest, draft…

Read Paper →

Engineering Preprint PDF DOI

The Sustainability Gap in Robotics: A Large-Scale Survey of Sustainability Awareness in 50,000 Research Articles

Antun Skuric, Leandro Von Werra, Thomas Wolf · 2026

We present a large-scale survey of sustainability communication and motivation in robotics research. Our analysis covers nearly 50,000 open-access papers from arXiv's cs.RO category published between …

Read Paper →

Engineering Preprint PDF DOI

Reset-Free Reinforcement Learning for Real-World Agile Driving: An Empirical Study

Kohei Honda, Hirotaka Hosogaya · 2026

This paper presents an empirical study of reset-free reinforcement learning (RL) for real-world agile driving, in which a physical 1/10-scale vehicle learns continuously on a slippery indoor track wit…

Read Paper →

Engineering Preprint PDF DOI

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

Xinyang Fan, Zhaoyang Chen, Shu Xin, Yi Ren, Zainan Jiang, Fenglei Ni, Hong Liu · 2026

Multi-object nonprehensile transportation in teleoperation demands simultaneous trajectory tracking and tray orientation control. Existing methods often struggle with model dependency, uncertain param…

Read Paper →

Engineering Preprint PDF DOI

See Something, Say Something: Context-Criticality-Aware Mobile Robot Communication for Hazard Mitigations

Bhavya Oza, Devam Shah, Ghanashyama Prabhu, Devika Kodi, Aliasghar Arab · 2026

The proverb ``see something, say something'' captures a core responsibility of autonomous mobile robots in safety-critical situations: when they detect a hazard, they must communicate--and do so quick…

Read Paper →

Engineering Preprint PDF DOI

ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation

Hongyu Yan, Qiwei Li, Jiaolong Yang, Yadong Mu · 2026

Most existing vision-language-action (VLA) models for robotic manipulation lack progress awareness, typically relying on hand-crafted heuristics for task termination. This limitation is particularly s…

Read Paper →

Engineering Preprint PDF DOI

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Wenyao Zhang, Bozhou Zhang, Zekun Qi, Wenjun Zeng, Xin Jin, Li Zhang · 2026

Vision-language-action (VLA) models have shown great potential in building generalist robots, but still face a dilemma-misalignment of 2D image forecasting and 3D action prediction. Besides, such a vi…

Read Paper →

Engineering Preprint PDF DOI

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

Zhuo Li, Dianxi Li, Tao Teng, Quentin Rouxel, Zhipeng Dong, Dennis Hong, Darwin Caldwell, Fei Chen · 2026

Developing general-purpose robots capable of autonomously operating in human living environments requires the ability to adapt to continuously evolving task conditions. However, adapting high-dimensio…

Read Paper →

Engineering Preprint PDF DOI

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

Jiayi Chen, Wenxuan Song, Shuai Chen, Jingbo Wang, Zhijun Li, Haoang Li · 2026

Vision--Language--Action (VLA) models that encode actions using a discrete tokenization scheme are increasingly adopted for robotic manipulation, but existing decoding paradigms remain fundamentally l…

Read Paper →

Engineering Preprint PDF DOI

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Yang Liu, Pengxiang Ding, Tengyue Jiang, Xudong Wang, Wenxuan Song, Minghui Lin, Han Zhao, Hongyin Zhang, Zifeng Zhuang, Wei Zhao, Siteng Huang, Jinkui Shi, Donglin Wang · 2026

Vision-Language-Action (VLA) models aim to control robots for manipulation from visual observations and natural-language instructions. However, existing hierarchical and autoregressive paradigms often…

Read Paper →

Engineering Preprint PDF DOI

Controlling Fish Schools via Reinforcement Learning of Virtual Fish Movement

Yusuke Nishii, Hiroaki Kawashima · 2026

This study investigates a method to guide and control fish schools using virtual fish trained with reinforcement learning. We utilize 2D virtual fish displayed on a screen to overcome technical challe…

Read Paper →

Engineering Preprint PDF DOI

Bounds on Agreement between Subjective and Objective Measurements

Jaden Pieper, Stephen D. Voran · 2026

Objective estimators of multimedia quality are often judged by comparing estimates with subjective "truth data," most often via Pearson correlation coefficient (PCC) or mean-squared error (MSE). But s…

Read Paper →

Engineering Preprint PDF DOI

SmoothTurn: Learning to Turn Smoothly for Agile Navigation with Quadrupedal Robots

Zunzhi You, Haolan Guo, Yunke Wang, Chang Xu · 2026

Quadrupedal robots show great potential for valuable real-world applications such as fire rescue and industrial inspection. Such applications often require urgency and the ability to navigate agilely,…

Read Paper →

Engineering Preprint PDF DOI

SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning

Anlun Huang, Zhenyu Wu, Soofiyan Atar, Yuheng Zhi, Michael Yip · 2026

Stabilizing unsecured payloads against the inherent oscillations of dynamic bipedal locomotion remains a critical engineering bottleneck for humanoids in unstructured environments. To solve this, we i…

Read Paper →

Engineering Preprint PDF DOI

DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds

Siqi Pei, Andras Palffy, Dariu M. Gavrila · 2026

4D radars, which provide 3D point cloud data along with Doppler velocity, are attractive components of modern automated driving systems due to their low cost and robustness under adverse weather condi…

Read Paper →

Engineering Preprint PDF DOI

Trajectory Optimization for Self-Wrap-Aware Cable-Towed Planar Object Manipulation under Implicit Tension Constraints

Yu Li, Amin Fakhari, Hamid Sadeghian · 2026

Cable/rope elements are pervasive in deformable-object manipulation, often serving as a deformable force-transmission medium whose routing and contact determine how wrenches are delivered. In cable-to…

Read Paper →

Browse Research Papers

How VLAs (Really) Work In Open-World Environments

DART: Learning-Enhanced Model Predictive Control for Dual-Arm Non-Prehensile Manipulation

AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models

Learning to Forget -- Hierarchical Episodic Memory for Lifelong Robot Deployment

CableTract: A Co-Designed Cable-Driven Field Robot for Low-Compaction, Off-Grid Capable Agriculture

The Sustainability Gap in Robotics: A Large-Scale Survey of Sustainability Awareness in 50,000 Research Articles

Reset-Free Reinforcement Learning for Real-World Agile Driving: An Empirical Study

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

See Something, Say Something: Context-Criticality-Aware Mobile Robot Communication for Hazard Mitigations

ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Adapt as You Say: Online Interactive Bimanual Skill Adaptation via Human Language Feedback

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Controlling Fish Schools via Reinforcement Learning of Virtual Fish Movement

Bounds on Agreement between Subjective and Objective Measurements

SmoothTurn: Learning to Turn Smoothly for Agile Navigation with Quadrupedal Robots

SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning

DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds

Trajectory Optimization for Self-Wrap-Aware Cable-Towed Planar Object Manipulation under Implicit Tension Constraints

Browse by Category

Research Type

Publish Your Research