Visual Perception in Engineering — Research Repository

Engineering Preprint PDF DOI

Intention-Aware Semantic Agent Communications for AI Glasses

Peiwen Jiang, Fangyu Liu, Jiajia Guo, Chao-Kai Wen, Shi Jin, Jun Zhang · 2026

Smart glasses are emerging as a promising interface between humans and artificial intelligence (AI) agents, enabling first-person perception, contextual awareness, and real-time assistance. However, c…

Read Paper →

Engineering Preprint PDF DOI

Tube Diffusion Policy: Reactive Visual-Tactile Policy Learning for Contact-rich Manipulation

Teng Xue, Alberto Rigo, Bingjian Huang, Jiayi Shen, Zhengtong Xu, Nick Colonnese, Amirhossein H. Memar · 2026

Contact-rich manipulation is central to many everyday human activities, requiring continuous adaptation to contact uncertainty and external disturbances through multi-modal perception, particularly vi…

Read Paper →

Engineering Preprint PDF DOI

PhysCodeBench: Benchmarking Physics-Aware Symbolic Simulation of 3D Scenes via Self-Corrective Multi-Agent Refinement

Tianyidan Xie, Peiyu Wang, Yuyi Qian, Yuxuan Wang, Rui Ma, Ying Tai, Song Wu, Qian Wang, Lanjun Wang, Zili Yi · 2026

Physics-aware symbolic simulation of 3D scenes is critical for robotics, embodied AI, and scientific computing, requiring models to understand natural language descriptions of physical phenomena and t…

Read Paper →

Engineering Preprint PDF DOI

An Efficient Beam Search Algorithm for Active Perception in Mobile Robotics

Kaixian Qu, Han Wang, Victor Klemm, Cesar Cadena, Marco Hutter · 2026

Active perception is a fundamental problem in autonomous robotics in which the robot must decide where to move and what to sense in order to obtain the most informative observations for accomplishing …

Read Paper →

Engineering Preprint PDF DOI

Modular Sensory Stream for Integrating Physical Feedback in Vision-Language-Action Models

Jimin Lee, Huiwon Jang, Myungkyu Koo, Jungwoo Park, Jinwoo Shin · 2026

Humans understand and interact with the real world by relying on diverse physical feedback beyond visual perception. Motivated by this, recent approaches attempt to incorporate physical sensory signal…

Read Paper →

Engineering Preprint PDF DOI

BridgeACT: Bridging Human Demonstrations to Robot Actions via Unified Tool-Target Affordances

Yifan Han, Jianxiang Liu, Haoyu Zhang, Yuqi Gu, Yunhan Guo, Wenzhao Lian · 2026

Learning robot manipulation from human videos is appealing due to the scale and diversity of human demonstrations, but transferring such demonstrations to executable robot behavior remains challenging…

Read Paper →

Engineering Preprint PDF DOI

Cooperative Informative Sensing for Monitoring Dynamic Indoor Environments via Multi-Agent Reinforcement Learning

Kanghoon Lee, Matthew M. Sato, Jinnyeong Yang, Seungro Lee, Sujin Lee, Jiachen Li, Kuk-Jin Yoon, Jinkyoo Park, Kincho H. Law, Yoonjin Yoon · 2026

Monitoring human activity in indoor environments is important for applications such as facility management, safety assessment, and space utilization analysis. While mobile robot teams offer the potent…

Read Paper →

Engineering Preprint PDF DOI

Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training

Suning Huang, Jiaqi Shao, Ke Wang, Qianzhong Chen, Jiankai Sun, Yanjiang Guo, Mac Schwager, Jeannette Bohg · 2026

Have you ever post-trained a generalist vision-language-action (VLA) policy on a small demonstration dataset, only to find that it stops responding to new instructions and is limited to behaviors obse…

Read Paper →

Engineering Preprint PDF DOI

Collaborative Trajectory Prediction via Late Fusion

Nadya Abdel Madjid, Murad Mebrahtu, Zakhar Yagudin, Bilal Hassan, Naoufel Werghi, Jorge Dias, Dzmitry Tsetserukou, Majid Khonji · 2026

Predicting future trajectories of surrounding traffic agents is critical for safe autonomous navigation and collision avoidance. Despite all advances in the trajectory forecasting realm, the predictio…

Read Paper →

Engineering Preprint PDF DOI

Are Natural-Domain Foundation Models Effective for Accelerated Cardiac MRI Reconstruction?

Anam Hashmi, Mayug Maniparambil, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor · 2026

The emergence of large-scale pretrained foundation models has transformed computer vision, enabling strong performance across diverse downstream tasks. However, their potential for physics-based inver…

Read Paper →

Engineering Preprint PDF DOI

A General EM-Based Channel Model for Reconfigurable Antenna Systems

Chen Xu, Xianghao Yu · 2026

Reconfigurable antenna systems (RASs), such as fluid antennas and movable antennas, are poised to play a pivotal role in sixth-generation (6G) systems by dynamically adapting the antenna elements for …

Read Paper →

Engineering Preprint PDF DOI

CodeGraphVLP: Code-as-Planner Meets Semantic-Graph State for Non-Markovian Vision-Language-Action Models

Khoa Vo, Sieu Tran, Taisei Hanyu, Yuki Ikebe, Duy Nguyen, Bui Duy Quoc Nghi, Minh Vu, Anthony Gunderman, Chase Rainwater, Anh Nguyen, Ngan Le · 2026

Vision-Language-Action (VLA) models promise generalist robot manipulation, but are typically trained and deployed as short-horizon policies that assume the latest observation is sufficient for action …

Read Paper →

Engineering Preprint PDF DOI

Virtualizing the Senses: Enabling High-Precision ISAC on Commercial Cellular Infrastructure

Henglin Pu, Husheng Li · 2026

Integrated sensing and communication (ISAC) is poised to be a defining feature of 6G networks, promising to transform cellular base stations (BSs) into ubiquitous radar sensors. However, a significant…

Read Paper →

Engineering Preprint PDF DOI

Long-Horizon Manipulation via Trace-Conditioned VLA Planning

Isabella Liu, An-Chieh Cheng, Rui Yan, Geng Chen, Ri-Zhao Qiu, Xueyan Zou, Sha Yi, Hongxu Yin, Xiaolong Wang, Sifei Liu · 2026

Long-horizon manipulation remains challenging for vision-language-action (VLA) policies: real tasks are multi-step, progress-dependent, and brittle to compounding execution errors. We present LoHo-Man…

Read Paper →

Engineering Preprint PDF DOI

PHOTON: Non-Invasive Optical Tracking of Key-Lever Motion in Historical Keyboard Instruments

Noah Jaffe, John Ashley Burgoyne · 2026

This paper introduces PHOTON (PHysical Optical Tracking of Notes), a non-invasive optical sensing system for measuring key-lever motion in historical keyboard instruments. PHOTON tracks the vertical d…

Read Paper →

Engineering Preprint PDF DOI

Using Assembly Language for Creating Games

Haris Turkmanovic, David Vukoje, Aleksandra Lekic, Milan Prokin · 2026

The aim of this paper is to demonstrate some interesting and useful approaches for writing a program in the assembly language. In order to demonstrate the possibilities of the assembly language, a pro…

Read Paper →

Engineering Preprint PDF DOI

Ufil: A Unified Framework for Infrastructure-based Localization

Simon Schafer, Lucas Hegerath, Marius Molz, Massimo Marcon, Bassam Alrifaee · 2026

Infrastructure-based localization enhances road safety and traffic management by providing state estimates of road users. Development is hindered by fragmented, application-specific stacks that tightl…

Read Paper →

Engineering Preprint PDF DOI

Encrypted Visual Feedback Control Using RLWE-Based Cryptosystem

Taichi Ikezaki, Kaoru Teranishi · 2026

This study proposes an encrypted visual feedback control algorithm for regulating a one-dimensional stage using Ring Learning With Errors (RLWE) encryption. The proposed algorithm performs both featur…

Read Paper →

Engineering Preprint PDF DOI

A Replicable Robotics Awareness Method Using LLM-Enabled Robotics Interaction: Evidence from a Corporate Challenge

S. A. Prieto, M. A. Gopee, Y. Ben Arab, B. Garcia de Soto, J. Esteba, P. Olivera Brizzio · 2026

Large language models are increasingly being explored as interfaces between humans and robotic systems, yet there remains limited evidence on how such technologies can be used not only for interaction…

Read Paper →

Engineering Preprint PDF DOI

A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration

Kuan Xu, Ruimeng Liu, Yizhuo Yang, Denan Liang, Tongxing Jin, Shenghai Yuan, Chen Wang, Lihua Xie · 2026

Bridging the gap between embodied intelligence and embedded deployment remains a key challenge in intelligent robotic systems, where perception, reasoning, and planning must operate under strict const…

Read Paper →

Browse Research Papers

Intention-Aware Semantic Agent Communications for AI Glasses

Tube Diffusion Policy: Reactive Visual-Tactile Policy Learning for Contact-rich Manipulation

PhysCodeBench: Benchmarking Physics-Aware Symbolic Simulation of 3D Scenes via Self-Corrective Multi-Agent Refinement

An Efficient Beam Search Algorithm for Active Perception in Mobile Robotics

Modular Sensory Stream for Integrating Physical Feedback in Vision-Language-Action Models

BridgeACT: Bridging Human Demonstrations to Robot Actions via Unified Tool-Target Affordances

Cooperative Informative Sensing for Monitoring Dynamic Indoor Environments via Multi-Agent Reinforcement Learning

Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training

Collaborative Trajectory Prediction via Late Fusion

Are Natural-Domain Foundation Models Effective for Accelerated Cardiac MRI Reconstruction?

A General EM-Based Channel Model for Reconfigurable Antenna Systems

CodeGraphVLP: Code-as-Planner Meets Semantic-Graph State for Non-Markovian Vision-Language-Action Models

Virtualizing the Senses: Enabling High-Precision ISAC on Commercial Cellular Infrastructure

Long-Horizon Manipulation via Trace-Conditioned VLA Planning

PHOTON: Non-Invasive Optical Tracking of Key-Lever Motion in Historical Keyboard Instruments

Using Assembly Language for Creating Games

Ufil: A Unified Framework for Infrastructure-based Localization

Encrypted Visual Feedback Control Using RLWE-Based Cryptosystem

A Replicable Robotics Awareness Method Using LLM-Enabled Robotics Interaction: Evidence from a Corporate Challenge

A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration

Browse by Category

Research Type

Publish Your Research