Expertini Research Research

Browse Research Papers

14,737+ open-access research outputs.

โœ• Clear
๐Ÿ” visual perception ๐Ÿ“‚ Engineering
Showing 14737 results for "visual perception" in Engineering
Engineering Preprint PDF DOI

Intention-Aware Semantic Agent Communications for AI Glasses

Peiwen Jiang, Fangyu Liu, Jiajia Guo, Chao-Kai Wen, Shi Jin, Jun Zhang ยท 2026

Smart glasses are emerging as a promising interface between humans and artificial intelligence (AI) agents, enabling first-person perception, contextual awareness, and real-time assistance. However, cโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Tube Diffusion Policy: Reactive Visual-Tactile Policy Learning for Contact-rich Manipulation

Teng Xue, Alberto Rigo, Bingjian Huang, Jiayi Shen, Zhengtong Xu, Nick Colonnese, Amirhossein H. Memar ยท 2026

Contact-rich manipulation is central to many everyday human activities, requiring continuous adaptation to contact uncertainty and external disturbances through multi-modal perception, particularly viโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

PhysCodeBench: Benchmarking Physics-Aware Symbolic Simulation of 3D Scenes via Self-Corrective Multi-Agent Refinement

Tianyidan Xie, Peiyu Wang, Yuyi Qian, Yuxuan Wang, Rui Ma, Ying Tai, Song Wu, Qian Wang, Lanjun Wang, Zili Yi ยท 2026

Physics-aware symbolic simulation of 3D scenes is critical for robotics, embodied AI, and scientific computing, requiring models to understand natural language descriptions of physical phenomena and tโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

An Efficient Beam Search Algorithm for Active Perception in Mobile Robotics

Kaixian Qu, Han Wang, Victor Klemm, Cesar Cadena, Marco Hutter ยท 2026

Active perception is a fundamental problem in autonomous robotics in which the robot must decide where to move and what to sense in order to obtain the most informative observations for accomplishing โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Modular Sensory Stream for Integrating Physical Feedback in Vision-Language-Action Models

Jimin Lee, Huiwon Jang, Myungkyu Koo, Jungwoo Park, Jinwoo Shin ยท 2026

Humans understand and interact with the real world by relying on diverse physical feedback beyond visual perception. Motivated by this, recent approaches attempt to incorporate physical sensory signalโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

BridgeACT: Bridging Human Demonstrations to Robot Actions via Unified Tool-Target Affordances

Yifan Han, Jianxiang Liu, Haoyu Zhang, Yuqi Gu, Yunhan Guo, Wenzhao Lian ยท 2026

Learning robot manipulation from human videos is appealing due to the scale and diversity of human demonstrations, but transferring such demonstrations to executable robot behavior remains challengingโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Cooperative Informative Sensing for Monitoring Dynamic Indoor Environments via Multi-Agent Reinforcement Learning

Kanghoon Lee, Matthew M. Sato, Jinnyeong Yang, Seungro Lee, Sujin Lee, Jiachen Li, Kuk-Jin Yoon, Jinkyoo Park, Kincho H. Law, Yoonjin Yoon ยท 2026

Monitoring human activity in indoor environments is important for applications such as facility management, safety assessment, and space utilization analysis. While mobile robot teams offer the potentโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training

Suning Huang, Jiaqi Shao, Ke Wang, Qianzhong Chen, Jiankai Sun, Yanjiang Guo, Mac Schwager, Jeannette Bohg ยท 2026

Have you ever post-trained a generalist vision-language-action (VLA) policy on a small demonstration dataset, only to find that it stops responding to new instructions and is limited to behaviors obseโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Collaborative Trajectory Prediction via Late Fusion

Nadya Abdel Madjid, Murad Mebrahtu, Zakhar Yagudin, Bilal Hassan, Naoufel Werghi, Jorge Dias, Dzmitry Tsetserukou, Majid Khonji ยท 2026

Predicting future trajectories of surrounding traffic agents is critical for safe autonomous navigation and collision avoidance. Despite all advances in the trajectory forecasting realm, the predictioโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Are Natural-Domain Foundation Models Effective for Accelerated Cardiac MRI Reconstruction?

Anam Hashmi, Mayug Maniparambil, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor ยท 2026

The emergence of large-scale pretrained foundation models has transformed computer vision, enabling strong performance across diverse downstream tasks. However, their potential for physics-based inverโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A General EM-Based Channel Model for Reconfigurable Antenna Systems

Chen Xu, Xianghao Yu ยท 2026

Reconfigurable antenna systems (RASs), such as fluid antennas and movable antennas, are poised to play a pivotal role in sixth-generation (6G) systems by dynamically adapting the antenna elements for โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

CodeGraphVLP: Code-as-Planner Meets Semantic-Graph State for Non-Markovian Vision-Language-Action Models

Khoa Vo, Sieu Tran, Taisei Hanyu, Yuki Ikebe, Duy Nguyen, Bui Duy Quoc Nghi, Minh Vu, Anthony Gunderman, Chase Rainwater, Anh Nguyen, Ngan Le ยท 2026

Vision-Language-Action (VLA) models promise generalist robot manipulation, but are typically trained and deployed as short-horizon policies that assume the latest observation is sufficient for action โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Virtualizing the Senses: Enabling High-Precision ISAC on Commercial Cellular Infrastructure

Henglin Pu, Husheng Li ยท 2026

Integrated sensing and communication (ISAC) is poised to be a defining feature of 6G networks, promising to transform cellular base stations (BSs) into ubiquitous radar sensors. However, a significantโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Long-Horizon Manipulation via Trace-Conditioned VLA Planning

Isabella Liu, An-Chieh Cheng, Rui Yan, Geng Chen, Ri-Zhao Qiu, Xueyan Zou, Sha Yi, Hongxu Yin, Xiaolong Wang, Sifei Liu ยท 2026

Long-horizon manipulation remains challenging for vision-language-action (VLA) policies: real tasks are multi-step, progress-dependent, and brittle to compounding execution errors. We present LoHo-Manโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

PHOTON: Non-Invasive Optical Tracking of Key-Lever Motion in Historical Keyboard Instruments

Noah Jaffe, John Ashley Burgoyne ยท 2026

This paper introduces PHOTON (PHysical Optical Tracking of Notes), a non-invasive optical sensing system for measuring key-lever motion in historical keyboard instruments. PHOTON tracks the vertical dโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Using Assembly Language for Creating Games

Haris Turkmanovic, David Vukoje, Aleksandra Lekic, Milan Prokin ยท 2026

The aim of this paper is to demonstrate some interesting and useful approaches for writing a program in the assembly language. In order to demonstrate the possibilities of the assembly language, a proโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Ufil: A Unified Framework for Infrastructure-based Localization

Simon Schafer, Lucas Hegerath, Marius Molz, Massimo Marcon, Bassam Alrifaee ยท 2026

Infrastructure-based localization enhances road safety and traffic management by providing state estimates of road users. Development is hindered by fragmented, application-specific stacks that tightlโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Encrypted Visual Feedback Control Using RLWE-Based Cryptosystem

Taichi Ikezaki, Kaoru Teranishi ยท 2026

This study proposes an encrypted visual feedback control algorithm for regulating a one-dimensional stage using Ring Learning With Errors (RLWE) encryption. The proposed algorithm performs both featurโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Replicable Robotics Awareness Method Using LLM-Enabled Robotics Interaction: Evidence from a Corporate Challenge

S. A. Prieto, M. A. Gopee, Y. Ben Arab, B. Garcia de Soto, J. Esteba, P. Olivera Brizzio ยท 2026

Large language models are increasingly being explored as interfaces between humans and robotic systems, yet there remains limited evidence on how such technologies can be used not only for interactionโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration

Kuan Xu, Ruimeng Liu, Yizhuo Yang, Denan Liang, Tongxing Jin, Shenghai Yuan, Chen Wang, Lihua Xie ยท 2026

Bridging the gap between embodied intelligence and embedded deployment remains a key challenge in intelligent robotic systems, where perception, reasoning, and planning must operate under strict constโ€ฆ

Read Paper โ†’
โ† Prev Page 3 of 737 Next โ†’