Expertini Research Research

Browse Research Papers

14,584+ open-access research outputs.

✕ Clear
🔍 visual perception 📂 Engineering 📄 Preprint
Showing 14584 results for "visual perception" in Engineering · Preprint
Engineering Preprint PDF DOI

OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction

Junyoung Lee, Sookwan Han, Jeonghwan Kim, Inhee Lee, Mingi Choi, Jisoo Kim, Wonjung Woo, Hanbyul Joo · 2026

Human-robot collaboration has been studied primarily in dyadic or sequential settings. However, real homes require multiadic collaboration, where multiple humans and robots share a workspace, acting c…

Read Paper →
Engineering Preprint PDF DOI

LiDAR-based Dynamic Blockage Prediction: A Data-driven Approach for Learning Interactive Bayesian Models

Saleemullah Memon, Ali Krayani, Pamela Zontone, Lucio Marcenaro, David Martin Gomez, Carlo Regazzoni · 2026

Vehicular sensing-based intelligence has made substantial progress in transportation systems, leading to higher levels of safety and sustainability for smart cities and autonomous systems. This paper …

Read Paper →
Engineering Preprint PDF DOI

Dreaming Across Towns: Semantic Rollout and Town-Adversarial Regularization for Zero-Shot Held-Out-Town Fixed-Route Driving in CARLA

Feeza Khan Khanzada, Jaerock Kwon · 2026

Learned driving agents often degrade when deployed in unseen environments. This paper studies a deliberately bounded instance of that problem in the CARLA simulator: zero-shot transfer of a closed-loo…

Read Paper →
Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Read Paper →
Engineering Preprint PDF DOI

MotuBrain: An Advanced World Action Model for Robot Control

MotuBrain Team, Chendong Xiang, Fan Bao, Haitian Liu, Hengkai Tan, Hongzhe Bi, James Li, Jiabao Liu, Jingrui Pang, Kiro Jing, Louis Liu, Mengchen Cai, Rongxu Cui, Ruowen Zhao, Runqing Wang, Shuhe Huang, Yao Feng, Yinze Rong, Zeyuan Wang, Jun Zhu · 2026

Vision-Language-Action (VLA) models achieve strong semantic generalization but often lack fine-grained modeling of world dynamics. Recent work explores video generation models as a foundation for worl…

Read Paper →
Engineering Preprint PDF DOI

Connected Dependability Cage: Run-Time Function and Anomaly Monitoring for the Development and Operation of Safe Automated Vehicles

Iqra Aslam, Nour Habib, Abhishek Buragohain, Meng Zhang, Andreas Rausch, Vaibhav Tiwari, Mohamed Benchat · 2026

The advancement of automated vehicles introduces complex safety challenges, particularly in dynamic and unpredictable environments where AI-enabled perception systems must operate reliably. Ensuring c…

Read Paper →
Engineering Preprint PDF DOI

BUT System Description for CHiME-9 MCoRec Challenge

Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukas Burget · 2026

Multi-talker automatic speech recognition (ASR) in conversational recordings remains an open problem, particularly in scenarios with large portion of overlapping speech where identifying and transcrib…

Read Paper →
Engineering Preprint PDF DOI

An Experimental Modular Instrument With a Haptic Feedback Framework for Robotic Surgery Training

Walid Shaker, Mustafa Suphi Erden · 2026

Robotic-assisted surgery offers significant clinical advantages but largely eliminates direct haptic feedback, increasing the risk of excessive tool-tissue interaction forces. Although recent commerci…

Read Paper →
Engineering Preprint PDF DOI

A Real-time Scale-robust Network for Glottis Segmentation in Nasal Transnasal Intubation

Yang Zhou, Chaoyong Zhang, Ruoyi Hao, Huilin Pan, Yang Zhang, Hongliang Ren · 2026

Nasotracheal intubation (NTI) is a critical clinical procedure for establishing and maintaining patient airway patency. Machine-assisted NTI has emerged as a pivotal approach for optimizing procedural…

Read Paper →
Engineering Preprint PDF DOI

DOT-Sim: Differentiable Optical Tactile Simulation with Precise Real-to-Sim Physical Calibration

Yang You, Won Kyung Do, Aiden Swann, Rika Antonova, Monroe Kennedy, Leonidas Guibas · 2026

Simulating optical tactile sensors presents significant challenges due to their high deformability and intricate optical properties. To address these issues and enable a physically accurate simulation…

Read Paper →
Engineering Preprint PDF DOI

Learning Tactile-Aware Quadrupedal Loco-Manipulation Policies

Pokuang Zhou, Yuhao Zhou, Quan Luu, Seungho Han, Heng Zhang, Binghao Huang, Yunzhu Li, Arash Ajoudani, Zhengtong Xu, Yu She · 2026

Quadrupedal loco-manipulation is commonly built on visual perception and proprioception. Yet reliable contact-rich manipulation remains difficult: vision and proprioception alone cannot resolve uncert…

Read Paper →
Engineering Preprint PDF DOI

Real-Time GPU-Accelerated Monte Carlo Evaluation of Safety-Critical AEB Systems Under Uncertainty

Akshay Karjol, Shadi Alawneh · 2026

Automatic Emergency Braking (AEB) systems represent a safety-critical national interest, with the National Highway Traffic Safety Administration (NHTSA) Federal Motor Vehicle Safety Standard (FMVSS No…

Read Paper →
Engineering Preprint PDF DOI

A New Location Estimator for Mixed LOS & NLOS scenarios

Gaurav Duggal, Richard M. Buehrer, Harpreet S. Dhillon, Jeffrey H. Reed · 2026

Time-of-arrival (TOA)-based localization in mixed line-of-sight (LOS) and non-line-of-sight (NLOS) environments is challenging because conventional Euclidean range models do not capture diffraction-do…

Read Paper →
Engineering Preprint PDF DOI

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

Jun Guo, Qiwei Li, Peiyan Li, Zilong Chen, Nan Sun, Yifei Su, Heyun Wang, Yuan Zhang, Xinghang Li, Huaping Liu · 2026

We propose X-WAM, a Unified 4D World Model that unifies real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critic…

Read Paper →
Engineering Preprint PDF DOI

CRLB and Parameter Estimation for OFDM-ISAC with Non-Uniform Sparse Resource Allocation

Wenjie Zhang, Qianglong Dai, Xiaoli Xu, Ruoguang Li, Yong Zeng · 2026

Integrated sensing and communication (ISAC) holds great promise in expanding the applications of wireless communication networks. However, in current communication-centric systems, the time-frequency …

Read Paper →
Engineering Preprint PDF DOI

3D Generation for Embodied AI and Robotic Simulation: A Survey

Tianwei Ye, Yifan Mao, Minwen Liao, Jian Liu, Chunchao Guo, Dazhao Du, Quanxin Shou, Fangqi Zhu, Song Guo · 2026

Embodied AI and robotic systems increasingly depend on scalable, diverse, and physically grounded 3D content for simulation-based training and real-world deployment. While 3D generative modeling has a…

Read Paper →
Engineering Preprint PDF DOI

HiPAN: Hierarchical Posture-Adaptive Navigation for Quadruped Robots in Unstructured 3D Environments

Jeil Jeong, Minsung Yoon, Seokryun Choi, Heechan Shin, Taegeun Yang, Sung-eui Yoon · 2026

Navigating quadruped robots in unstructured 3D environments poses significant challenges, requiring goal-directed motion, effective exploration to escape from local minima, and posture adaptation to t…

Read Paper →
Engineering Preprint PDF DOI

Adaptive Transform Coding for Semantic Compression

Andriy Enttsel, Vincent Corlay · 2026

Visual data compression is shifting from human-centered reconstruction to machine-oriented representation coding. In this setting, an image is often mapped to a compact semantic embedding, which is th…

Read Paper →
Engineering Preprint PDF DOI

Alter-Art: Exploring Embodied Artistic Creation through a Robot Avatar

Do Won Park, Samuele Bordini, Giorgio Grioli, Manuel G. Catalano, Antonio Bicchi · 2026

As with every emerging technology, new tools in the hands of artists reshape the nature of artwork creation. Current frameworks for robotics in arts deploy the robot as an autonomous creator or a coll…

Read Paper →
Engineering Preprint PDF DOI

Risk-Aware Multi-Market Scheduling of Virtual Power Plants with Dynamic Network Tariffs

Lorenzo Zapparoli, Paul Fath, Blazhe Gjorgiev, Giovanni Sansavini · 2026

As the penetration of distributed energy resources (DERs) increases, harnessing their flexibility becomes critical for power system operations. Virtual power plants (VPPs) offer a promising solution. …

Read Paper →
Page 1 of 730 Next →