Expertini Research Research

Browse Research Papers

39,379+ open-access research outputs.

โœ• Clear
๐Ÿ” avoidance learning ๐Ÿ“‚ Engineering
Showing 39379 results for "avoidance learning" in Engineering
Engineering Preprint PDF DOI

Semantic Sensing: A Task-Oriented Paradigm

Xiaoqi Zhang, J. Andrew Zhang, Chang Liu, Weijie Yuan, Geoffrey Ye Li, Moeness G. Amin ยท 2026

Sensing and communication are fundamental enablers of next-generation networks. While communication technologies have advanced significantly, sensing remains limited to conventional parameter estimatiโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Pontryagin Method of Model-based Reinforcement Learning via Hamiltonian Actor-Critic

Chengyang Gu, Yuxin Pan, Hui Xiong, Yize Chen ยท 2026

Model-based reinforcement learning (MBRL) improves sample efficiency by leveraging learned dynamics models for policy optimization. However, the effectiveness of methods such as actor-critic is often โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

AutoWorld: Scaling Multi-Agent Traffic Simulation with Self-Supervised World Models

Mozhgan Pourkeshavatz, Tianran Liu, Nicholas Rhinehart ยท 2026

Multi-agent traffic simulation is central to developing and testing autonomous driving systems. Recent data-driven simulators have achieved promising results, but rely heavily on supervised learning fโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

World2Rules: A Neuro-Symbolic Framework for Learning World-Governing Safety Rules for Aviation

Haichuan Wang, Jay Patrikar, Sebastian Scherer ยท 2026

Many real-world safety-critical systems are governed by explicit rules that define unsafe world configurations and constrain agent interactions. In practice, these rules are complex and context-dependโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Optimistic Online LQR via Intrinsic Rewards

Marcell Bartos, Bruce D. Lee, Lenart Treven, Andreas Krause, Florian Dorfler, Melanie N. Zeilinger ยท 2026

Optimism in the face of uncertainty is a popular approach to balance exploration and exploitation in reinforcement learning. Here, we consider the online linear quadratic regulator (LQR) problem, i.e.โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Computational Framework for Cross-Domain Mission Design and Onboard Cognitive Decision Support

J. de Curto, Adrianne Schneider, Ricardo Yanez, Maria Begara, Alvaro Rodriguez, Javier Lopez, Martina Fraga, Ignacio Gomez, Arman Akdag, Sumit Kulkarni, Siddhant Nair, Kiyan Govender, Eian Wratchford, Eli Lynskey, Seamus Dunlap, Cooper Nervick, Nicolas Tete, Rocio Fernandez, Pablo Gonzalez, Elena Municio, I. de Zarza ยท 2026

The design of distributed autonomous systems for operation beyond reliable ground contact presents a fundamental tension: as round-trip communication latency grows, the set of decisions delegable to gโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Covariance-Domain Near-Field Channel Estimation under Hybrid Compression: USW/Fresnel Model, Curvature Learning, and KL Covariance Fitting

R{i}fat Volkan Senyuva ยท 2026

Near-field propagation in extremely large aperture arrays requires joint angle-range estimation. In hybrid architectures, only $N_\mathrm{RF}\ll M$ compressed snapshots are available per slot, making โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Robust Multi-Agent Reinforcement Learning for Small UAS Separation Assurance under GPS Degradation and Spoofing

Alex Zongo, Filippos Fotiadis, Ufuk Topcu, Peng Wei ยท 2026

We address robust separation assurance for small Unmanned Aircraft Systems (sUAS) under GPS degradation and spoofing via Multi-Agent Reinforcement Learning (MARL). In cooperative surveillance, each aiโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Associative Memory System via Threshold Linear Networks

Qin He, Jing Shuang Li ยท 2026

Humans learn and form memories in stochastic environments. Auto-associative memory systems model these processes by storing patterns and later recovering them from corrupted versions. Here, memories aโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning

Philip Schroeder, Thomas Weng, Karl Schmeckpeper, Eric Rosen, Stephen Hart, Ondrej Biza ยท 2026

Vision-language models (VLMs) have shown impressive capabilities across diverse tasks, motivating efforts to leverage these models to supervise robot learning. However, when used as evaluators in reinโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Can Hierarchical Cross-Modal Fusion Predict Human Perception of AI Dubbed Content?

Ashwini Dasare, Nirmesh Shah, Ashishkumar Gudmalwar, Pankaj Wasnik ยท 2026

Evaluating AI generated dubbed content is inherently multi-dimensional, shaped by synchronization, intelligibility, speaker consistency, emotional alignment, and semantic context. Human Mean Opinion Sโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Learning a dynamic four-chamber shape model of the human heart for 95,695 UK Biobank participants

Qiang Ma, Qingjie Meng, Yicheng Wu, Shuo Wang, Mengyun Qiao, Steven Niederer, Declan P. O'Regan, Paul M. Matthews, Wenjia Bai ยท 2026

The human heart is a sophisticated system composed of four cardiac chambers with distinct shapes, which function in a coordinated manner. Existing shape models of the heart mainly focus on the ventricโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Convex Route to Thermomechanics: Learning Internal Energy and Dissipation

Hagen Holthusen, Paul Steinmann, Ellen Kuhl ยท 2026

We present a physics-based neural network framework for the discovery of constitutive models in fully coupled thermomechanics. In contrast to classical formulations based on the Helmholtz energy, we aโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Vision-Based Robotic Disassembly Combined with Real-Time MFA Data Acquisition

Federico Zocco, Maria Pozzi, Monica Malvezzi ยท 2026

Stable and reliable supplies of rare-Earth minerals and critical raw materials (CRMs) are essential for the development of the European Union. Since a large share of these materials enters the Union fโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing

Mohamed Elgouhary, Amr S. El-Wakeel ยท 2026

Pure Pursuit (PP) is a widely used path-tracking algorithm in autonomous vehicles due to its simplicity and real-time performance. However, its effectiveness is sensitive to the choice of lookahead diโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Feel Robot Feels: Tactile Feedback Array Glove for Dexterous Manipulation

Feiyu Jia, Xiaojie Niu, Sizhe Yang, Qingwei Ben, Tao Huang, Feng zhao, Jingbo Wang, Jiangmiao Pang ยท 2026

Teleoperation is a key approach for collecting high-quality, physically consistent demonstrations for robotic manipulation. However, teleoperation for dexterous manipulation remains constrained by: (iโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Intelligent Radio Resource Slicing for 6G In-Body Subnetworks

Samira Abdelrahman, Hossam Farag ยท 2026

6G In-body Subnetworks (IBSs) represent a key enabler for supporting standalone eXtended Reality (XR) applications. IBSs are expected to operate as an underlay to existing cellular networks, giving riโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

RAD-LAD: Rule and Language Grounded Autonomous Driving in Real-Time

Anurag Ghosh, Srinivasa Narasimhan, Manmohan Chandraker, Francesco Pittaluga ยท 2026

We present LAD, a real-time language--action planner with an interruptible architecture that produces a motion plan in a single forward pass (~20 Hz) or generates textual reasoning alongside a motion โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Tac2Real: Reliable and GPU Visuotactile Simulation for Online Reinforcement Learning and Zero-Shot Real-World Deployment

Ningyu Yan, Shuai Wang, Xing Shen, Hui Wang, Hanqing Wang, Yang Xiang, Jiangmiao Pang ยท 2026

Visuotactile sensors are indispensable for contact-rich robotic manipulation tasks. However, policy learning with tactile feedback in simulation, especially for online reinforcement learning (RL), remโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Tele-Catch: Adaptive Teleoperation for Dexterous Dynamic 3D Object Catching

Weiguang Zhao, Junting Dong, Rui Zhang, Kailin Li, Qin Zhao, Kaizhu Huang ยท 2026

Teleoperation is a key paradigm for transferring human dexterity to robots, yet most prior work targets objects that are initially static, such as grasping or manipulation. Dynamic object catch, whereโ€ฆ

Read Paper โ†’
โ† Prev Page 33 of 1969 Next โ†’