Expertini Research Research

Browse Research Papers

41,527+ open-access research outputs.

โœ• Clear
๐Ÿ” machine learning ๐Ÿ“‚ Engineering
Showing 41527 results for "machine learning" in Engineering
Engineering Preprint PDF DOI

Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion

Nimesh Khandelwal, Shakti S. Gupta ยท 2026

This paper documents a case study in agent-driven autonomous reinforcement learning research for quadruped locomotion. The setting was not a fully self-starting research system. A human provided high-โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Rainbow-DemoRL: Combining Improvements in Demonstration-Augmented Reinforcement Learning

Dwait Bhatt, Shih-Chieh Chou, Nikolay Atanasov ยท 2026

Several approaches have been proposed to improve the sample efficiency of online reinforcement learning (RL) by leveraging demonstrations collected offline. The offline data can be used directly as trโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

D-SPEAR: Dual-Stream Prioritized Experience Adaptive Replay for Stable Reinforcement Learning in Robotic Manipulation

Yu Zhang, Karl Mason ยท 2026

Robotic manipulation remains challenging for reinforcement learning due to contact-rich dynamics, long horizons, and training instability. Although off-policy actor-critic algorithms such as SAC and Tโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Learning swarm behaviour from a flock of homing pigeons using inverse optimal control

Afreen Islam ยท 2026

In this work, Global Position System (GPS) data from a flock of homing pigeons are analysed. The flocking behaviour of the considered homing pigeons is formulated as a swarm optimal trajectory trackinโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning

Leixin Chang, Xinchen Yao, Ben Liu, Liangjing Yang, Hua Chen ยท 2026

On-policy reinforcement learning (RL) algorithms have demonstrated great potential in robotic control, where effective exploration is crucial for efficient and high-quality policy learning. However, hโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

MetaTune: Adjoint-based Meta-tuning via Robotic Differentiable Dynamics

Xiexin Peng, Bingheng Wang, Tao Zhang, Ying Zheng ยท 2026

Disturbance observer-based control has shown promise in robustifying robotic systems against uncertainties. However, tuning such systems remains challenging due to the strong coupling between controllโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Prospect Theoretic Approach to Pursuit-evasion Differential Games with Risk Aversion and Probability Sensitivity

Zili Wang, Hao Yang, Xiangxiang Wang, Bin Jiang, Long Wang, Marios M. Polycarpou ยท 2026

This paper considers for the first time pursuit-evasion (PE) differential games with irrational perceptions of both pursuer and evader on probabilistic characteristics of environmental uncertainty. Fiโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching

Daichi Yashima, Koki Seno, Shuhei Kurita, Yusuke Oda, Komei Sugiura ยท 2026

Coarse-to-fine autoregressive modeling has recently shown strong promise for visuomotor policy learning, combining the inference efficiency of autoregressive methods with the global trajectory coherenโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Budgeted Robust Intervention Design for Financial Networks with Common Asset Exposures

Giuseppe C. Calafiore ยท 2026

In the context of containment of default contagion in financial networks, we here study a regulator that allocates pre-shock capital or liquidity buffers across banks connected by interbank liabilitieโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

DeepBayesFlow: A Bayesian Structured Variational Framework for Generalizable Prostate Segmentation via Expressive Posteriors and SDE-Girsanov Uncertainty Modeling

Zhuoyi Fang ยท 2026

Automatic prostate MRI segmentation faces persistent challenges due to inter-patient anatomical variability, blurred tissue boundaries, and distribution shifts arising from diverse imaging protocols. โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Autonomous overtaking trajectory optimization using reinforcement learning and opponent pose estimation

Matej Rene Cihlar, Luka Siktar, Branimir Caran, Marko Svaco ยท 2026

Vehicle overtaking is one of the most complex driving maneuvers for autonomous vehicles. To achieve optimal autonomous overtaking, driving systems rely on multiple sensors that enable safe trajectory โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

An End-to-end Flight Control Network for High-speed UAV Obstacle Avoidance based on Event-Depth Fusion

Dikai Shang, Jingyue Zhao, Shi Xu, Nanyang Ye, Lei Wang ยท 2026

Achieving safe, high-speed autonomous flight in complex environments with static, dynamic, or mixed obstacles remains challenging, as a single perception modality is incomplete. Depth cameras are effeโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Proprioceptive feedback paradigm for safe and resilient motion control

Mrdjan Jankovic ยท 2026

Proprioception is a human sense that provides feedback from muscles and joints about body position and motion. This key capability keeps us upright, moving, and responding quickly to slips or stumblesโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Data-driven discovery and control of multistable nonlinear systems and hysteresis via structured Neural ODEs

Ike Griss Salas, Ethan King ยท 2026

Many engineered physical processes exhibit nonlinear but asymptotically stable dynamics that converge to a finite set of equilibria determined by control inputs. Identifying such systems from data is โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

On-Device Super Resolution Imaging Using Low-Cost SPAD Array and Embedded Lightweight Deep Learning

Zhenya Zang, Xingda Li, David Day Uei Li ยท 2026

This work presents a lightweight super-resolution (LiteSR) neural network for depth and intensity images acquired from a consumer-grade single-photon avalanche diode (SPAD) array with a 48x32 spatial โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation

Hao Li, Long Yin Chung, Jack Goler, Ryan Zhang, Xiaochi Xie, Huy Ha, Shuran Song, Mark Cutkosky ยท 2026

Underwater robotic grasping is difficult due to degraded, highly variable imagery and the expense of collecting diverse underwater demonstrations. We introduce a system that (i) autonomously collects โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Beyond Freshness and Semantics: A Coupon-Collector Framework for Effective Status Updates

Youssef Ahmed, Arnob Ghosh, Chih-Chun Wang, Ness B. Shroff ยท 2026

For status update systems operating over unreliable energy-constrained wireless channels, we address Weaver's long-standing Level-C question: do my packets actually improve the plant's behavior? Each โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Gaussian Mixture Model Based Bayesian Learning for Sparse Channel Estimation in Orthogonal Time Frequency Space Modulated Systems

Surbhi Gehlot, Suraj Srivastava, Sandeep Kumar Yadav, Lajos Hanzo ยท 2026

A novel Gaussian mixture model (GMM) aided sparse Bayesian learning (SBL) framework is proposed for channel state information (CSI) estimation in orthogonal time-frequency space (OTFS) modulated systeโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation

Zhide Zhong, Haodong Yan, Junfeng Li, Junjie He, Tianran Zhang, Haoang Li ยท 2026

Although pre-trained Vision-Language-Action (VLA) models exhibit impressive generalization in robotic manipulation, post-training remains crucial to ensure reliable performance during deployment. Howeโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Ruka-v2: Tendon Driven Open-Source Dexterous Hand with Wrist and Abduction for Robot Learning

Xinqi Lucas Liu, Ruoxi Hu, Alejandro Ojeda Olarte, Zhuoran Chen, Kenny Ma, Charles Cheng Ji, Lerrel Pinto, Raunaq Bhirangi, Irmak Guzey ยท 2026

Lack of accessible and dexterous robot hardware has been a significant bottleneck to achieving human-level dexterity in robots. Last year, we released Ruka, a fully open-sourced, tendon-driven humanoiโ€ฆ

Read Paper โ†’
โ† Prev Page 39 of 2077 Next โ†’