Machine Learning in Engineering — Research Repository

Engineering Preprint PDF DOI

Partial Motion Imitation for Learning Cart Pushing with Legged Manipulators

Mili Das, Morgan Byrd, Donghoon Baek, Sehoon Ha · 2026

Loco-manipulation is a key capability for legged robots to perform practical mobile manipulation tasks, such as transporting and pushing objects, in real-world environments. However, learning robust l…

Read Paper →

Engineering Preprint PDF DOI

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Wenyao Zhang, Bozhou Zhang, Zekun Qi, Wenjun Zeng, Xin Jin, Li Zhang · 2026

Vision-language-action (VLA) models have shown great potential in building generalist robots, but still face a dilemma-misalignment of 2D image forecasting and 3D action prediction. Besides, such a vi…

Read Paper →

Engineering Preprint PDF DOI

Meta-Adaptive Beam Search Planning for Transformer-Based Reinforcement Learning Control of UAVs with Overhead Manipulators under Flight Disturbances

Hazim Alzorgan, Sayed Pedram Haeri Boroujeni, Abolfazl Razi · 2026

Drones equipped with overhead manipulators offer unique capabilities for inspection, maintenance, and contact-based interaction. However, the motion of the drone and its manipulator is tightly linked,…

Read Paper →

Engineering Preprint PDF DOI

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

John Bateman, Andy M. Tyrrell, Jihong Zhu · 2026

Programming robots to perform complex tasks is often difficult and time consuming, requiring expert knowledge and skills in robot software and sometimes hardware. Imitation learning is a method for tr…

Read Paper →

Engineering Preprint PDF DOI

120 Minutes and a Laptop: Minimalist Image-goal Navigation via Unsupervised Exploration and Offline RL

Xiaoming Liu, Borong Zhang, Qingbiao Li, Steven Morad · 2026

The prevailing paradigm for image-goal visual navigation often assumes access to large-scale datasets, substantial pretraining, and significant computational resources. In this work, we challenge this…

Read Paper →

Engineering Preprint PDF DOI

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

Hao Chen, Takuya Kiyokawa, Weiwei Wan, Kensuke Harada · 2026

Task-oriented grasping (TOG) is more challenging than simple object grasping because it requires precise identification of object parts and careful selection of grasping areas to ensure effective and …

Read Paper →

Engineering Preprint PDF DOI

Adapting Frozen Mono-modal Backbones for Multi-modal Registration via Contrast-Agnostic Instance Optimization

Yi Zhang, Yidong Zhao, Qian Tao · 2026

Deformable image registration remains a central challenge in medical image analysis, particularly under multi-modal scenarios where intensity distributions vary significantly across scans. While deep …

Read Paper →

Engineering Preprint PDF DOI

Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate

Chen Yang, Yucheng Hu, Yunchao Ma, Yunhuan Yang, Jing Tan, Haoqiang Fan · 2026

In deployment of the VLA models to real-world robotic tasks, execution speed matters. In previous work arXiv:2510.26742 we analyze how to make neural computation of VLAs on GPU fast. However, we leave…

Read Paper →

Engineering Preprint PDF DOI

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

Iana Zhura, Yara Mahmoud, Jeffrin Sam, Hung Khang Nguyen, Didar Seyidov, Miguel Altamirano Cabrera, Dzmitry Tsetserukou · 2026

Efficiently predicting motion plans directly from vision remains a fundamental challenge in robotics, where planning typically requires explicit goal specification and task-specific design. Recent vis…

Read Paper →

Engineering Preprint PDF DOI

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

Jiayi Chen, Wenxuan Song, Shuai Chen, Jingbo Wang, Zhijun Li, Haoang Li · 2026

Vision--Language--Action (VLA) models that encode actions using a discrete tokenization scheme are increasingly adopted for robotic manipulation, but existing decoding paradigms remain fundamentally l…

Read Paper →

Engineering Preprint PDF DOI

Uncertainty-Aware Mapping from 3D Keypoints to Anatomical Landmarks for Markerless Biomechanics

Cesare Davide Pace, Alessandro Marco De Nunzio, Claudio De Stefano, Francesco Fontanella, Mario Molinara · 2026

Markerless biomechanics increasingly relies on 3D skeletal keypoints extracted from video, yet downstream biomechanical mappings typically treat these estimates as deterministic, providing no principl…

Read Paper →

Engineering Preprint PDF DOI

Reliability-Aware Weighted Multi-Scale Spatio-Temporal Maps for Heart Rate Monitoring

Arpan Bairagi, Rakesh Dey, Siladittya Manna, Umapada Pal · 2026

Remote photoplethysmography (rPPG) allows for the contactless estimation of physiological signals from facial videos by analyzing subtle skin color changes. However, rPPG signals are extremely suscept…

Read Paper →

Engineering Preprint PDF DOI

FINDER: Zero-Shot Field-Integrated Network for Distortion-free EPI Reconstruction in Diffusion MRI

Namgyu Han, Seong Dae Yun, Chaeeun Lim, Sunghyun Seok, Sunju Kim, Yoonhwan Kim, Yohan Jun, Tae Hyung Kim, Berkin Bilgic, Jaejin Cho · 2026

Echo-planar imaging (EPI) remains the cornerstone of diffusion MRI, but it is prone to severe geometric distortions due to its rapid sampling scheme that renders the sequence highly sensitive to $B_{0…

Read Paper →

Engineering Preprint PDF DOI

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Dianyu Zhong, Tian Xing, Kailai Sun, Xu Yang, Heye Huang, Irfan Qaisar, Tinggang Jia, Shaobo Wang, Qianchuan Zhao · 2026

Heating, ventilation, and air conditioning (HVAC) systems account for a substantial share of building energy consumption. Environmental uncertainty and dynamic occupancy behavior bring challenges in d…

Read Paper →

Engineering Preprint PDF DOI

Cone-Beam CT Image Quality Enhancement Using A Latent Diffusion Model Trained with Simulated CBCT Artifacts

Naruki Murahashi, Mitsuhiro Nakamura, Megumi Nakao · 2026

Cone-beam computed tomography (CBCT) images are problematic in clinical medicine because of their low contrast and high artifact content compared with conventional CT images. Although there are some s…

Read Paper →

Engineering Preprint PDF DOI

Fractional Risk Analysis of Stochastic Systems with Jumps and Memory

Yimeng Sun, Zhuoyuan Wang, Xiaole Zhang, Heng Ping, Jintang Xue, Paul Bogdan, Yorie Nakahira · 2026

Accurate risk assessment is essential for safety-critical autonomous and control systems under uncertainty. In many real-world settings, stochastic dynamics exhibit asymmetric jumps and long-range mem…

Read Paper →

Engineering Preprint PDF DOI

Data-Driven Probabilistic Fault Detection and Identification via Density Flow Matching

Joshua D. Ibrahim, Mahdi Taheri, Soon-Jo Chung, Fred Y. Hadaegh · 2026

Fault detection and identification (FDI) is critical for maintaining the safety and reliability of systems subject to actuator and sensor faults. In this paper, the problem of FDI for nonlinear contro…

Read Paper →

Engineering Preprint PDF DOI

Global Location-Invariant Peak Storm Surge Prediction

Benjamin Pachev, Prateek Arora, Jinpai Zhao, Eirik Valseth · 2026

Storm surge is a significant threat to coastal communities across the globe, responsible for loss of life and enormous property damage. Consequently, significant efforts have been expended to develop …

Read Paper →

Engineering Preprint PDF DOI

Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

Maeva Guerrier, Karthik Soma, Jana Pavlasek, Giovanni Beltrame · 2026

Visual Navigation Models (VNMs) promise generalizable, robot navigation by learning from large-scale visual demonstrations. Despite growing real-world deployment, existing evaluations rely almost excl…

Read Paper →

Engineering Preprint PDF DOI

Emergent Neural Automaton Policies: Learning Symbolic Structure from Visuomotor Trajectories

Yiyuan Pan, Xusheng Luo, Hanjiang Hu, Peiqi Yu, Changliu Liu · 2026

Scaling robot learning to long-horizon tasks remains a formidable challenge. While end-to-end policies often lack the structural priors needed for effective long-term reasoning, traditional neuro-symb…

Read Paper →

Browse Research Papers

Partial Motion Imitation for Learning Cart Pushing with Legged Manipulators

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Meta-Adaptive Beam Search Planning for Transformer-Based Reinforcement Learning Control of UAVs with Overhead Manipulators under Flight Disturbances

Addressing Ambiguity in Imitation Learning through Product of Experts based Negative Feedback

120 Minutes and a Laptop: Minimalist Image-goal Navigation via Unsupervised Exploration and Offline RL

Generalizable task-oriented object grasping through LLM-guided ontology and similarity-based planning

Adapting Frozen Mono-modal Backbones for Multi-modal Registration via Contrast-Agnostic Instance Optimization

Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching

Uncertainty-Aware Mapping from 3D Keypoints to Anatomical Landmarks for Markerless Biomechanics

Reliability-Aware Weighted Multi-Scale Spatio-Temporal Maps for Heart Rate Monitoring

FINDER: Zero-Shot Field-Integrated Network for Distortion-free EPI Reconstruction in Diffusion MRI

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Cone-Beam CT Image Quality Enhancement Using A Latent Diffusion Model Trained with Simulated CBCT Artifacts

Fractional Risk Analysis of Stochastic Systems with Jumps and Memory

Data-Driven Probabilistic Fault Detection and Identification via Density Flow Matching

Global Location-Invariant Peak Storm Surge Prediction

Can Vision Foundation Models Navigate? Zero-Shot Real-World Evaluation and Lessons Learned

Emergent Neural Automaton Policies: Learning Symbolic Structure from Visuomotor Trajectories

Browse by Category

Research Type

Publish Your Research