Mario Marchese in Engineering — Research Repository

Engineering Preprint PDF DOI

Robust Graph Matching through Semantic Relationship Generation for SLAM

David Perez-Saura, Jose Andres Millan-Romera, Miguel Fernandez-Cortizas, Holger Voos, Pascual Campoy, Jose Luis Sanchez-Lopez · 2026

Graph-based representations such as Scene Graphs enable localization in structured indoor environments by matching a locally observed graph, constructed from sensor data, to a prior map. This process …

Move-Then-Operate: Behavioral Phasing for Human-Like Robotic Manipulation

Haoming Xu, Lei Lei, Jie Gu, Chu Tang, Jingmin Chen, Ruiqi Wang · 2026

We present Move-Then-Operate, a vision-language-action framework that explicitly decouples robotic manipulation into two distinct behavioral phases: coarse relocation (move) and contact-critical inter…

Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training

Suning Huang, Jiaqi Shao, Ke Wang, Qianzhong Chen, Jiankai Sun, Yanjiang Guo, Mac Schwager, Jeannette Bohg · 2026

Have you ever post-trained a generalist vision-language-action (VLA) policy on a small demonstration dataset, only to find that it stops responding to new instructions and is limited to behaviors obse…

JAX-BEM: Gradient-Based Acoustic Shape Optimisation via a Differentiable Boundary Element Method

James Hipperson, Jonathan Hargreaves, Trevor Cox · 2026

Engineering structures are increasingly designed using numerical optimisation. However, traditional optimisation methods can be challenging with multiple objectives and many parameters. In machine lea…

A Universal Systematic Method to Generate Error Patterns on Memoryless Channels

Marwan Jalaleddine, Jiajie Li, Syed Mohsin Abbas, Warren J. Gross · 2026

The high computational cost of approaching the performance of maximum-likelihood (ML) decoding has limited its practical use for decades. Because the complexity grows exponentially with the message le…

Autonomous Vehicle Collision Avoidance With Racing Parameterized Deep Reinforcement Learning

Shathushan Sivashangaran, Vihaan Dutta, Apoorva Khairnar, Sepideh Gohari, Azim Eskandarian · 2026

Road traffic accidents are a leading cause of fatalities worldwide. In the US, human error causes 94% of crashes, resulting in excess of 7,000 pedestrian fatalities and $500 billion in costs annually.…

Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots

Yifei Yan, Linqi Ye · 2026

As reinforcement learning for humanoid robots evolves from single-task to multi-skill paradigms, efficiently expanding new skills while avoiding catastrophic forgetting has become a key challenge in e…

Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models

Longhao Li, Hongjie Chen, Zehan Li, Qihan Hu, Jian Kang, Jie Li, Lei Xie, Yongxiang Li · 2026

Recent advances in reasoning models have driven significant progress in text and multimodal domains, yet audio reasoning remains relatively limited. Only a few Large Audio Language Models (LALMs) inco…

Quantized Online LQR

Barron Han, Victoria Kostina, Babak Hassibi · 2026

We study online linear-quadratic regulation (LQR) with unknown dynamics under communication rate constraints. Classical networked control quantizes the plant state at every time step, requiring $O(T)$…

Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching

Jeremy Dao, Alan Fern · 2026

Simulation-trained legged locomotion policies often exhibit performance loss on hardware due to dynamics discrepancies between the simulator and the real world, highlighting the need for approaches th…

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

Changi Hong, Yoonah Song, Hwayoung Park, Chaewoon Bang, Dayeon Gu, Do Hyun Lee, Hong Kook Kim · 2026

Recently, artificial intelligence-based dubbing technology has advanced, enabling automated dubbing (AD) to convert the source speech of a video into target speech in different languages. However, nat…

TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs

Jing Peng, Chenghao Wang, Yi Yang, Lirong Qian, Junjie Li, Yu Xi, Shuai Wang, Kai Yu · 2026

Speech LLM post-training increasingly relies on efficient cross-modal alignment and robust low-resource adaptation, yet collecting large-scale audio-text pairs remains costly. Text-only alignment meth…

Spatio-Temporal Grounding of Large Language Models from Perception Streams

Jacob Anderson, Bardh Hoxha, Georgios Fainekos, Hideki Okamoto, Danil Prokhorov · 2026

Embodied-AI agents must reason about how objects move and interact in 3-D space over time, yet existing smaller frontier Large Language Models (LLMs) still mishandle fine-grained spatial relations, m…

Drift-Based Policy Optimization: Native One-Step Policy Learning for Online Robot Control

Yuxuan Gao, Yedong Shen, Shiqi Zhang, Wenhao Yu, Yifan Duan, Jia Pan, Jiajia Wu, Jiajun Deng, Yanyong Zhang · 2026

Although multi-step generative policies achieve strong performance in robotic manipulation by modeling multimodal action distributions, they require multi-step iterative denoising at inference time. E…

An Open-Source LiDAR and Monocular Off-Road Autonomous Navigation Stack

Remi Marsal, Quentin Picard, Adrien Poire, Sebastien Kerbourc'h, Thibault Toralba, Clement Yver, Alexandre Chapoutot, David Filliat · 2026

Off-road autonomous navigation demands reliable 3D perception for robust obstacle detection in challenging unstructured terrain. While LiDAR is accurate, it is costly and power-intensive. Monocular de…

ReVAR: A Data-Driven Algorithm for Generating Aero-Optic Phase Screens

Jeffrey W. Utley, Gregery T. Buzzard, Charles A. Bouman, Matthew R. Kemnetz · 2026

The propagation of light through a turbulent flow field around an aircraft results in optical distortions commonly known as aero-optic effects. The development of methods to mitigate these effects req…

Diff-VS: Efficient Audio-Aware Diffusion U-Net for Vocals Separation

Yun-Ning (Amy) Hung, Richard Vogl, Filip Korzeniowski, Igor Pereira · 2026

While diffusion models are best known for their performance in generative tasks, they have also been successfully applied to many other tasks, including audio source separation. However, current gener…

A Pontryagin Method of Model-based Reinforcement Learning via Hamiltonian Actor-Critic

Chengyang Gu, Yuxin Pan, Hui Xiong, Yize Chen · 2026

Model-based reinforcement learning (MBRL) improves sample efficiency by leveraging learned dynamics models for policy optimization. However, the effectiveness of methods such as actor-critic is often …

Fundamental Limits of Man-in-the-Middle Attack Detection in Model-Free Reinforcement Learning

Rishi Rani, Massimo Franceschetti · 2026

We consider the problem of learning-based man-in-the-middle (MITM) attacks in cyber-physical systems (CPS), and extend our previously proposed Bellman Deviation Detection (BDD) framework for model-fre…

Beyond Freshness and Semantics: A Coupon-Collector Framework for Effective Status Updates

Youssef Ahmed, Arnob Ghosh, Chih-Chun Wang, Ness B. Shroff · 2026

For status update systems operating over unreliable energy-constrained wireless channels, we address Weaver's long-standing Level-C question: do my packets actually improve the plant's behavior? Each …
