Patrick Hayden in Engineering — Research Repository

Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Read Paper →

Engineering Preprint PDF DOI

VEHRON: A Configuration-Driven BEV Simulation Framework for Subsystem-Level Studies

Subramanyam Natarajan · 2026

In practical early-stage battery-electric vehicle studies, analysis workflows may become fragmented across spreadsheets, notebooks, and project-specific scripts, making reuse, audit, and extension har…

Read Paper →

Engineering Preprint PDF DOI

Data-Driven Privacy-Preserving Modeling and Frequency Regulation with Aggregated Electric Vehicles via Bilinear Hidden Markov Model

Yiping Liu, Xiaozhe Wang, Geza Joos · 2026

Vehicle-to-Grid (V2G) technology allows bidirectional power flow for real-time grid support, making electric vehicles (EVs) well-suited for ancillary services such as frequency regulation. However, ex…

Read Paper →

Engineering Preprint PDF DOI

A Hidden Markov Framework for Physically Interpretable Arc Stability Dynamics in Welding Systems

Hidir Selcuk Nogay · 2026

Electric arc welding (EAW) exhibits strongly non stationary and temporally evolving behavior, making reliable assessment of arc stability difficult using conventional frame based approaches. In this s…

Read Paper →

Engineering Preprint PDF DOI

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

Xiangchen Wang, Weiye Zhu, Teng Wang, TianTian Geng, Zekai Zhang, Zhiyuan Qi, Jinyu Yang, Feng Zheng · 2026

Recent navigation systems achieve strong benchmark results, yet real-world deployment often remains visibly stop-and-go. This bottleneck arises because the sense-inference-execution loop is still bloc…

Read Paper →

Engineering Preprint PDF DOI

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Huakang Chen, Jingbin Hu, Liumeng Xue, Qirui Zhan, Wenhao Li, Guobin Ma, Hanke Xie, Dake Guo, Linhan Ma, Yuepeng Jiang, Bengu Wu, Pengyuan Xie, Chuan Xie, Qiang Zhang, Lei Xie · 2026

Instruction-following text-to-speech (TTS) has emerged as an important capability for controllable and expressive speech generation, yet its evaluation remains underdeveloped due to limited benchmark …

Read Paper →

Engineering Preprint PDF DOI

Abstract Sim2Real through Approximate Information States

Yunfu Deng, Yuhao Li, Josiah P. Hanna · 2026

In recent years, reinforcement learning (RL) has shown remarkable success in robotics when a fast and accurate simulator is available for a given task. When using RL and simulation, more simulator rea…

Read Paper →

Engineering Preprint PDF DOI

Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots

Yifei Yan, Linqi Ye · 2026

As reinforcement learning for humanoid robots evolves from single-task to multi-skill paradigms, efficiently expanding new skills while avoiding catastrophic forgetting has become a key challenge in e…

Read Paper →

Engineering Preprint PDF DOI

Structural Limits of Soft Fusion in Multi-Warden Covert Communication

Abbas Arghavani, Subhrakanti Dey, Anders Ahlen · 2026

This paper investigates covert wireless communication with a Fusion Center (FC) that aggregates raw energy measurements from multiple Wardens via soft fusion. Extending our prior work on power-thresho…

Read Paper →

Engineering Preprint PDF DOI

Investing Is Compression

Oscar Stiffelman · 2026

In 1956 John Kelly wrote a paper at Bell Labs describing the relationship between gambling and Information Theory. What came to be known as the Kelly Criterion is both an objective and a closed-form s…

Read Paper →

Engineering Preprint PDF DOI

Network Reconstruction in Consensus Algorithms with Hidden Agents

Melvyn Tyloo · 2026

Reconstructing the parameters that encode the influence between model variables based on time-series measurements represents an outstanding question in the theory of complex network-coupled systems. H…

Read Paper →

Engineering Preprint PDF DOI

DreamControl-v2: Simpler and Scalable Autonomous Humanoid Skills via Trainable Guided Diffusion Priors

Sudarshan Harithas, Sangkyung Kwak, Pushkal Katara, Srujan Deolasee, Dvij Kalaria, Srinath Sridhar, Sai Vemprala, Ashish Kapoor, Jonathan Chung-Kuan Huang · 2026

Developing robust autonomous loco-manipulation skills for humanoids remains an open problem in robotics. While RL has been applied successfully to legged locomotion, applying it to complex, interactio…

Read Paper →

Engineering Preprint PDF DOI

PARAFAC-Based Channel Estimation for Beyond Diagonal Reconfigurable Surfaces

Gilderlan Tavares de Araujo, Bruno Sokal, Andre L. F. de Almeida · 2026

Channel estimation is a central bottleneck in BD-RIS-assisted MIMO systems. The richer inter-element coupling that enables large performance gains also makes training and hardware control substantiall…

Read Paper →

Engineering Preprint PDF DOI

SqueezeComposer: Temporal Speed-up is A Simple Trick for Long-form Music Composing

Jianyi Chen, Rongxiu Zhong, Shilei Zhang, Kun Qian, Jinglei Liu, Yike Guo, Wei Xue · 2026

Composing coherent long-form music remains a significant challenge due to the complexity of modeling long-range dependencies and the prohibitive memory and computational requirements associated with l…

Read Paper →

Engineering Preprint PDF DOI

Seeing Where to Deploy: Metric RGB-Based Traversability Analysis for Aerial-to-Ground Hidden Space Inspection

Seoyoung Lee, Shaekh Mohammad Shithil, Durgakant Pushp, Lantao Liu, Zhangyang Wang · 2026

Inspection of confined infrastructure such as culverts often requires accessing hidden spaces whose entrances are reachable primarily from elevated viewpoints. Aerial-ground cooperation enables a UAV …

Read Paper →

Engineering Preprint PDF DOI

MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment

Wenze Ren, Yi-Cheng Lin, Wen-Chin Huang, Erica Cooper, Ryandhimas E. Zezario, Hsin-Min Wang, Hung-yi Lee, Yu Tsao · 2026

The Mean Opinion Score (MOS) serves as the standard metric for speech quality assessment, yet biases in human annotations remain underexplored. We conduct the first systematic analysis of gender bias …

Read Paper →

Engineering Preprint PDF DOI

Energy-Aware Multi-Exit TinyML for Smart Zero-Energy Devices

Shahab Jahanbazi, Mateen Ashraf, Lieven De Strycker, Jeroen Famaey, Onel L. A. Lopez · 2026

The proliferation of smart and autonomous systems has motivated a shift toward executing intelligence directly on edge devices. This shift becomes particularly challenging for zero-energy devices (ZED…

Read Paper →

Engineering Preprint PDF DOI

Is Your Safe Controller Actually Safe? A Critical Review of CBF Tautologies and Hidden Assumptions

Taekyung Kim · 2026

This tutorial provides a critical review of the practical application of Control Barrier Functions (CBFs) in robotic safety. While the theoretical foundations of CBFs are well-established, I identify …

Read Paper →

Engineering Preprint PDF DOI

Quantum Technologies and Edge Devices in Electrical Grids: Opportunities, Challenges, and Future Directions

Marjorie Hoegen, Rene Glebke, M. Sahnawaz Alam, Alessandro David, Juan Navarro Arenas, Nikolaus Wirtz, Mario Albanese, Daniele Carta, Felix Motzoi, Antonello Monti, Carsten Schuck, Andrea Benigni, Klaus Wehrle, Ferdinanda Ponci · 2026

In modern power systems, edge devices serve as local hubs that collect data, perform on-site computing, sense electrical parameters, execute control actions, and communicate with neighboring edge devi…

Read Paper →

Engineering Preprint PDF DOI

Robust Skills, Brittle Grounding: Diagnosing Restricted Generalization in Vision-Language Action Policies via Multi-Object Picking

David Emukpere, Romain Deffayet, Jean-Michel Renders · 2026

Vision-language action (VLA) policies often report strong manipulation benchmark performance with relatively few demonstrations, but it remains unclear whether this reflects robust language-to-object …

Read Paper →

Browse Research Papers

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

VEHRON: A Configuration-Driven BEV Simulation Framework for Subsystem-Level Studies

Data-Driven Privacy-Preserving Modeling and Frequency Regulation with Aggregated Electric Vehicles via Bilinear Hidden Markov Model

A Hidden Markov Framework for Physically Interpretable Arc Stability Dynamics in Welding Systems

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Abstract Sim2Real through Approximate Information States

Tree Learning: A Multi-Skill Continual Learning Framework for Humanoid Robots

Structural Limits of Soft Fusion in Multi-Warden Covert Communication

Investing Is Compression

Network Reconstruction in Consensus Algorithms with Hidden Agents

DreamControl-v2: Simpler and Scalable Autonomous Humanoid Skills via Trainable Guided Diffusion Priors

PARAFAC-Based Channel Estimation for Beyond Diagonal Reconfigurable Surfaces

SqueezeComposer: Temporal Speed-up is A Simple Trick for Long-form Music Composing

Seeing Where to Deploy: Metric RGB-Based Traversability Analysis for Aerial-to-Ground Hidden Space Inspection

MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment

Energy-Aware Multi-Exit TinyML for Smart Zero-Energy Devices

Is Your Safe Controller Actually Safe? A Critical Review of CBF Tautologies and Hidden Assumptions

Quantum Technologies and Edge Devices in Electrical Grids: Opportunities, Challenges, and Future Directions

Robust Skills, Brittle Grounding: Diagnosing Restricted Generalization in Vision-Language Action Policies via Multi-Object Picking

Browse by Category

Research Type

Publish Your Research