Hubert Lacoin in Engineering — Research Repository

Engineering Preprint PDF DOI

BUT System Description for CHiME-9 MCoRec Challenge

Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukas Burget · 2026

Multi-talker automatic speech recognition (ASR) in conversational recordings remains an open problem, particularly in scenarios with large portion of overlapping speech where identifying and transcrib…

Read Paper →

Engineering Preprint PDF DOI

The Field of Safe Motion: Operationalizing Affordances in the Field of Safe Travel Using Reachability Analysis

Leif Johnson, Trent Victor, Johan Engstrom · 2026

We present the Field of Safe Motion (FSM), a quantitative safety model for determining whether a driver maintains a collision-free escape route, or "out," at any given moment by accounting for that dr…

Read Paper →

Engineering Preprint PDF DOI

SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding

Mingyu Zhao, Zijian Lin, Kun Wei, Zhiyong Wu · 2026

Conventional neural speech codecs suffer from severe intelligibility degradation at ultra-low bitrates, where the bottleneck transitions from acoustic distortion to semantic loss. To address this issu…

Read Paper →

Engineering Preprint PDF DOI

Complex Approximate Message Passing with Non-separable Denoising

Vishnu Teja Kunde, Alessandro Mirri, Jean-Francois Chamberland, Enrico Paolini · 2026

Approximate Message Passing (AMP) is a general framework for iterative algorithms, originally developed for compressed sensing and later extended to a wide range of high-dimensional inference problems…

Read Paper →

Engineering Preprint PDF DOI

ALAS: Adaptive Long-Horizon Action Synthesis via Async-pathway Stream Disentanglement

Yutong Shen, Hangxu Liu, Lei Zhang, Penghui Liu, Yinqi Liu, Liuxiang Yang, Tongtong Feng · 2026

Long-Horizon (LH) tasks in Human-Scene Interaction (HSI) are complex multi-step tasks that require continuous planning, sequential decision-making, and extended execution across domains to achieve the…

Read Paper →

Engineering Preprint PDF DOI

World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems

Runze Li, Hongyin Zhang, Junxi Jin, Qixin Zeng, Zifeng Zhuang, Yiqi Tang, Shangke Lyu, Donglin Wang · 2026

Vision-Language-Action (VLA) models have emerged as a promising paradigm for building embodied agents that ground perception and language into action. However, most existing approaches rely on direct …

Read Paper →

Engineering Preprint PDF DOI

Wearable AI in the Era of Large Sensor Models

Yize Cai, Baoshen Guo, Guobin Shen, Zhiqing Hong · 2026

As an effective approach to understanding the human-centric physical world, Wearable Artificial Intelligence (AI), which leverages multimodal wearable sensors to understand human physiology and behavi…

Read Paper →

Engineering Preprint PDF DOI

Incremental Residual Reinforcement Learning Toward Real-World Learning for Social Navigation

Haruto Nagahisa, Kohei Matsumoto, Yuki Tomita, Yuki Hyodo, Ryo Kurazume · 2026

As the demand for mobile robots continues to increase, social navigation has emerged as a critical task, driving active research into deep reinforcement learning (RL) approaches. However, because pede…

Read Paper →

Engineering Preprint PDF DOI

TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks

Longyan Wu, Jieji Ren, Chenghang Jiang, Junxi Zhou, Shijia Peng, Ran Huang, Guoying Gu, Li Chen, Hongyang Li · 2026

Handheld paradigms offer an efficient and intuitive way for collecting large-scale demonstration of robot manipulation. However, achieving contact-rich bimanual manipulation through these methods rema…

Read Paper →

Engineering Preprint PDF DOI

CWRNN-INVR: A Coupled WarpRNN based Implicit Neural Video Representation

Yiyang Li, Yanbo Gao, Shuai Li, Zhenyu Du, Jinglin Zhang, Hui Yuan, Mao Ye, Xingyu Gao · 2026

Implicit Neural Video Representation (INVR) has emerged as a novel approach for video representation and compression, using learnable grids and neural networks. Existing methods focus on developing ne…

Read Paper →

Engineering Preprint PDF DOI

Robust Nonlinear System Identification in Reproducing Kernel Hilbert Spaces via Scenario Optimization

Jannis Lubsen, Annika Eichler · 2026

This paper proposes a method for constructing one-step prediction tubes for nonlinear systems using reproducing kernel Hilbert spaces. We approximate a bounded reproducing kernel Hilbert space (RKHS) …

Read Paper →

Engineering Preprint PDF DOI

DDA-Net: Accurate TDD Channel Estimation via Deep Unfolding the Doppler-Delay-Angle Representation of Channel Signals

Yufei Ma, Xu Zhu, Tiejun Li · 2026

In TDD massive MIMO systems, channel estimation under sparse frequency-hopping pilots is challenging: each snapshot captures only one narrow pilot block that hops across frequency, with tens of millis…

Read Paper →

Engineering Preprint PDF DOI

Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards

Matthew Hwang, Yubin Liu, Ryo Hakoda, Takeshi Oishi · 2026

Quadrupedal locomotion over complex terrain has been a long-standing research topic in robotics. While recent reinforcement learning-based locomotion methods improve generalizability and foot-placemen…

Read Paper →

Engineering Preprint PDF DOI

Koopman Subspace Pruning in Reproducing Kernel Hilbert Spaces via Principal Vectors

Dhruv Shah, Jorge Cortes · 2026

Data-driven approximations of the infinite-dimensional Koopman operator rely on finite-dimensional projections, where the predictive accuracy of the resulting models hinges heavily on the invariance o…

Read Paper →

Engineering Preprint PDF DOI

Learning When to See and When to Feel: Adaptive Vision-Torque Fusion for Contact-Aware Manipulation

Jiuzhou Lei, Chang Liu, Yu She, Xiao Liang, Minghui Zheng · 2026

Vision-based policies have achieved a good performance in robotic manipulation due to the accessibility and richness of visual observations. However, purely visual sensing becomes insufficient in cont…

Read Paper →

Engineering Preprint PDF DOI

VisG AV-HuBERT: Viseme-Guided AV-HuBERT

Aristeidis Papadopoulos, Rishabh Jain, Naomi Harte · 2026

Audio-Visual Speech Recognition (AVSR) systems nowadays integrate Large Language Model (LLM) decoders with transformer-based encoders, achieving state-of-the-art results. However, the relative contrib…

Read Paper →

Engineering Preprint PDF DOI

HARNESS: Lightweight Distilled Arabic Speech Foundation Models

Vrunda N. Sukhadia, Shammur Absar Chowdhury · 2026

Large self-supervised speech (SSL) models achieve strong downstream performance, but their size limits deployment in resource-constrained settings. We present HArnESS, an Arabic-centric self-supervise…

Read Paper →

Engineering Preprint PDF DOI

Acoustic-to-articulatory Inversion of the Complete Vocal Tract from RT-MRI with Various Audio Embeddings and Dataset Sizes

Sofiane Azzouz, Pierre-Andre Vuissoz, Yves Laprie · 2026

Articulatory-to-acoustic inversion strongly depends on the type of data used. While most previous studies rely on EMA, which is limited by the number of sensors and restricted to accessible articulato…

Read Paper →

Engineering Preprint PDF DOI

Multimodal-NF: A Wireless Dataset for Near-Field Low-Altitude Sensing and Communications

Mengyuan Li, Qianfan Lu, Jiachen Tian, Hongjun Hu, Yu Han, Xiao Li, Chao-Kai Wen, Shi Jin · 2026

Environment-aware 6G wireless networks demand the deep integration of multimodal and wireless data. However, most existing datasets are confined to 2D terrestrial far-field scenarios, lacking the 3D s…

Read Paper →

Engineering Preprint PDF DOI

Deep Learning-Based Site-Specific Channel Modeling and Inference

Junzhe Song, Ruisi He, Mi Yang, Zhengyu Zhang, Shuaiqi Gao, Bo Ai, Zhangdui Zhong · 2026

Site-specific channel inference plays a critical role in the design and evaluation of next-generation wireless communication systems by considering the surrounding propagation environment. However, tr…

Read Paper →

Browse Research Papers

BUT System Description for CHiME-9 MCoRec Challenge

The Field of Safe Motion: Operationalizing Affordances in the Field of Safe Travel Using Reachability Analysis

SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding

Complex Approximate Message Passing with Non-separable Denoising

ALAS: Adaptive Long-Horizon Action Synthesis via Async-pathway Stream Disentanglement

World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems

Wearable AI in the Era of Large Sensor Models

Incremental Residual Reinforcement Learning Toward Real-World Learning for Social Navigation

TAMEn: Tactile-Aware Manipulation Engine for Closed-Loop Data Collection in Contact-Rich Tasks

CWRNN-INVR: A Coupled WarpRNN based Implicit Neural Video Representation

Robust Nonlinear System Identification in Reproducing Kernel Hilbert Spaces via Scenario Optimization

DDA-Net: Accurate TDD Channel Estimation via Deep Unfolding the Doppler-Delay-Angle Representation of Channel Signals

Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards

Koopman Subspace Pruning in Reproducing Kernel Hilbert Spaces via Principal Vectors

Learning When to See and When to Feel: Adaptive Vision-Torque Fusion for Contact-Aware Manipulation

VisG AV-HuBERT: Viseme-Guided AV-HuBERT

HARNESS: Lightweight Distilled Arabic Speech Foundation Models

Acoustic-to-articulatory Inversion of the Complete Vocal Tract from RT-MRI with Various Audio Embeddings and Dataset Sizes

Multimodal-NF: A Wireless Dataset for Near-Field Low-Altitude Sensing and Communications

Deep Learning-Based Site-Specific Channel Modeling and Inference

Browse by Category

Research Type

Publish Your Research