Severin Lust · Engineering · Preprint — Research Repository

Engineering Preprint PDF DOI

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models

Hao Chen, Jiaming Liu, Zhonghao Yan, Nuowei Han, Renrui Zhang, Chenyang Gu, Jialin Gao, Ziyu Guo, Siyuan Qian, Yinxi Wang, Peng Jia, Chi-Wing Fu, Shanghang Zhang, Pheng-Ann Heng · 2026

Vision-Language-Action (VLA) models have increasingly incorporated reasoning mechanisms for complex robotic manipulation. However, existing approaches share a critical limitation: whether employing ex…

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Connected Dependability Cage: Run-Time Function and Anomaly Monitoring for the Development and Operation of Safe Automated Vehicles

Iqra Aslam, Nour Habib, Abhishek Buragohain, Meng Zhang, Andreas Rausch, Vaibhav Tiwari, Mohamed Benchat · 2026

The advancement of automated vehicles introduces complex safety challenges, particularly in dynamic and unpredictable environments where AI-enabled perception systems must operate reliably. Ensuring c…

Robot Learning from Human Videos: A Survey

Junyi Ma, Erhang Zhang, Haoran Yang, Ditao Li, Chenyang Xu, Guangming Wang, Hesheng Wang · 2026

A critical bottleneck hindering further advancement in embodied AI and robotics is the challenge of scaling robot data. To address this, the field of learning robot manipulation skills from human vide…

SASI: Leveraging Sub-Action Semantics for Robust Early Action Recognition in Human-Robot Interaction

Yongpeng Cao, Masahiro Hirano, Hyuno Kim, Yuji Yamakawa · 2026

Understanding human actions is critical for advancing behavior analysis in human-robot interaction. Particularly in tasks that demand quick and proactive feedback, robots must recognize human actions …

Array Zooming Optimization for Near-Field Localization With Movable Antennas

Yuxin Duan, Boyu Teng, Xiaojun Yuan, Rui Wang · 2026

The emergence of movable antenna (MA) technology provides a promising way to enhance wireless sensing and communication by introducing spatial degrees of freedom through dynamic array reconfiguration.…

Impact of Background Dense Multipath Components on Multi-Band Fusion ISAC Systems

Dexin Wang, Roberto Bomfin, Ahmad Bazzi, Marwa Chafii · 2026

Multi-band sensing has emerged as a key enabler of integrated sensing and communication (ISAC), one of the six primary usage scenarios defined for IMT-2030 (6G). The introduction of frequency range 3 …

A Two Stage Pipeline for Left Atrial Wall Constrained Scar Segmentation and Localization from LGE-MR Images

Bipasha Kundu, Cristian Linte · 2026

Accurate segmentation and localization of left atrial (LA) ablation scars from late gadolinium enhancement (LGE)-MRI is essential for assessing lesion completeness and guiding ablation therapy. In…

Exploring Converter Control Duality in Microgrids: AC Grid-Forming vs DC Droop Control

Jovan Krajacic, Ognjen Stanojev, Mario Schweizer, Orcun Karaca, Gabriela Hug, Vladan Lazarevic · 2026

Power electronic converters are fundamental building blocks of both AC and DC microgrids, enabling the integration of renewable energy sources, energy storage systems, electronic loads, and electric v…

LLM-Flax : Generalizable Robotic Task Planning via Neuro-Symbolic Approaches with Large Language Models

Seongmin Kim, Daegyu Lee · 2026

Deploying a neuro-symbolic task planner on a new domain today requires significant manual effort: a domain expert must author relaxation and complementary rules, and hundreds of training problems must…

3D Generation for Embodied AI and Robotic Simulation: A Survey

Tianwei Ye, Yifan Mao, Minwen Liao, Jian Liu, Chunchao Guo, Dazhao Du, Quanxin Shou, Fangqi Zhu, Song Guo · 2026

Embodied AI and robotic systems increasingly depend on scalable, diverse, and physically grounded 3D content for simulation-based training and real-world deployment. While 3D generative modeling has a…

CONCERTO : Optimization of readout electronics

Mounir Abdkrimi (NEEL - MagSup), Olivier Rossetto (LPSC), Olivier Bourrion (LPSC), Christophe Hoarau (LPSC), Christophe Vescovi (LPSC) · 2026

The CONCERTO millimeter-wave spectral-imaging instrument was deployed on the Atacama Pathfinder EXperiment (APEX), where it acquired science data between April 2021 and May 2023. The instrument featur…

Optimizing Tracking Accuracy in Energy-Constrained Multimodal ISAC via Lyapunov-Driven Heterogeneous Mixture-of-Experts

Wenqi Fan, Ning Wei, Ahmad Bazzi, Rongyan Xi, Zhixian Song, You Li, Zhihan Zeng, Yue Xiu, Chadi Assi · 2026

The integration of multimodal sensing and millimeter-wave (mmWave) communications is a key enabler for highly mobile vehicle-to-infrastructure (V2I) networks. However, continuous high-resolution visua…

Dual-LoRA: Parameter-Efficient Adversarial Disentanglement for Cross-Lingual Speaker Verification

Qituan Shangguan, Junhao Du, Kunyang Peng, Feng Xue, Hui Zhang, Xinsheng Wang, Kai Yu, Shuai Wang · 2026

Cross-lingual speaker verification suffers from severe language-speaker entanglement. This causes systematic degradation in the hardest scenario: correctly accepting utterances from the same speaker a…

SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding

Mingyu Zhao, Zijian Lin, Kun Wei, Zhiyong Wu · 2026

Conventional neural speech codecs suffer from severe intelligibility degradation at ultra-low bitrates, where the bottleneck transitions from acoustic distortion to semantic loss. To address this issu…

Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

Ryan Allen, Melissa Greeff · 2026

Reliable backup localization for unmanned aerial vehicles (UAVs) operating in GNSS-denied nighttime conditions remains an open challenge due to the severe modality gap between daytime RGB maps and nig…

One Voice, Many Tongues: Cross-Lingual Voice Cloning for Scientific Speech

Amanuel Gizachew Abebe, Yasmin Moslem · 2026

Preserving a speaker's voice identity while generating speech in a different language remains a fundamental challenge in spoken language technology, particularly in specialized domains such as scienti…

FlowS: One-Step Motion Prediction via Local Transport Conditioning

Leandro Di Bella, Adrian Munteanu, Bruno Cornelis · 2026

Generative motion prediction must satisfy three simultaneous requirements for real-world autonomy: high accuracy, diverse multimodal futures, and strictly bounded latency. Diffusion models meet the fi…

KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning

Yixuan Huang, Bowen Li, Vaibhav Saxena, Yichao Liang, Utkarsh Aashu Mishra, Liang Ji, Lihan Zha, Jimmy Wu, Nishanth Kumar, Sebastian Scherer, Danfei Xu, Tom Silver · 2026

Robotic systems that interact with the physical world must reason about kinematic and dynamic constraints imposed by their own embodiment, their environment, and the task at hand. We introduce KinDER,…

SpecFed: Accelerating Federated LLM Inference with Speculative Decoding and Compressed Transmission

Ce Zheng, Xinghan Wang, Jiahong Ning, Yuxuan Shi, Ning Huang, Tingting Yang · 2026

Federated inference enhances LLM performance in edge computing through weighted averaging of distributed model predictions. However, autoregressive LLM inference requires frequent full-model forward p…
