Sverre Steen in Engineering — Research Repository

Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Read Paper →

Engineering Preprint PDF DOI

Array Zooming Optimization for Near-Field Localization With Movable Antennas

Yuxin Duan, Boyu Teng, Xiaojun Yuan, Rui Wang · 2026

The emergence of movable antenna (MA) technology provides a promising way to enhance wireless sensing and communication by introducing spatial degrees of freedom through dynamic array reconfiguration.…

Read Paper →

Engineering Preprint PDF DOI

A Two Stage Pipeline for Left Atrial Wall Constrained Scar Segmentation and Localization from LGE-MR Images

Bipasha Kundu, Cristian Linte · 2026

Accurate segmentation and localization of left atrial (LA) ablation scars from Late gadolinium enhancement (LGE)-MRI is essential for assessing the lesion completeness and guiding ablation therapy. In…

Read Paper →

Engineering Preprint PDF DOI

Risk-Aware Multi-Market Scheduling of Virtual Power Plants with Dynamic Network Tariffs

Lorenzo Zapparoli, Paul Fath, Blazhe Gjorgiev, Giovanni Sansavini · 2026

As the penetration of distributed energy resources (DERs) increases, harnessing their flexibility becomes critical for power system operations. Virtual power plants (VPPs) offer a promising solution. …

Read Paper →

Engineering Preprint PDF DOI

Optimizing Tracking Accuracy in Energy-Constrained Multimodal ISAC via Lyapunov-Driven Heterogeneous Mixture-of-Experts

Wenqi Fan, Ning Wei, Ahmad Bazzi, Rongyan Xi, Zhixian Song, You Li, Zhihan Zeng, Yue Xiu, Chadi Assi · 2026

The integration of multimodal sensing and millimeter-wave (mmWave) communications is a key enabler for highly mobile vehicle-to-infrastructure (V2I) networks. However, continuous high-resolution visua…

Read Paper →

Engineering Preprint PDF DOI

Dual-LoRA: Parameter-Efficient Adversarial Disentanglement for Cross-Lingual Speaker Verification

Qituan Shangguan, Junhao Du, Kunyang Peng, Feng Xue, Hui Zhang, Xinsheng Wang, Kai Yu, Shuai Wang · 2026

Cross-lingual speaker verification suffers from severe language-speaker entanglement. This causes systematic degradation in the hardest scenario: correctly accepting utterances from the same speaker a…

Read Paper →

Engineering Preprint PDF DOI

SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding

Mingyu Zhao, Zijian Lin, Kun Wei, Zhiyong Wu · 2026

Conventional neural speech codecs suffer from severe intelligibility degradation at ultra-low bitrates, where the bottleneck transitions from acoustic distortion to semantic loss. To address this issu…

Read Paper →

Engineering Preprint PDF DOI

Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

Ryan Allen, Melissa Greeff · 2026

Reliable backup localization for unmanned aerial vehicles (UAVs) operating in GNSS-denied nighttime conditions remains an open challenge due to the severe modality gap between daytime RGB maps and nig…

Read Paper →

Engineering Preprint PDF DOI

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

Jaskirat Sudan, Hashim Ali, Surya Subramani, Hafiz Malik · 2026

Supervised contrastive learning (SupCon) is widely used to shape representations, but has seen limited targeted study for audio deepfake detection. Existing work typically combines contrastive terms w…

Read Paper →

Engineering Preprint PDF DOI

The role of physical models in the validation and calibration of numerical models -- The example of the Lilleb{\ae}lt Bridge

Paula Apollonia Wunderlich, Gledson Rodrigo Tondo, Guido Morgenthal · 2026

With the rapid advancement of computer technologies enabling fast calculations of complex structures, numerical methods have become a central tool in engineering sciences, while physical models have i…

Read Paper →

Engineering Preprint PDF DOI

Monitoring exposure-length variations in submarine power cables using distributed fiber-optic sensing

Sakiko Mishima, Yoshiyuki Yajima, Noriyuki Tonami, Tomoyuki Hino, Shugo Aibe, Junichiro Saikawa, Koji Mizuguchi · 2026

This study proposes an anomaly-detection framework for monitoring exposure-length variations in submarine free-span cables using Distributed Acoustic Sensing (DAS), which is one of the distributed fib…

Read Paper →

Engineering Preprint PDF DOI

Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations

Zhengru Fang, Yu Guo, Fei Liu, Yuang Zhang, Yihang Tao, Senkang Hu, Wenbo Ding, Yuguang Fang · 2026

Visual reinforcement learning aims to empower an agent to learn policies from visual observations, yet it remains vulnerable to dynamic visual perturbations, such as unpredictable shifts in corruption…

Read Paper →

Engineering Preprint PDF DOI

MACAW: Matching-free Acquisition of Channels with Anisotropic Wavefronts

Heling Zhang, Shidong Zhou · 2026

The escalating data rate demands of future wireless communications necessitate the deployment of extremely large aperture arrays (ELAA) at base stations. In such configurations, wireless channel chara…

Read Paper →

Engineering Preprint PDF DOI

Pedestrians play chicken with an autonomous vehicle

Rakshit Soni, Charles Fox · 2026

Automated vehicles (AVs) are commonly programmed to yield unconditionally to pedestrians in the interest of safety. However, this design choice can give rise to the Freezing Robot Problem in which ped…

Read Paper →

Engineering Preprint PDF DOI

Deep Learning-Enabled Dissolved Oxygen Sensing in Biofouling Environments for Ocean Monitoring

Nikolaos Salaris, Adrien Desjardins, Manish K. Tiwari · 2026

The escalating climate crisis and ecosystem degradation demand intelligent, low-cost sensors capable of robust, long-term monitoring in real-world environments. Absolute dissolved oxygen (DO) concentr…

Read Paper →

Engineering Preprint PDF DOI

AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation

Kai Yang, Zedong Chu, Yingnan Guo, Zhengbo Wang, Shichao Xie, Yanfen Shen, Xiaolong Wu, Xing Li, Mu Xu · 2026

While Vision-Language-Action (VLA) models have been demonstrated possessing strong zero-shot generalization for robot control, their massive parameter sizes typically necessitate cloud-based deploymen…

Read Paper →

Engineering Preprint PDF DOI

Betting for Sim-to-Real Performance Evaluation

Zaid Mahboob, Yujia Chen, Bowen Weng · 2026

This paper studies the problem of robot performance evaluation, focusing on how to obtain accurate and efficient estimates of real-world behavior under severe constraints on physical experimentation. …

Read Paper →

Engineering Preprint PDF DOI

Efficient Near Field Beam Tracking via Thompson Sampling

Junchi Liu, Zijun Wang, Shawn Tsai, Rui Zhang · 2026

The shift to the radiative near field region due to large antenna arrays necessitates beamforming that accounts for both angle and range, evolving mobility management into a joint angular range tracki…

Read Paper →

Engineering Preprint PDF DOI

Adaptive Spatial-Temporal Graph Learning-Enabled Short-Term Voltage Stability Assessment against Time-Varying Topological Conditions

Chao Deng, Lipeng Zhu, Chang Liu, Hefeng Zhai, Baoye Tian, Zexiang Zhu, Jiayong Li, Cong Zhang · 2026

The emerging deep learning (DL) technology has recently exhibited great potential in data-driven short-term voltage stability (SVS) assessment of complex power grids. However, without sufficient atten…

Read Paper →

Engineering Preprint PDF DOI

A 99-Line Homogenization Code for Lattice-skin Plate Structures

Zhongkai Ji, Dawei Li, Yong Zhao, Wenhe Zhao · 2026

Recent years have seen growing application potential for Lattice-skin Plate Structures in advanced manufacturing fields such as aerospace and automotive engineering. For multiscale performance evaluat…

Read Paper →

Browse Research Papers

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Array Zooming Optimization for Near-Field Localization With Movable Antennas

A Two Stage Pipeline for Left Atrial Wall Constrained Scar Segmentation and Localization from LGE-MR Images

Risk-Aware Multi-Market Scheduling of Virtual Power Plants with Dynamic Network Tariffs

Optimizing Tracking Accuracy in Energy-Constrained Multimodal ISAC via Lyapunov-Driven Heterogeneous Mixture-of-Experts

Dual-LoRA: Parameter-Efficient Adversarial Disentanglement for Cross-Lingual Speaker Verification

SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding

Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

The role of physical models in the validation and calibration of numerical models -- The example of the Lilleb{\ae}lt Bridge

Monitoring exposure-length variations in submarine power cables using distributed fiber-optic sensing

Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations

MACAW: Matching-free Acquisition of Channels with Anisotropic Wavefronts

Pedestrians play chicken with an autonomous vehicle

Deep Learning-Enabled Dissolved Oxygen Sensing in Biofouling Environments for Ocean Monitoring

AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation

Betting for Sim-to-Real Performance Evaluation

Efficient Near Field Beam Tracking via Thompson Sampling

Adaptive Spatial-Temporal Graph Learning-Enabled Short-Term Voltage Stability Assessment against Time-Varying Topological Conditions

A 99-Line Homogenization Code for Lattice-skin Plate Structures

Browse by Category

Research Type

Publish Your Research