Krysta M Svore in Engineering — Research Repository

Engineering Preprint PDF DOI

BUT System Description for CHiME-9 MCoRec Challenge

Dominik Klement, Alexander Polok, Nguyen Hai Phong, Prachi Singh, Lukas Burget · 2026

Multi-talker automatic speech recognition (ASR) in conversational recordings remains an open problem, particularly in scenarios with large portion of overlapping speech where identifying and transcrib…

Read Paper →

Engineering Preprint PDF DOI

Safe Navigation using Neural Radiance Fields via Reachable Sets

Omanshu Thapliyal, Malarvizhi Sankaranarayanasamy, Ravigopal Vennelakanti · 2026

Safe navigation in cluttered environments is an important challenge for autonomous systems. Robots navigating through obstacle ridden scenarios need to be able to navigate safely in the presence of ob…

Read Paper →

Engineering Preprint PDF DOI

Atomic-Probe Governance for Skill Updates in Compositional Robot Policies

Xue Qin, Simin Luan, John See, Cong Yang, Zhijun Li · 2026

Skill libraries in deployed robotic systems are continually updated through fine-tuning, fresh demonstrations, or domain adaptation, yet existing typed-composition methods (BLADE, SymSkill, Generative…

Read Paper →

Engineering Preprint PDF DOI

SEP Analysis of Quantized SIMO Systems with M-PSK over Correlated Fading Channels

Amila Ravinath, Bikshapathi Gouda, Italo Atzeni, Antti Tolli · 2026

The average symbol error probability (SEP) of a phase-quantized single-input multiple-output system with M-ary phase-shift keying modulation and maximum ratio combining (MRC) is analyzed under correla…

Read Paper →

Engineering Preprint PDF DOI

Validating the Clinical Utility of CineECG 3D Reconstructions through Cross-Modal Feature Attribution

Karol Dobiczek, Maciej Mozolewski, Szymon Bobek, Micha{l} Szafarczyk, Peter van Dam, Grzegorz J. Nalepa · 2026

Deep learning models for 12-lead electrocardiogram (ECG) analysis achieve high diagnostic performance but lack the intuitive interpretability required for clinical integration. Standard feature attrib…

Read Paper →

Engineering Preprint PDF DOI

Optimizing Tracking Accuracy in Energy-Constrained Multimodal ISAC via Lyapunov-Driven Heterogeneous Mixture-of-Experts

Wenqi Fan, Ning Wei, Ahmad Bazzi, Rongyan Xi, Zhixian Song, You Li, Zhihan Zeng, Yue Xiu, Chadi Assi · 2026

The integration of multimodal sensing and millimeter-wave (mmWave) communications is a key enabler for highly mobile vehicle-to-infrastructure (V2I) networks. However, continuous high-resolution visua…

Read Paper →

Engineering Preprint PDF DOI

Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

Ryan Allen, Melissa Greeff · 2026

Reliable backup localization for unmanned aerial vehicles (UAVs) operating in GNSS-denied nighttime conditions remains an open challenge due to the severe modality gap between daytime RGB maps and nig…

Read Paper →

Engineering Preprint PDF DOI

Robustness Evaluation of a Foundation Segmentation Model Under Simulated Domain Shifts in Abdominal CT: Implications for Health Digital Twin Deployment

Sanghati Basu · 2026

Foundation segmentation models such as the Segment Anything Model (SAM) have demonstrated strong generalization across natural images; however, their robustness under clinically realistic medical imag…

Read Paper →

Engineering Preprint PDF DOI

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Chong-Xin Gan, Peter Bell, Man-Wai Mak, Zhe Li, Zezhong Jin, Zilong Huang, Kong Aik Lee · 2026

The joint training of speech enhancement and speaker embedding networks for speaker recognition is widely adopted under noisy acoustic environments. While effective, this paradigm often fails to lever…

Read Paper →

Engineering Preprint PDF DOI

Robust Accent Identification via Voice Conversion and Non-Timbral Embeddings

Rayane Bakari, Olivier Le Blouch, Nicolas Gengembre, Nicholas Evans · 2026

Automatic accent identification (AID) remains a challenging task due to the complex variability of accents, the entanglement of accent cues with speaker traits, and the scarcity of reliable accentlabe…

Read Paper →

Engineering Preprint PDF DOI

Monitoring exposure-length variations in submarine power cables using distributed fiber-optic sensing

Sakiko Mishima, Yoshiyuki Yajima, Noriyuki Tonami, Tomoyuki Hino, Shugo Aibe, Junichiro Saikawa, Koji Mizuguchi · 2026

This study proposes an anomaly-detection framework for monitoring exposure-length variations in submarine free-span cables using Distributed Acoustic Sensing (DAS), which is one of the distributed fib…

Read Paper →

Engineering Preprint PDF DOI

Real-time windrow detection from onboard tractor sensors for automated following

Lorenz Gunreben, Nico Heider, Sebastian Zurner, Martin Schieck, Bogdan Franczyk · 2026

Proprietary design in commercial windrow-detection systems restricts transparency and limits progress in open autonomous forage-harvesting research. We present a multi-modal dataset combining stereo v…

Read Paper →

Engineering Preprint PDF DOI

Guiding Vector Field Generation via Score-based Diffusion Model

Zirui Chen, Shiliang Guo, Shiyu Zhao · 2026

Guiding Vector Fields (GVFs) are a powerful tool for robotic path following. However, classical methods assume smooth, ordered curves and fail when paths are unordered, multi-branch, or generated by p…

Read Paper →

Engineering Preprint PDF DOI

$M^2$-VLA: Boosting Vision-Language Models for Generalizable Manipulation via Layer Mixture and Meta-Skills

Siyao Xiao, Yuhong Zhang, Zhifang Liu, Zihan Gao, Jingye Zhang, Sinwai Choo, Dake Zhong, Mengzhe Wang, Xiao Lin, Xianfeng Zhou, Jia Jia, Haoqian Wang · 2026

Current Vision-Language-Action (VLA) models predominantly rely on end-to-end fine-tuning. While effective, this paradigm compromises the inherent generalization capabilities of Vision-Language Models …

Read Paper →

Engineering Preprint PDF DOI

A Road-Mobile GNSS-Disciplined Oscillator for Accurate Synchronization of Vehicular Microwave Measurements

Maximilian Engelhardt, Carsten Andrich, Daniel Stanko, Alexander Ihlow, Markus Landmann · 2026

Precise synchronization is essential in various technical disciplines, being especially challenging in mobile scenarios. Unfortunately, state-of-the-art global navigation satellite system (GNSS) disci…

Read Paper →

Engineering Preprint PDF DOI

QuietWalk: Physics-Informed Reinforcement Learning for Ground Reaction Force-Aware Humanoid Locomotion Under Diverse Footwear

Hanze Hu, Luying Feng, Silu Chen, Tianjiang Zheng, Dexin Jiang, Wei Chen, Chi Zhang, Guilin Yang, Yaochu Jin · 2026

Humanoid robots operating in human-centered environments (e.g., homes, hospitals, and offices) must mitigate foot--ground impact transients, as impact-induced vibration and noise degrade user experien…

Read Paper →

Engineering Preprint PDF DOI

Explainable AI in Speaker Recognition -- Making Latent Representations Understandable

Yanze Xu, Wenwu Wang, Mark D. Plumbley · 2026

Neural networks can be trained to learn task-relevant representations from data. Understanding how these networks make decisions falls within the Explainable AI (XAI) domain. This paper proposes to st…

Read Paper →

Engineering Preprint PDF DOI

Collaborative Trajectory Prediction via Late Fusion

Nadya Abdel Madjid, Murad Mebrahtu, Zakhar Yagudin, Bilal Hassan, Naoufel Werghi, Jorge Dias, Dzmitry Tsetserukou, Majid Khonji · 2026

Predicting future trajectories of surrounding traffic agents is critical for safe autonomous navigation and collision avoidance. Despite all advances in the trajectory forecasting realm, the predictio…

Read Paper →

Engineering Preprint PDF DOI

Beyond Acoustic Sparsity and Linguistic Bias: A Prompt-Free Paradigm for Mispronunciation Detection and Diagnosis

Haopeng Geng, Longfei Yang, Xi Chen, Haitong Sun, Daisuke Saito, Nobuaki Minematsu · 2026

Mispronunciation Detection and Diagnosis (MDD) requires modeling fine-grained acoustic deviations. However, current ASR-derived MDD systems often face inherent limitations. In particular, CTC-based mo…

Read Paper →

Engineering Preprint PDF DOI

VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis

Songen Gu, Yuhang Zheng, Weize Li, Yupeng Zheng, Yating Feng, Xiang Li, Yilun Chen, Pengfei Li, Wenchao Ding · 2026

Recently, end-to-end robotic manipulation models have gained significant attention for their generalizability and scalability. However, they often suffer from limited robustness to camera viewpoint ch…

Read Paper →

Browse Research Papers

BUT System Description for CHiME-9 MCoRec Challenge

Safe Navigation using Neural Radiance Fields via Reachable Sets

Atomic-Probe Governance for Skill Updates in Compositional Robot Policies

SEP Analysis of Quantized SIMO Systems with M-PSK over Correlated Fading Channels

Validating the Clinical Utility of CineECG 3D Reconstructions through Cross-Modal Feature Attribution

Optimizing Tracking Accuracy in Energy-Constrained Multimodal ISAC via Lyapunov-Driven Heterogeneous Mixture-of-Experts

Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

Robustness Evaluation of a Foundation Segmentation Model Under Simulated Domain Shifts in Abdominal CT: Implications for Health Digital Twin Deployment

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Robust Accent Identification via Voice Conversion and Non-Timbral Embeddings

Monitoring exposure-length variations in submarine power cables using distributed fiber-optic sensing

Real-time windrow detection from onboard tractor sensors for automated following

Guiding Vector Field Generation via Score-based Diffusion Model

$M^2$-VLA: Boosting Vision-Language Models for Generalizable Manipulation via Layer Mixture and Meta-Skills

A Road-Mobile GNSS-Disciplined Oscillator for Accurate Synchronization of Vehicular Microwave Measurements

QuietWalk: Physics-Informed Reinforcement Learning for Ground Reaction Force-Aware Humanoid Locomotion Under Diverse Footwear

Explainable AI in Speaker Recognition -- Making Latent Representations Understandable

Collaborative Trajectory Prediction via Late Fusion

Beyond Acoustic Sparsity and Linguistic Bias: A Prompt-Free Paradigm for Mispronunciation Detection and Diagnosis

VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis

Browse by Category

Research Type

Publish Your Research