Dusan Malbaski in Engineering — Research Repository

Engineering Preprint PDF DOI

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Zhengyang Li, Thomas Graave, Bjorn Moller, Zehang Wu, Matthias Franz, Tim Fingscheidt · 2026

In audiovisual automatic speech recognition (AV-ASR) systems, information fusion of visual features in a pre-trained ASR has been proven as a promising method to improve noise robustness. In this work…

Read Paper →

Engineering Preprint PDF DOI

PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting

Bronya Roni Chernyak, Yael Segal, Yosi Shrem, Joseph Keshet · 2025

Deep learning models excel at many tasks but rely on the assumption that training and test data follow the same distribution. This assumption often does not hold in real-world speech systems, where di…

Read Paper →

Engineering Preprint PDF DOI

Approxify: Automating Energy-Accuracy Trade-offs in Batteryless IoT Devices

Muhammad Abdullah Soomro, Naveed Anwar Bhatti, Muhammad Hamad Alizai · 2024

Batteryless IoT devices, powered by energy harvesting, face significant challenges in maintaining operational efficiency and reliability due to intermittent power availability. Traditional checkpointi…

Read Paper →

Engineering Preprint PDF DOI

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Theophile Stourbe, Victor Miara, Theo Lepage, Reda Dehak · 2024

This paper describes our submitted systems to the ASVspoof 5 Challenge Track 1: Speech Deepfake Detection - Open Condition, which consists of a stand-alone speech deepfake (bonafide vs spoof) detectio…

Read Paper →

Engineering Preprint PDF DOI

Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route

Hao Li, Hanqi Tao, Wentao Huang, Hongcai Zhang, Ran Li · 2023

Despite the success of electric vehicles on land, electrification of maritime ships is challenged by the dilemma of range anxiety and cargo-carrying capacity. The longer range requires larger batterie…

Read Paper →

Engineering Preprint PDF DOI

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, Aravind Ganapathiraju · 2023

New-age conversational agent systems perform both speech emotion recognition (SER) and automatic speech recognition (ASR) using two separate and often independent approaches for real-world application…

Read Paper →

Engineering Preprint PDF DOI

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi · 2022

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model suc…

Read Paper →

Engineering Preprint PDF DOI

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

Karam Park, Jae Woong Soh, Nam Ik Cho · 2021

Deep learning methods have shown outstanding performance in many applications, including single-image super-resolution (SISR). With residual connection architecture, deeply stacked convolutional neura…

Read Paper →

Engineering Preprint PDF DOI

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Xiaoxiao Miao, Ian McLoughlin · 2019

This paper presents a novel Dialect Identification (DID) system developed for the Fifth Edition of the Multi-Genre Broadcast challenge, the task of Fine-grained Arabic Dialect Identification (MGB-5 AD…

Read Paper →

Engineering Preprint PDF DOI

Socially-Aware Navigation: A Non-linear Multi-Objective Optimization Approach

Santosh Balajee Banisetty, Scott Forer, Logan Yliniemi, Monica Nicolescu, David Feil-Seifer · 2019

Mobile robots are increasingly populating homes, hospitals, shopping malls, factory floors, and other human environments. Human society has social norms that people mutually accept; obeying these norm…

Read Paper →

Engineering Preprint PDF DOI

VS-Net: Variable splitting network for accelerated parallel MRI reconstruction

Jinming Duan, Jo Schlemper, Chen Qin, Cheng Ouyang, Wenjia Bai, Carlo Biffi, Ghalib Bello, Ben Statton, Declan P O'Regan, Daniel Rueckert · 2019

In this work, we propose a deep learning approach for parallel magnetic resonance imaging (MRI) reconstruction, termed a variable splitting network (VS-Net), for an efficient, high-quality reconstruct…

Read Paper →

Engineering Preprint PDF DOI

Towards a Unified Planner For Socially-Aware Navigation

Santosh Balajee Banisetty, David Feil-Seifer · 2018

This paper presents the framework for a novel Unified Socially-Aware Navigation (USAN) architecture and explains its need in Socially Assistive Robotics (SAR) applications. Our approach emphasizes int…

Read Paper →

Browse Research Papers

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting

Approxify: Automating Energy-Accuracy Trade-offs in Batteryless IoT Devices

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Socially-Aware Navigation: A Non-linear Multi-Objective Optimization Approach

VS-Net: Variable splitting network for accelerated parallel MRI reconstruction

Towards a Unified Planner For Socially-Aware Navigation

Browse by Category

Research Type

Publish Your Research