Susan Mniszewski in Engineering — Research Repository

Engineering Preprint PDF DOI

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

Shuide Wen, Sungil Seok, Beier Ku, Richee Li, Yubin He, Bowen Qu, Yang Yang, Ping Su, Can Jiao · 2026

We present DLIOS, a Large Language Model (LLM)-augmented real-time multi-modal interactive enhancement overlay system for Douyin (TikTok) live streaming. DLIOS employs a three-layer transparent window…

Read Paper →

Engineering Preprint PDF DOI

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Zhengyang Li, Thomas Graave, Bjorn Moller, Zehang Wu, Matthias Franz, Tim Fingscheidt · 2026

In audiovisual automatic speech recognition (AV-ASR) systems, information fusion of visual features in a pre-trained ASR has been proven as a promising method to improve noise robustness. In this work…

Read Paper →

Engineering Preprint PDF DOI

PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting

Bronya Roni Chernyak, Yael Segal, Yosi Shrem, Joseph Keshet · 2025

Deep learning models excel at many tasks but rely on the assumption that training and test data follow the same distribution. This assumption often does not hold in real-world speech systems, where di…

Read Paper →

Engineering Preprint PDF DOI

Approxify: Automating Energy-Accuracy Trade-offs in Batteryless IoT Devices

Muhammad Abdullah Soomro, Naveed Anwar Bhatti, Muhammad Hamad Alizai · 2024

Batteryless IoT devices, powered by energy harvesting, face significant challenges in maintaining operational efficiency and reliability due to intermittent power availability. Traditional checkpointi…

Read Paper →

Engineering Preprint PDF DOI

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Theophile Stourbe, Victor Miara, Theo Lepage, Reda Dehak · 2024

This paper describes our submitted systems to the ASVspoof 5 Challenge Track 1: Speech Deepfake Detection - Open Condition, which consists of a stand-alone speech deepfake (bonafide vs spoof) detectio…

Read Paper →

Engineering Preprint PDF DOI

Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route

Hao Li, Hanqi Tao, Wentao Huang, Hongcai Zhang, Ran Li · 2023

Despite the success of electric vehicles on land, electrification of maritime ships is challenged by the dilemma of range anxiety and cargo-carrying capacity. The longer range requires larger batterie…

Read Paper →

Engineering Preprint PDF DOI

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, Aravind Ganapathiraju · 2023

New-age conversational agent systems perform both speech emotion recognition (SER) and automatic speech recognition (ASR) using two separate and often independent approaches for real-world application…

Read Paper →

Engineering Preprint PDF DOI

Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG

Shibani Hamsa, Ismail Shahin, Youssef Iraqi, Ernesto Damiani, Naoufel Werghi · 2022

Speech signals are subjected to more acoustic interference and emotional factors than other signals. Noisy emotion-riddled speech data is a challenge for real-time speech processing applications. It i…

Read Paper →

Engineering Preprint PDF DOI

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi · 2022

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model suc…

Read Paper →

Engineering Preprint PDF DOI

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Xiaoxiao Miao, Ian McLoughlin · 2019

This paper presents a novel Dialect Identification (DID) system developed for the Fifth Edition of the Multi-Genre Broadcast challenge, the task of Fine-grained Arabic Dialect Identification (MGB-5 AD…

Read Paper →

Engineering Preprint PDF DOI

Socially-Aware Navigation: A Non-linear Multi-Objective Optimization Approach

Santosh Balajee Banisetty, Scott Forer, Logan Yliniemi, Monica Nicolescu, David Feil-Seifer · 2019

Mobile robots are increasingly populating homes, hospitals, shopping malls, factory floors, and other human environments. Human society has social norms that people mutually accept; obeying these norm…

Read Paper →

Engineering Preprint PDF DOI

Towards a Unified Planner For Socially-Aware Navigation

Santosh Balajee Banisetty, David Feil-Seifer · 2018

This paper presents the framework for a novel Unified Socially-Aware Navigation (USAN) architecture and explains its need in Socially Assistive Robotics (SAR) applications. Our approach emphasizes int…

Read Paper →

Browse Research Papers

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting

Approxify: Automating Energy-Accuracy Trade-offs in Batteryless IoT Devices

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Socially-Aware Navigation: A Non-linear Multi-Objective Optimization Approach

Towards a Unified Planner For Socially-Aware Navigation

Browse by Category

Research Type

Publish Your Research