Expertini Research

Browse Research Papers

12+ open-access research outputs.

Showing 12 results for "susan mniszewski" in Engineering
Engineering · Preprint

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

Shuide Wen, Sungil Seok, Beier Ku, Richee Li, Yubin He, Bowen Qu, Yang Yang, Ping Su, Can Jiao · 2026

We present DLIOS, a Large Language Model (LLM)-augmented real-time multi-modal interactive enhancement overlay system for Douyin (TikTok) live streaming. DLIOS employs a three-layer transparent window…

Engineering · Preprint

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Zhengyang Li, Thomas Graave, Bjorn Moller, Zehang Wu, Matthias Franz, Tim Fingscheidt · 2026

In audiovisual automatic speech recognition (AV-ASR) systems, information fusion of visual features in a pre-trained ASR has been proven as a promising method to improve noise robustness. In this work…

Engineering · Preprint

PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting

Bronya Roni Chernyak, Yael Segal, Yosi Shrem, Joseph Keshet · 2025

Deep learning models excel at many tasks but rely on the assumption that training and test data follow the same distribution. This assumption often does not hold in real-world speech systems, where di…

Engineering · Preprint

Approxify: Automating Energy-Accuracy Trade-offs in Batteryless IoT Devices

Muhammad Abdullah Soomro, Naveed Anwar Bhatti, Muhammad Hamad Alizai · 2024

Batteryless IoT devices, powered by energy harvesting, face significant challenges in maintaining operational efficiency and reliability due to intermittent power availability. Traditional checkpointi…

Engineering · Preprint

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Theophile Stourbe, Victor Miara, Theo Lepage, Reda Dehak · 2024

This paper describes our submitted systems to the ASVspoof 5 Challenge Track 1: Speech Deepfake Detection - Open Condition, which consists of a stand-alone speech deepfake (bonafide vs spoof) detectio…

Engineering · Preprint

Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route

Hao Li, Hanqi Tao, Wentao Huang, Hongcai Zhang, Ran Li · 2023

Despite the success of electric vehicles on land, electrification of maritime ships is challenged by the dilemma of range anxiety and cargo-carrying capacity. The longer range requires larger batterie…

Engineering · Preprint

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, Aravind Ganapathiraju · 2023

New-age conversational agent systems perform both speech emotion recognition (SER) and automatic speech recognition (ASR) using two separate and often independent approaches for real-world application…

Engineering · Preprint

Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG

Shibani Hamsa, Ismail Shahin, Youssef Iraqi, Ernesto Damiani, Naoufel Werghi · 2022

Speech signals are subjected to more acoustic interference and emotional factors than other signals. Noisy emotion-riddled speech data is a challenge for real-time speech processing applications. It i…

Engineering · Preprint

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi · 2022

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model suc…

Engineering · Preprint

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Xiaoxiao Miao, Ian McLoughlin · 2019

This paper presents a novel Dialect Identification (DID) system developed for the Fifth Edition of the Multi-Genre Broadcast challenge, the task of Fine-grained Arabic Dialect Identification (MGB-5 AD…

Engineering · Preprint

Socially-Aware Navigation: A Non-linear Multi-Objective Optimization Approach

Santosh Balajee Banisetty, Scott Forer, Logan Yliniemi, Monica Nicolescu, David Feil-Seifer · 2019

Mobile robots are increasingly populating homes, hospitals, shopping malls, factory floors, and other human environments. Human society has social norms that people mutually accept; obeying these norm…

Engineering · Preprint

Towards a Unified Planner For Socially-Aware Navigation

Santosh Balajee Banisetty, David Feil-Seifer · 2018

This paper presents the framework for a novel Unified Socially-Aware Navigation (USAN) architecture and explains its need in Socially Assistive Robotics (SAR) applications. Our approach emphasizes int…
