Wolfram Decker · Engineering · Preprint — Research Repository

Engineering Preprint PDF DOI

$\mu$-FlowNet: A Deep Learning Approach for Mapping Flow Fields in Irregular Microchannels Using an Attention-based U-Net Encoder-Decoder Architecture

Ganesh Sahadeo Meshram, Suman Chakraborty, Nishant Sinha, Partha Pratim Chakrabarti · 2026

In the complex domain of microfluidic systems, analysing fluid flow patterns through random-shaped circular microchannels is a significantly challenging task. The conventional approach of solving such prob…

Flow Motion Policy: Manipulator Motion Planning with Flow Matching Models

Davood Soleymanzadeh, Xiao Liang, Minghui Zheng · 2026

Open-loop end-to-end neural motion planners have recently been proposed to improve motion planning for robotic manipulators. These methods enable planning directly from sensor observations without rel…

A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems

J. E. Dominguez-Vidal · 2026

Foundation vision-language models are becoming increasingly relevant to robotics because they can provide richer semantic perception than narrow task-specific pipelines. However, their practical adopt…

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation

Ui-Hyeop Shin, Hyung-Min Park · 2026

Speech separation in realistic acoustic environments remains challenging because overlapping speakers, background noise, and reverberation must be resolved simultaneously. Although recent time-frequen…

Koopman Operator Framework for Modeling and Control of Off-Road Vehicle on Deformable Terrain

Kartik Loya, Phanindra Tallapragada · 2026

This work presents a hybrid physics-informed and data-driven modeling framework for predictive control of autonomous off-road vehicles operating on deformable terrain. Traditional high-fidelity terram…

Graph Based Semantic Encoder Decoder Framework for Task Oriented Communications in Connected Autonomous Vehicles

Soheyb Ribouh, Phil Polo Ditsia Di Ngoma · 2026

Connected autonomous vehicles (CAVs) require reliable and efficient communication frameworks to support safety-critical and task-oriented applications such as collision avoidance, cooperative percepti…

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Jaeyoung Lee, Masato Mimura · 2026

We present a decoder-only Conformer for automatic speech recognition (ASR) that processes speech and text in a single stack without external speech encoders or pretrained large language models (LLM). …

A Multi-decoder Neural Tracking Method for Accurately Predicting Speech Intelligibility

Rien Sonck, Bernd Accou, Tom Francart, Jonas Vanthornhout · 2026

Objective: EEG-based methods can predict speech intelligibility, but their accuracy and robustness lag behind behavioral tests, which typically show test-retest differences under 1 dB. We introduce th…

SanD-Planner: Sample-Efficient Diffusion Planner in B-Spline Space for Robust Local Navigation

Jincheng Wang, Lingfan Bao, Tong Yang, Diego Martinez Plasencia, Jianhao Jiao, Dimitrios Kanoulas · 2026

The challenge of generating reliable local plans has long hindered practical applications in highly cluttered and dynamic environments. Key fundamental bottlenecks include acquiring large-scale expert…

Streaming Speech Recognition with Decoder-Only Large Language Models and Latency Optimization

Genshun Wan, Wenhui Zhang, Jing-Xuan Zhang, Shifu Xiong, Jianqing Gao, Zhongfu Ye · 2026

Recent advances have demonstrated the potential of decoder-only large language models (LLMs) for automatic speech recognition (ASR). However, enabling streaming recognition within this framework remain…

T-Mimi: A Transformer-based Mimi Decoder for Real-Time On-Phone TTS

Haibin Wu, Bach Viet Do, Naveen Suda, Julian Chan, Madhavan C R, Gene-Ping Yang, Yi-Chiao Wu, Naoyuki Kanda, Yossef Adi, Xin Lei, Yue Liu, Florian Metze, Yuzong Liu · 2026

Neural audio codecs provide promising acoustic features for speech synthesis, with representative streaming codecs like Mimi providing high-quality acoustic features for real-time Text-to-Speech (TTS)…

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Zhengyang Li, Thomas Graave, Bjorn Moller, Zehang Wu, Matthias Franz, Tim Fingscheidt · 2026

In audiovisual automatic speech recognition (AV-ASR) systems, information fusion of visual features in a pre-trained ASR has been proven as a promising method to improve noise robustness. In this work…

DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning

Junha Lee, Eunha Park, Minsu Cho · 2026

Language-driven dexterous grasp generation requires the models to understand task semantics, 3D geometry, and complex hand-object interactions. While vision-language models have been applied to this p…

Partial Decoder Attention Network with Contour-weighted Loss Function for Data-Imbalance Medical Image Segmentation

Zhengyong Huang, Ning Jiang, Xingwen Sun, Lihua Zhang, Peng Chen, Jens Domke, Yao Sui · 2026

Image segmentation is pivotal in medical image analysis, facilitating clinical diagnosis, treatment planning, and disease evaluation. Deep learning has significantly advanced automatic segmentation me…

Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models

Bang Zeng, Beilong Tang, Wang Xiang, Ming Li · 2026

Target speaker extraction (TSE) aims to recover the speech signal of a desired speaker from a mixed audio recording, given a short enrollment utterance. Most existing TSE approaches are based on discr…

GR-Dexter Technical Report

Ruoshi Wen, Guangzeng Chen, Zhongren Cui, Min Du, Yang Gou, Zhigang Han, Liqun Huang, Mingyu Lei, Yunfei Li, Zhuohang Li, Wenlei Liu, Yuxiao Liu, Xiao Ma, Hao Niu, Yutao Ouyang, Zeyu Ren, Haixin Shi, Wei Xu, Haoxiang Zhang, Jiajun Zhang, Xiao Zhang, Liwei Zheng, Weiheng Zhong, Yifei Zhou, Zhengming Zhu, Hang Li · 2025

Vision-language-action (VLA) models have enabled language-conditioned, long-horizon robot manipulation, but most existing systems are limited to grippers. Scaling VLA policies to bimanual robots with …

Spatial Interpolation of Room Impulse Responses based on Deeper Physics-Informed Neural Networks with Residual Connections

Ken Kurata, Gen Sato, Izumi Tsunokuni, Yusuke Ikeda · 2025

The room impulse response (RIR) characterizes sound propagation in a room from a loudspeaker to a microphone under the linear time-invariant assumption. Estimating RIRs from a limited number of measur…

From Human Bias to Robot Choice: How Occupational Contexts and Racial Priming Shape Robot Selection

Jiangen He, Wanqi Zhang, Jessica Barfield · 2025

As artificial agents increasingly integrate into professional environments, fundamental questions have emerged about how societal biases influence human-robot selection decisions. We conducted two com…

Verified Design of Robotic Autonomous Systems using Probabilistic Model Checking

Atef Azaiez, David Alireza Anisi · 2025

Safety and reliability play a crucial role when designing Robotic Autonomous Systems (RAS). Early consideration of hazards, risks and mitigation actions -- already in the concept study phase -- is im…

Context Representation via Action-Free Transformer encoder-decoder for Meta Reinforcement Learning

Amir M. Soufi Enayati, Homayoun Honari, Homayoun Najjaran · 2025

Reinforcement learning (RL) enables robots to operate in uncertain environments, but standard approaches often struggle with poor generalization to unseen tasks. Context-adaptive meta reinforcement le…
