Allison Beemer in Engineering — Research Repository

Engineering Preprint PDF DOI

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)

Chun-wei Ho, Sabato Marco Siniscalchi, Kai Li, Chin-Hui Lee · 2026

We propose a knowledge-driven approach to speech target extraction in the presence of background sound effects already recorded in cinematic audio. The specific knowledge sources studied are manners o…

Circular Phase Representation and Geometry-Aware Optimization for Ptychographic Image Reconstruction

Carson Yu Liu, Jun Cheng, Chien-Chun Chen, Steve F. Shu · 2026

Traditional iterative reconstruction methods are accurate but computationally expensive, limiting their use in high-throughput and real-time ptychography. Recent deep learning approaches improve speed…

Split over $n$ resource sharing problem: Are fewer capable agents better than many simpler ones?

Karthik Soma, Mohamed S. Talamali, Genki Miyauchi, Giovanni Beltrame, Heiko Hamann, Roderich Gross · 2026

In multi-agent systems, should limited resources be concentrated into a few capable agents or distributed among many simpler ones? This work formulates the split over $n$ resource sharing problem wher…

An Efficient Beam Search Algorithm for Active Perception in Mobile Robotics

Kaixian Qu, Han Wang, Victor Klemm, Cesar Cadena, Marco Hutter · 2026

Active perception is a fundamental problem in autonomous robotics in which the robot must decide where to move and what to sense in order to obtain the most informative observations for accomplishing …

Adaptive Spatial-Temporal Graph Learning-Enabled Short-Term Voltage Stability Assessment against Time-Varying Topological Conditions

Chao Deng, Lipeng Zhu, Chang Liu, Hefeng Zhai, Baoye Tian, Zexiang Zhu, Jiayong Li, Cong Zhang · 2026

The emerging deep learning (DL) technology has recently exhibited great potential in data-driven short-term voltage stability (SVS) assessment of complex power grids. However, without sufficient atten…

A Vehicle Routing Problem for Human-Centered Electric Mobility

Mostafa Emam, Bjorn Martens, Thomas Rottmann, Matthias Gerdts · 2026

In this paper, we present the Electric Mobility Dial-a-Ride Problem (EM-DARP), which extends the Electric Vehicle Dial-a-Ride Problem (EV-DARP) to better accommodate human-focused mobility services. T…

GazeVLA: Learning Human Intention for Robotic Manipulation

Chengyang Li, Kaiyi Xiong, Yuan Xu, Lei Qian, Yizhou Wang, Wentao Zhu · 2026

Embodied foundation models have achieved significant breakthroughs in robotic manipulation, yet they still depend heavily on large-scale robot demonstrations. Although recent works have explored lever…

Useful nonrobust features are ubiquitous in biomedical images

Coenraad Mouton, Randle Rabe, Niklas C. Koser, Nicolai Krekiehn, Christopher Hansen, Jan-Bernd Hovener, Claus-C. Gluer · 2026

We study whether deep networks for medical imaging learn useful nonrobust features - predictive input patterns that are not human interpretable and highly susceptible to small adversarial perturbation…

DM-ASR: Diarization-aware Multi-speaker ASR with Large Language Models

Li Li, Ming Cheng, Weixin Zhu, Yannan Wang, Juan Liu, Ming Li · 2026

Multi-speaker automatic speech recognition (ASR) aims to transcribe conversational speech involving multiple speakers, requiring the model to capture not only what was said, but also who said it and s…

DiffNR: Diffusion-Enhanced Neural Representation Optimization for Sparse-View 3D Tomographic Reconstruction

Shiyan Su, Ruyi Zha, Danli Shi, Hongdong Li, Xuelian Cheng · 2026

Neural representations (NRs), such as neural fields and 3D Gaussians, effectively model volumetric data in computed tomography (CT) but suffer from severe artifacts under sparse-view settings. To addr…

FalconApp: Rapid iPhone Deployment of End-to-End Perception via Automatically Labeled Synthetic Data

Yan Miao, Will Shen, Sayan Mitra · 2026

Reliable perception for robotics depends on large-scale labeled data, yet real-world datasets rely on heavy manual annotation and are time-consuming to produce. We present FalconApp, an iPhone app wit…

RoomRecon: High-Quality Textured Room Layout Reconstruction on Mobile Devices

Seok Joon Kim, Dinh Duc Cao, Federica Spinola, Se Jin Lee, Kyu Sung Cho · 2026

Widespread RGB-Depth (RGB-D) sensors and advanced 3D reconstruction technologies facilitate the capture of indoor spaces, improving the fields of augmented reality (AR), virtual reality (VR), and exte…

DAG-STL: A Hierarchical Framework for Zero-Shot Trajectory Planning under Signal Temporal Logic Specifications

Ruijia Liu, Ancheng Hou, Xiao Yu, Xiang Yin · 2026

Signal Temporal Logic (STL) is a powerful language for specifying temporally structured robotic tasks. Planning executable trajectories under STL constraints remains difficult when system dynamics and…

Locomotion of an Elastic Snake Robot via Natural Dynamics

Tristan Ehlert, Arne Sachtler, Annika Schmidt, Davide Calzolari, Alin Albu-Schaffer · 2026

Nature suggests that exploiting the elasticities and natural dynamics of robotic systems could increase their locomotion efficiency. Prior work on elastic snake robots supports this hypothesis, but ha…

Driving risk emerges from the required two-dimensional joint evasive acceleration

Hao Cheng, Yanbo Jiang, Wenhao Yu, Rui Zhou, Jiang Bian, Keyu Chen, Zhiyuan Liu, Heye Huang, Hailun Zhang, Fang Zhang, Jianqiang Wang, Sifa Zheng · 2026

Most autonomous driving safety benchmarks use time-to-collision (TTC) to assess risk and guide safe behaviour. However, TTC-based methods treat risk as a one-dimensional closing problem, despite the i…

Goal-oriented Resource Allocation for Collaborative Integrated Sensing and Communication

Trong Duy Tran (L2S, VNU-UET), Maxime Ferreira Da Costa (L2S), Salah Eddine Elayoubi (L2S), Nguyen Linh Trung (VNU-UET) · 2026

In this paper, we consider resource allocation for a collaborative integrated sensing and communication (ISAC) scenario, in which distributed smart devices can be scheduled to perform sensing and tran…

Beyond Conservative Automated Driving in Multi-Agent Scenarios via Coupled Model Predictive Control and Deep Reinforcement Learning

Saeed Rahmani, Gozde Korpe, Zhenlin (Gavin) Xu, Bruno Brito, Simeon Craig Calvert, Bart van Arem · 2026

Automated driving at unsignalized intersections is challenging due to complex multi-vehicle interactions and the need to balance safety and efficiency. Model Predictive Control (MPC) offers structured…

Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization

Jianzong Wang, Botao Zhao, Yayun He, Junqing Peng, Xulong Zhang · 2026

Achieving general-purpose robotics requires empowering robots to adapt and evolve based on their environment and feedback. Traditional methods face limitations such as extensive training requirements,…

Uncertainty Guided Exploratory Trajectory Optimization for Sampling-Based Model Predictive Control

O. Goktug Poyrazoglu, Yukang Cao, Rahul Moorthy, Volkan Isler · 2026

Trajectory optimization depends heavily on initialization. In particular, sampling-based approaches are highly sensitive to initial solutions, and limited exploration frequently leads them to converge…

Why Your Tokenizer Fails in Information Fusion: A Timing-Aware Pre-Quantization Fusion for Video-Enhanced Audio Tokenization

Xiangyu Zhang, Benjamin John Southwell, Siqi Pan, Xinlei Niu, Beena Ahmed, Julien Epps · 2026

Audio tokenization has emerged as a critical component in end-to-end audio language models, enabling efficient discrete representation learning for both audio understanding and generation tasks. Howev…
