Edith Hemaspaandra in Engineering — Research Repository

Engineering Preprint PDF DOI

QGas: Interactive Gas Infrastructure Toolkit

Marco Quantschnig, Yannick Werner, Sonja Wogrin, Thomas Klatzer · 2026

Gas infrastructure datasets are essential inputs for energy system planning to support strategic decision-making toward decarbonization. However, relevant data are typically scattered across heterogen…

Read Paper →

Engineering Preprint PDF DOI

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models

Xiyang Wu, Guangyao Shi, Qingzi Wang, Zongxia Li, Amrit Singh Bedi, Dinesh Manocha · 2026

Vision-language-action (VLA) models enable robots to follow natural-language instructions grounded in visual observations, but the instruction channel also introduces a critical vulnerability: small t…

Read Paper →

Engineering Preprint PDF DOI

Editing Away the Evidence: Diffusion-Based Image Manipulation and the Failure Modes of Robust Watermarking

Qian Qi, Jiangyun Tang, Jim Lee, Emily Davis, Finn Carter · 2026

Robust invisible watermarks are widely used to support copyright protection, content provenance, and accountability by embedding hidden signals designed to survive common post-processing operations. H…

Read Paper →

Engineering Preprint PDF DOI

Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

Asif Azad, MD Sadik Hossain Shanto, Mohammad Sadat Hossain, Bdour Alwuqaysi, Sabri Boughorbel, Yahya Bokhari, Abdulrhman Aljouie, Ayah Othman Sindi, Ehsan Hoque · 2026

Automated phoneme-level pronunciation assessment is vital for scalable speech therapy and language learning, yet validated tools for Arabic remain scarce. We present Harf-Speech, a modular system scor…

Read Paper →

Engineering Preprint PDF DOI

STRIDE: Post-Training LLMs to Reason and Refine Bio-Sequences via Edit Trajectories

Daiheng Zhang, Shiyang Zhang, Sizhuang He, Yangtian Zhang, Syed Asad Rizvi, David van Dijk · 2026

Discrete biological sequence optimization requires iterative refinement under strict syntactic constraints. Diffusion models offer progressive refinement but do not naturally expose controllable discr…

Read Paper →

Engineering Preprint PDF DOI

RoboSubtaskNet: Temporal Sub-task Segmentation for Human-to-Robot Skill Transfer in Real-World Environments

Dharmendra Sharma, Archit Sharma, John Rebeiro, Vaibhav Kesharwani, Peeyush Thakur, Narendra Kumar Dhar, Laxmidhar Behera · 2026

Temporally locating and classifying fine-grained sub-task segments in long, untrimmed videos is crucial to safe human-robot collaboration. Unlike generic activity recognition, collaborative manipulati…

Read Paper →

Engineering Preprint PDF DOI

Design and Evaluation of an Assisted Programming Interface for Behavior Trees in Robotics

Jonathan Styrud, Matteo Iovino, Rebecca Stower, Mart Kartasev, Mikael Norrlof, M{aa}rten Bjorkman, Christian Smith · 2026

The possibility to create reactive robot programs faster without the need for extensively trained programmers is becoming increasingly important. So far, it has not been explored how various technique…

Read Paper →

Engineering Preprint PDF DOI

Integrated Exploration and Sequential Manipulation on Scene Graph with LLM-based Situated Replanning

Heqing Yang, Ziyuan Jiao, Shu Wang, Yida Niu, Si Liu, Hangxin Liu · 2026

In partially known environments, robots must combine exploration to gather information with task planning for efficient execution. To address this challenge, we propose EPoG, an Exploration-based sequ…

Read Paper →

Engineering Preprint PDF DOI

VoiceSculptor: Your Voice, Designed By You

Jingbin Hu, Huakang Chen, Linhan Ma, Dake Guo, Qirui Zhan, Wenhao Li, Haoyu Zhang, Kangxiang Xia, Ziyu Zhang, Wenjie Tian, Chengyou Wang, Jinrui Liang, Shuhan Guo, Zihang Yang, Bengu Wu, Binbin Zhang, Pengcheng Zhu, Pengyuan Xie, Chuan Xie, Qiang Zhang, Jie Liu, Lei Xie · 2026

Despite rapid progress in text-to-speech (TTS), open-source systems still lack truly instruction-following, fine-grained control over core speech attributes (e.g., pitch, speaking rate, age, emotion, …

Read Paper →

Engineering Preprint PDF DOI

Neural Brain Fields: A NeRF-Inspired Approach for Generating Nonexistent EEG Electrodes

Shahar Ain Kedem, Itamar Zimerman, Eliya Nachmani · 2025

Electroencephalography (EEG) data present unique modeling challenges because recordings vary in length, exhibit very low signal to noise ratios, differ significantly across participants, drift over ti…

Read Paper →

Engineering Preprint PDF DOI

Maestro: Orchestrating Robotics Modules with Vision-Language Models for Zero-Shot Generalist Robots

Junyao Shi, Rujia Yang, Kaitian Chao, Selina Bingqing Wan, Yifei Shao, Jiahui Lei, Jianing Qian, Long Le, Pratik Chaudhari, Kostas Daniilidis, Chuan Wen, Dinesh Jayaraman · 2025

Today's best-explored routes towards generalist robots center on collecting ever larger "observations-in actions-out" robotics datasets to train large end-to-end models, copying a recipe that has work…

Read Paper →

Engineering Preprint PDF DOI

Unsupervised lexicon learning from speech is limited by representations rather than clustering

Danel Slabbert, Simon Malan, Herman Kamper · 2025

Zero-resource word segmentation and clustering systems aim to tokenise speech into word-like units without access to text labels. Despite progress, the induced lexicons are still far from perfect. In …

Read Paper →

Engineering Preprint PDF DOI

Physicochemically Informed Dual-Conditioned Generative Model of T-Cell Receptor Variable Regions for Cellular Therapy

Jiahao Ma, Hongzong Li, Ye-Fan Hu, Jian-Dong Huang · 2025

Physicochemically informed biological sequence generation has the potential to accelerate computer-aided cellular therapy, yet current models fail to \emph{jointly} ensure novelty, diversity, and biop…

Read Paper →

Engineering Preprint PDF DOI

FORGE-Tree: Diffusion-Forcing Tree Search for Long-Horizon Robot Manipulation

Yanjia Huang, Shuo Liu, Sheng Liu, Qingxiao Xu, Mingyang Wu, Xiangbo Gao, Zhengzhong Tu · 2025

Long-horizon robot manipulation tasks remain challenging for Vision-Language-Action (VLA) policies due to drift and exposure bias, often denoise the entire trajectory with fixed hyperparameters, causi…

Read Paper →

Engineering Preprint PDF DOI

Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0

Yan Miao, Ege Yuceel, Georgios Fainekos, Bardh Hoxha, Hideki Okamoto, Sayan Mitra · 2025

Visual policy design is crucial for aerial navigation. However, state-of-the-art visual policies often overfit to a single track and their performance degrades when track geometry changes. We develop …

Read Paper →

Engineering Preprint PDF DOI

Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator

Da Saem Lee, Akash Karthikeyan, Yash Vardhan Pant, Sebastian Fischmeister · 2025

Simulating diverse and realistic traffic scenarios is critical for developing and testing autonomous planning. Traditional rule-based planners lack diversity and realism, while learning-based simulato…

Read Paper →

Engineering Preprint PDF DOI

Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration

Yifan Yang, Bing Han, Hui Wang, Long Zhou, Wei Wang, Mingyu Cui, Xu Tan, Xie Chen · 2025

Prosody diversity is essential for achieving naturalness and expressiveness in zero-shot text-to-speech (TTS). However, frequently used acoustic metrics capture only partial views of prosodic variatio…

Read Paper →

Engineering Preprint PDF DOI

LSMTCR: A Scalable Multi-Architecture Model for Epitope-Specific T Cell Receptor de novo Design

Ruihao Zhang, Xiao Liu · 2025

Designing full-length, epitope-specific TCR {\alpha}\b{eta} remains challenging due to vast sequence space, data biases and incomplete modeling of immunogenetic constraints. We present LSMTCR, a scala…

Read Paper →

Engineering Preprint PDF DOI

Masquerade: Learning from In-the-wild Human Videos using Data-Editing

Marion Lepert, Jiaying Fang, Jeannette Bohg · 2025

Robot manipulation research still suffers from significant data scarcity: even the largest robot datasets are orders of magnitude smaller and less diverse than those that fueled recent breakthroughs i…

Read Paper →

Engineering Preprint PDF DOI

When Digital Twins Meet Large Language Models: Realistic, Interactive, and Editable Simulation for Autonomous Driving

Tanmay Vilas Samak, Chinmay Vilas Samak, Bing Li, Venkat Krovi · 2025

Simulation frameworks have been key enablers for the development and validation of autonomous driving systems. However, existing methods struggle to comprehensively address the autonomy-oriented requi…

Read Paper →

Browse Research Papers

QGas: Interactive Gas Infrastructure Toolkit

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models

Editing Away the Evidence: Diffusion-Based Image Manipulation and the Failure Modes of Robust Watermarking

Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

STRIDE: Post-Training LLMs to Reason and Refine Bio-Sequences via Edit Trajectories

RoboSubtaskNet: Temporal Sub-task Segmentation for Human-to-Robot Skill Transfer in Real-World Environments

Design and Evaluation of an Assisted Programming Interface for Behavior Trees in Robotics

Integrated Exploration and Sequential Manipulation on Scene Graph with LLM-based Situated Replanning

VoiceSculptor: Your Voice, Designed By You

Neural Brain Fields: A NeRF-Inspired Approach for Generating Nonexistent EEG Electrodes

Maestro: Orchestrating Robotics Modules with Vision-Language Models for Zero-Shot Generalist Robots

Unsupervised lexicon learning from speech is limited by representations rather than clustering

Physicochemically Informed Dual-Conditioned Generative Model of T-Cell Receptor Variable Regions for Cellular Therapy

FORGE-Tree: Diffusion-Forcing Tree Search for Long-Horizon Robot Manipulation

Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0

Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator

Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration

LSMTCR: A Scalable Multi-Architecture Model for Epitope-Specific T Cell Receptor de novo Design

Masquerade: Learning from In-the-wild Human Videos using Data-Editing

When Digital Twins Meet Large Language Models: Realistic, Interactive, and Editable Simulation for Autonomous Driving

Browse by Category

Research Type

Publish Your Research