Erman Ayday in Engineering — Research Repository

Engineering Preprint PDF DOI

Designing Active Operation in Low-Voltage Distribution Grids: Requirements, Interfaces and Roadmap

Eric Tonges, Andrea Schoen, Frank Marten, Marco Pau, Denis Mende · 2026

This paper outlines a pathway towards active operation of lowvoltage distribution grids. In these grids, the growing deployment of distributed generation, controllable demand and storage, together wit…

Read Paper →

Engineering Preprint PDF DOI

From Energy Transition Pathways to Measurement Requirements: A Scenario-Based Study of Low-Voltage Grids

Nane Zimmermann, Lukas P. Wagner, Luca von Ronn, Florian Strobel, Paul Huttmann, Felix Gehlhoff · 2026

Increasing penetration of electric vehicles, heat pumps, and rooftop photovoltaics is creating thermal and voltage stress in low-voltage distribution grids. This work links three German energy transit…

Read Paper →

Engineering Preprint PDF DOI

Defending the power grid by segmenting the EV charging cyber infrastructure

Kirill Kuroptev, Florian Steinke, Efthymios Karangelos · 2026

This paper examines defending the power grid against load-altering attacks using electric vehicle charging. It proposes to preventively segment the cyber infrastructure that charging station operators…

Read Paper →

Engineering Preprint PDF DOI

Identification and Visualization of Correlation Structures in Large-Scale Power Quality Data

Max Domagk, Jan Meyer, Marco Lindner · 2026

Large-scale power quality (PQ) measurement campaigns generate vast amounts of multivariate data, in which systematic dependencies are difficult to identify using conventional analysis techniques. This…

Read Paper →

Engineering Preprint PDF DOI

BabAR: from phoneme recognition to developmental measures of young children's speech production

Marvin Lavechin, Elika Bergelson, Roger Levy · 2026

Studying early speech development at scale requires automatic tools, yet automatic phoneme recognition, especially for young children, remains largely unsolved. Building on decades of data collection,…

Read Paper →

Engineering Preprint PDF DOI

The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer's Disease

Franziska Braun, Christopher Witzl, Florian Honig, Elmar Noth, Tobias Bocklet, Korbinian Riedhammer · 2026

Early and accessible detection of Alzheimer's disease (AD) remains a major challenge, as current diagnostic methods often rely on costly and invasive biomarkers. Speech and language analysis has emerg…

Read Paper →

Engineering Preprint PDF DOI

Real-world energy data of 200 feeders from low-voltage grids with metadata in Germany over two years

Manuel Treutlein, Pascal Bothe, Marc Schmidt, Roman Hahn, Oliver Neumann, Ralf Mikut, Veit Hagenmeyer · 2026

The last mile of the distribution grid is crucial for a successful energy transition, as more low-carbon technology like photovoltaic systems, heat pumps, and electric vehicle chargers connect to the …

Read Paper →

Engineering Preprint PDF DOI

Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs

Lalaram Arya, Mrinmoy Bhattacharjee, Adarsh C. R., S. R. Mahadeva Prasanna · 2026

Direct Speech-to-Speech Translation (S2ST) has gained increasing attention for its ability to translate speech from one language to another, while reducing error propagation and latency inherent in tr…

Read Paper →

Engineering Preprint PDF DOI

Systemization of Knowledge: Resilience and Fault Tolerance in Cyber-Physical Systems

Rahul Bulusu · 2025

Cyber-Physical Systems (CPS) now support critical infrastructure spanning transportation, energy, manufacturing, medical devices, and autonomous robotics. Their defining characteristic is the tight co…

Read Paper →

Engineering Preprint PDF DOI

Phoneme-based speech recognition driven by large language models and sampling marginalization

Te Ma, Nanjie Li, Hao Huang, Zhijian Ou · 2025

Recently, the Large Language Model-based Phoneme-to-Grapheme (LLM-P2G) method has shown excellent performance in speech recognition tasks and has become a feasible direction to replace the traditional…

Read Paper →

Engineering Preprint PDF DOI

Towards Language-Independent Face-Voice Association with Multimodal Foundation Models

Aref Farhadipour, Teodora Vukovic, Volker Dellwo · 2025

This paper describes the UZH-CL system submitted to the FAME2026 Challenge. The challenge focuses on cross-modal verification under unique multilingual conditions, specifically unseen and unheard lang…

Read Paper →

Engineering Preprint PDF DOI

RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

Zhisheng Zheng, Xiaohang Sun, Tuan Dinh, Abhishek Yanamandra, Abhinav Jain, Zhu Liu, Sunil Hadap, Vimal Bhat, Manoj Aggarwal, Gerard Medioni, David Harwath · 2025

End-to-end speech-to-speech translation (S2ST) systems typically struggle with a critical data bottleneck: the scarcity of parallel speech-to-speech corpora. To overcome this, we introduce RosettaSpee…

Read Paper →

Engineering Preprint PDF DOI

Influence of Transmission Rank on EMF Exposure Measured With Provoked Data Traffic Around 5G Massive MIMO Base Stations

Lisa-Marie Schilling, Christian Bornkessel, Anna-Malin Schiffarth, Thanh Tam Julian Ta, Dirk Heberling, Matthias Hein · 2025

The introduction of 5G New Radio networks with massive MIMO technology has complicated electromagnetic field exposure assessments for radiation protection. Massive MIMO transmission enables beamformin…

Read Paper →

Engineering Preprint PDF DOI

On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts

Kashaf Gulzar, Dominik Wagner, Sebastian P. Bayerl, Florian Honig, Tobias Bocklet, Korbinian Riedhammer · 2025

Automatic transcription of stuttered speech remains a challenge, even for modern end-to-end (E2E) automatic speech recognition (ASR) frameworks. Dysfluencies and fluency-shaping artifacts are often ov…

Read Paper →

Engineering Preprint PDF DOI

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing

Zhisheng Zheng, Puyuan Peng, Anuj Diwan, Cong Phuoc Huynh, Xiaohang Sun, Zhu Liu, Vimal Bhat, David Harwath · 2025

We introduce VoiceCraft-X, an autoregressive neural codec language model which unifies multilingual speech editing and zero-shot Text-to-Speech (TTS) synthesis across 11 languages: English, Mandarin, …

Read Paper →

Engineering Preprint PDF DOI

Leveraging Language Information for Target Language Extraction

Mehmet Sinan Y{i}ld{i}r{i}m, Ruijie Tao, Wupeng Wang, Junyi Ao, Haizhou Li · 2025

Target Language Extraction aims to extract speech in a specific language from a mixture waveform that contains multiple speakers speaking different languages. The human auditory system is adept at per…

Read Paper →

Engineering Preprint PDF DOI

A Multilingual Framework for Dysarthria: Detection, Severity Classification, Speech-to-Text, and Clean Speech Generation

Ananya Raghu, Anisha Raghu, Nithika Vivek, Sofie Budman, Omar Mansour · 2025

Dysarthria is a motor speech disorder that results in slow and often incomprehensible speech. Speech intelligibility significantly impacts communication, leading to barriers in social interactions. Dy…

Read Paper →

Engineering Preprint PDF DOI

Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition

Niclas Pokel, Pehuen Moure, Roman Boehringer, Shih-Chii Liu, Yingqiang Gao · 2025

Speech impairments resulting from congenital disorders, such as cerebral palsy, down syndrome, or apert syndrome, as well as acquired brain injuries due to stroke, traumatic accidents, or tumors, pres…

Read Paper →

Engineering Preprint PDF DOI

Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling

Niclas Pokel, Pehuen Moure, Roman Bohringer, Yingqiang Gao · 2025

ASR systems struggle with non-normative speech due to high acoustic variability and data scarcity. We propose a data-efficient method using phoneme-level uncertainty to guide fine-tuning for personali…

Read Paper →

Engineering Preprint PDF DOI

Unified Graph-Theoretic Modeling of Multi-Energy Flows in Distribution Systems

Marwan Mostafa, Daniel Wenser, Payam Teimourzadeh Baboli, Christian Becker · 2025

The increasing complexity of energy systems due to sector coupling and decarbonization calls for unified modeling frameworks that capture the physical and structural interactions between electricity, …

Read Paper →

Browse Research Papers

Designing Active Operation in Low-Voltage Distribution Grids: Requirements, Interfaces and Roadmap

From Energy Transition Pathways to Measurement Requirements: A Scenario-Based Study of Low-Voltage Grids

Defending the power grid by segmenting the EV charging cyber infrastructure

Identification and Visualization of Correlation Structures in Large-Scale Power Quality Data

BabAR: from phoneme recognition to developmental measures of young children's speech production

The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer's Disease

Real-world energy data of 200 feeders from low-voltage grids with metadata in Germany over two years

Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs

Systemization of Knowledge: Resilience and Fault Tolerance in Cyber-Physical Systems

Phoneme-based speech recognition driven by large language models and sampling marginalization

Towards Language-Independent Face-Voice Association with Multimodal Foundation Models

RosettaSpeech: Zero-Shot Speech-to-Speech Translation without Parallel Speech

Influence of Transmission Rank on EMF Exposure Measured With Provoked Data Traffic Around 5G Massive MIMO Base Stations

On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing

Leveraging Language Information for Target Language Extraction

A Multilingual Framework for Dysarthria: Detection, Severity Classification, Speech-to-Text, and Clean Speech Generation

Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition

Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling

Unified Graph-Theoretic Modeling of Multi-Energy Flows in Distribution Systems

Browse by Category

Research Type

Publish Your Research