Erman Ayday — Preprint — Research Repository

AI & Data Science · Preprint

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Ansar Aynetdinov, Patrick Haller, Alan Akbik · 2026

Recent research has shown that filtering massive English web corpora into high-quality subsets significantly improves training efficiency. However, for high-resource non-English languages like German,…

Economics & Finance · Preprint

Optimal Consumption and Investment with Energy-Efficiency Adoption

Anthony Britto, Carlos Oliveira, Max Kleinebrahm · 2026

Despite many decades of research, economically grounded models that analyse energy consumption and energy-efficiency adoption within a unified framework remain underdeveloped. This article addresses t…

AI & Data Science · Preprint

Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation

Dawid Wisniewski, Igor Czudy · 2026

Preserving affective nuance remains a challenge in Machine Translation (MT), where semantic equivalence often takes precedence over emotional fidelity. This paper evaluates the performance of three st…

Economics & Finance · Preprint

Data-Driven Stochastic Optimal Control for Intraday Electricity Trading by Renewable Producers

Chiheb Ben Hammouda, Michael Samet, Raul Tempone · 2026

The rapid growth of weather-dependent renewable generation increases price volatility and imbalance penalty risk in power markets, creating the need for advanced quantitative trading strategies. We de…

AI & Data Science · Preprint

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

Juergen Dietrich · 2026

Democratic discourse analysis systems increasingly rely on multi-agent LLM pipelines in which distinct evaluator models are assigned adversarial roles to generate structured, multi-perspective assessm…

AI & Data Science · Preprint

Selective Augmentation: Improving Universal Automatic Phonetic Transcription via G2P Bootstrapping

Tobias Bystrich, Julia M. Pritzen, Christoph A. Schmidt, Claudia Wich-Reif · 2026

In the field of universal automatic phonetic transcription (APT), clean and diverse training transcriptions are required. However, such high-quality data is limited. We propose the bootstrapping appro…

AI & Data Science · Preprint

Cross-Lingual Response Consistency in Large Language Models: An ILR-Informed Evaluation of Claude Across Six Languages

Camelia Baluta · 2026

This paper introduces a systematic evaluation framework grounded in the Interagency Language Roundtable (ILR) Skill Level Descriptions and applies it to Claude (Sonnet 4.6) across six languages: Engli…

Mathematics · Preprint

The Riemann integral on Dedekind complete $f$-algebras

Eder Kikianty, Luan Naude, Mark Roelands, Christopher Schwanke · 2026

In this paper we develop a theory of integration for locally band preserving functions, introduced by Ercan and Wickstead, on Dedekind complete $f$-algebras. Specifically, we construct Darboux and Rie…

AI & Data Science · Preprint

Zero-Shot to Full-Resource: Cross-lingual Transfer Strategies for Aspect-Based Sentiment Analysis

Jakob Fehle, Nils Constantin Hellwig, Udo Kruschwitz, Christian Wolff · 2026

Aspect-based Sentiment Analysis (ABSA) extracts fine-grained opinions toward specific aspects within text but remains largely English-focused despite major advances in transformer-based and instructio…

AI & Data Science · Preprint

Backtranslation Augmented Direct Preference Optimization for Neural Machine Translation

Mehrdad Ghassabi, Spehr Rajabi, Hamidreza Baradaran Kashani, Sadra Hakim, Mahshid Keivandarian · 2026

Contemporary neural machine translation (NMT) systems are almost exclusively built by training on supervised parallel data. Despite the tremendous progress achieved, these systems still exhibit persis…

Computer Science · Preprint

Visual Boosting Techniques for Spatiotemporal Dense Pixel Visualizations

Julius Rauscher, Frederik L. Dennig, Udo Schlegel, Daniel A. Keim, Tobias Schreck · 2026

The analysis of spatiotemporal data is essential in domains such as epidemiology and environmental monitoring, where understanding the interplay between spatially distributed phenomena and their tempo…

AI & Data Science · Preprint

BIMStruct3D: A Fully Automated Hybrid Learning Scan-to-BIM Pipeline with Integrated Topology Refinement

Mahdi Chamseddine, Fabian Kaufmann, Marius Schellen, Christian Glock, Didier Stricker, Jason Rambach · 2026

Automatic generation of Building Information Models (BIM) from building scans is a key challenge in architecture and construction. We present a modular pipeline for generating IFC-compliant BIM from 3…

AI & Data Science · Preprint

Time-Series Forecasting in Safety-Critical Environments: An EU-AI-Act-Compliant Open-Source Package / Zeitreihenprognose in sicherheitskritischen Umgebungen: Ein KI-VO-konformes Open-Source-Paket

Thomas Bartz-Beielstein, Eva Bartz · 2026

With spotforecast2-safe we present an integrated Compliance-by-Design approach to Python-based point forecasting of time series in safety-critical environments. A review of the relevant open-source to…

AI & Data Science · Preprint

Resource-Lean Lexicon Induction for German Dialects

Robert Litschko, Barbara Plank, Diego Frassinelli · 2026

Automatic induction of high-quality dictionaries is essential for building lexical resources, yet low-resource languages and dialects pose several challenges: limited access to annotators, high degree…

AI & Data Science · Preprint

Learning to Decipher from Pixels -- A Case Study of Copiale

Lei Kang, Giuseppe De Gregorio, Raphaela Heil, Alicia Fornes, Beata Megyesi · 2026

Historical encrypted manuscripts require both paleographic interpretation of cipher symbols and cryptanalytic recovery of plaintext. Most existing computational workflows rely on a transcription-first…

AI & Data Science · Preprint

Neural Grammatical Error Correction for Romanian

Teodor-Mihai Cotet, Stefan Ruseti, Mihai Dascalu · 2026

Resources for Grammatical Error Correction (GEC) in non-English languages are scarce, while available spellcheckers in these languages are mostly limited to simple corrections and rules. In this paper…

AI & Data Science · Preprint

Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech

Siqi Ouyang, Shuoyang Ding, Oleksii Hrinchuk, Vitaly Lavrukhin, Brian Yan, Boris Ginsburg, Lei Li · 2026

Simultaneous speech translation (SST) generates translations while receiving partial speech input. Recent advances show that large language models (LLMs) can substantially improve SST quality, but at …

Engineering · Preprint

Designing Active Operation in Low-Voltage Distribution Grids: Requirements, Interfaces and Roadmap

Eric Tonges, Andrea Schoen, Frank Marten, Marco Pau, Denis Mende · 2026

This paper outlines a pathway towards active operation of low-voltage distribution grids. In these grids, the growing deployment of distributed generation, controllable demand and storage, together wit…

AI & Data Science · Preprint

Time-dependent structural equation modeling of fans' football fever using activity tracking data during the 2025 DFB Cup final

Jonas Bauer, Christiane Fuchs, Tamara Schamberger · 2026

Football fans frequently exhibit pronounced emotional and physiological reactions during high-stakes matches. However, the temporal dynamics of this football fever are rarely modeled as a latent proce…

AI & Data Science · Preprint

The "Small World of Words" German Free-Association Norms

Samuel Aeschbach, Rui Mata, Kaidi Loo, Simon De Deyne, Dirk U. Wulff · 2026

Free-association norms provide essential empirical data for investigating linguistic, semantic, and cultural phenomena in the cognitive sciences. Although large-scale norms exist for languages such as…
