Steven Creech — Research Repository

Physics · Preprint

Chemical Taxonomy of ω Centauri: Ten Populations Reveal a Multi-Phase Enrichment History

Furkan Akbaba, Olcay Plevne, Timur Sahin, Sena Aleyna Senturk · 2026

ω Centauri, the most massive globular cluster in the Milky Way, exhibits a level of stellar population complexity that has long resisted a unified chemical characterisation. We exploit high-res…

AI & Data Science · Preprint

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

Bo Zhang, Tzu-Yen Ma, Zichen Tang, Junpeng Ding, Zirui Wang, Yizhuo Zhao, Peilin Gao, Zijie Xi, Zixin Ding, Haiyang Sun, Haocheng Gao, Yuan Liu, Liangjia Wang, Yiling Huang, Yujie Wang, Yuyue Zhang, Ronghui Xi, Yuanze Li, Jiacheng Liu, Zhongjun Yang, Haihong E · 2026

We introduce AEGIS, A holistic benchmark for Evaluating forensic analysis of AI-Generated academic ImageS. Compared to existing benchmarks, AEGIS features three key advances: (1) Domain-Specific Compl…

Computer Science · Preprint

Unsafe and Unused? A History of Utility Code in Mature Open Source Projects

Brandon Keller, Kaitlin Yandik, Angela Ngo, Andy Meneely · 2026

Filenames are a concise means of conveying information about source code to fellow developers. One such convention is util. Commonly understood to stand for "utility", filenames with the letters util …

Computer Science · Preprint

DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures

Sigma Jahan, Saurabh Singh Rajput, Tushar Sharma, Mohammad Masudur Rahman · 2026

Transformer models are widely deployed in critical AI applications, yet faults in their attention mechanisms, projections, and other internal components often degrade behavior silently without raising…

AI & Data Science · Preprint

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Ansar Aynetdinov, Patrick Haller, Alan Akbik · 2026

Recent research has shown that filtering massive English web corpora into high-quality subsets significantly improves training efficiency. However, for high-resource non-English languages like German,…

AI & Data Science · Preprint

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

Garvin Kruthof · 2026

When researchers iteratively refine ideas with large language models, do the models preserve fidelity to the original objective? We introduce DriftBench, a benchmark for evaluating constraint adherenc…

Physics · Preprint

Mixture-aware closure of the N-phase Navier–Stokes–Cahn–Hilliard mixture model

M.F.P. ten Eikelder, A. Brunk · 2026

Diffuse-interface (phase-field) models are widely used to describe multiphase mixtures and their interfacial dynamics. In multiphase settings, however, the constitutive closure should remain meaningfu…

AI & Data Science · Preprint

ITS-Mina: A Harris Hawks Optimization-Based All-MLP Framework with Iterative Refinement and External Attention for Multivariate Time Series Forecasting

Pourya Zamanvaziri, Amirhossein Sadr, Aida Pakniyat, Dara Rahmati · 2026

Multivariate time series forecasting plays a pivotal role in numerous real-world applications, including financial analysis, energy management, and traffic planning. While Transformer-based architectu…

Physics · Preprint

International Optical Clock Comparison Using the European Optical Fiber Network

Marco Pizzocaro, Clara Zyskind, Anne Amy-Klein, Erik Benkler, Sebastien Bize, Davide Calonico, Etienne Cantin, Christian Chardonnet, Cecilia Clivati, Stefano Condio, E. Anne Curtis, Simone Donadello, Soren Dorscher, Chen-Hao Feng, Melina Filzinger, Jacques-Olivier Gaudron, Rachel M. Godun, Irene Goti, Ian R. Hill, Wei Huang, Nils Huntemann, Matthew Johnson, Joshua Klose, Jochen Kronjager, Alexander Kuhl, Rodolphe Le Targat, Filippo Levi, Burghard Lipphardt, Christian Lisdat, Jerome Lodewyck, Olivier Lopez, Helen S. Margolis, Maxime Mazouth-Laurol, Alberto Mura, Benjamin Pointard, Paul-Eric Pottie, Matias Risaro, Billy I. Robertson, Marco Schioppo, Kilian Stahl, Martin Steinel, Alexandra Tofful, Mads Tønnes, Jacob Tunes · 2026

Optical clocks have achieved remarkable estimated fractional frequency uncertainties reaching the $10^{-18}$ level and below, enabling applications in fundamental physics, general relativity, and geod…

AI & Data Science · Preprint

Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation

Dawid Wisniewski, Igor Czudy · 2026

Preserving affective nuance remains a challenge in Machine Translation (MT), where semantic equivalence often takes precedence over emotional fidelity. This paper evaluates the performance of three st…

AI & Data Science · Preprint

Simulating clinical interventions with a generative multimodal model of human physiology

Guy Lutsker, Gal Sapir, Jordi Merino, Smadar Shilo, Anastasia Godneva, Eli Meirom, Shie Mannor, Hagai Rossman, Gal Chechik, Eran Segal · 2026

Understanding how human health changes over time, and why responses to interventions vary between individuals, remains a central challenge in medicine. Here we present HealthFormer, a decoder-only tra…

AI & Data Science · Preprint

Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection

Ali Shibli, Andrea Nascetti, Yifang Ban · 2026

Semantic segmentation and change detection are two fundamental challenges in remote sensing, requiring models to capture either spatial semantics or temporal differences from satellite imagery. Existi…

Engineering · Preprint

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Biology & Life Sciences · Preprint

The Lifetime Cardiac-Cycle Invariant in Endothermic Vertebrates: A 230-Species Comparative Dataset, Statistical Validation, and Explicit Falsifiability Criteria

Mesfin Taye · 2026

A pygmy shrew (Suncus etruscus, ≈2 g) sustains a resting heart rate near 1,000 beats min⁻¹ and dies within two years; an African elephant (≈4,000 kg) beats …

Computer Science · Preprint

MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks

Jona te Lintelo, Lichao Wu, Marina Krcek, Sengim Karayalcin, Stjepan Picek · 2026

Mixture-of-Experts (MoE) architectures in Large Language Models (LLMs) have significantly reduced inference costs through sparse activation. However, this sparse activation paradigm also introduces ne…

Physics · Preprint

High-Girth Regular Quantum LDPC Codes from Square-Base Hypergraph Products via CPM Lifts

Koki Okada, Kenta Kasai · 2026

We study square-base Calderbank–Shor–Steane (CSS) hypergraph-product codes as a finite-length class for regular high-girth quantum low-density parity-check (LDPC) design. For base matrices of small …

AI & Data Science · Preprint

GourNet: A CNN-Based Model for Mango Leaf Disease Detection

Ekram Alam, Jaydip Sanyal, Akhil Kumar Das, Arijit Bhattacharya, Farhana Sultana · 2026

Mango cultivation is crucial in the agricultural sector, significantly contributing to economic development and food security. However, diseases affecting mango leaves can significantly reduce both th…

AI & Data Science · Preprint

When Agents Evolve, Institutions Follow

Chao Fei, Hongcheng Guo, Yanghua Xiao · 2026

Across millennia, complex societies have faced the same coordination problem of how to organize collective action among cognitively bounded and informationally incomplete individuals. Different civili…

AI & Data Science · Preprint

JaiTTS: A Thai Voice Cloning Model

Jullajak Karnjanaekarin, Pontakorn Trakuekul, Narongkorn Panitsrisit, Sumana Sumanakul, Vichayuth Nitayasomboon, Nithid Guntasin, Thanavin Denkavin, Attapol T. Rutherford · 2026

We present JaiTTS-v1.0, a state-of-the-art Thai voice cloning text-to-speech model built through continual training on a large Thai-centric speech corpus. The model architecture is adapted from VoxCPM…

AI & Data Science · Preprint

AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

Eugen Beck, Sarah Beranek, Uma Moothiringote, Daniel Mann, Wilfried Michel, Katie Nguyen, Taylor Tragemann · 2026

Evaluating English ASR systems for conversational AI applications remains difficult, as many publicly available corpora are either pre-segmented into short segments, consist of read or prepared speech…
