Alan Inglis — Preprint — Research Repository

AI & Data Science Preprint PDF DOI

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Ansar Aynetdinov, Patrick Haller, Alan Akbik · 2026

Recent research has shown that filtering massive English web corpora into high-quality subsets significantly improves training efficiency. However, for high-resource non-English languages like German,…

Read Paper →

Mathematics Preprint PDF DOI

Circle Pattern Theorem for Quasi-simplicial Triangulated Surfaces

Aijin Lin, Qingyi Liu · 2026

The Circle Pattern Theorem characterizes the existence and rigidity of circle patterns with prescribed intersection angles on simplicial triangulations of closed surfaces. In this paper we extend the …

Read Paper →

AI & Data Science Preprint PDF DOI

KellyBench: A Benchmark for Long-Horizon Sequential Decision Making

Thomas Grady, Kip Parker, Iliyan Zarov, Henry Course, Chengxi Taylor, Ross Taylor · 2026

Language models are saturating benchmarks for procedural tasks with narrow objectives. But they are increasingly being deployed in long-horizon, non-stationary environments with open-ended goals. In t…

Read Paper →

AI & Data Science Preprint PDF DOI

JaiTTS: A Thai Voice Cloning Model

Jullajak Karnjanaekarin, Pontakorn Trakuekul, Narongkorn Panitsrisit, Sumana Sumanakul, Vichayuth Nitayasomboon, Nithid Guntasin, Thanavin Denkavin, Attapol T. Rutherford · 2026

We present JaiTTS-v1.0, a state-of-the-art Thai voice cloning text-to-speech model built through continual training on a large Thai-centric speech corpus. The model architecture is adapted from VoxCPM…

Read Paper →

Physics Preprint PDF DOI

Probing Near-Threshold $s$-Wave Components in Heavy Nuclei via Coulomb-Assisted Neutron Transfer

Yuki Nakanishi, Junki Tanaka, Atsushi Tamii, Shimpei Endo · 2026

We propose a method to probe weakly bound s-wave neutron components near the neutron emission threshold in heavy nuclei using Coulomb-assisted neutron transfer reactions. Weakly bound s-wave neutrons …

Read Paper →

AI & Data Science Preprint PDF DOI

APPSI-139: A Parallel Corpus of English Application Privacy Policy Summarization and Interpretation

Pengyun Zhu, Qiheng Sun, Long Wen, Yanbo Wang, Yang Cao, Junxu Liu, Deyi Xiong, Jinfei Liu, Zhibo Wang, Kui Ren · 2026

Privacy policies are essential for users to understand how service providers handle their personal data. However, these documents are often long and complex, as well as filled with technobabble and le…

Read Paper →

AI & Data Science Preprint PDF DOI

AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

Eugen Beck, Sarah Beranek, Uma Moothiringote, Daniel Mann, Wilfried Michel, Katie Nguyen, Taylor Tragemann · 2026

Evaluating English ASR systems for conversational AI applications remains difficult, as many publicly available corpora are either pre-segmented into short segments, consist of read or prepared speech…

Read Paper →

AI & Data Science Preprint PDF DOI

Entropy of Ukrainian

Anton Lavreniuk, Mykyta Mudryi, Markiian Chaklosh · 2026

In natural language processing, the entropy of a language is a measure of its unpredictability and complexity. The first study on this subject was conducted by Claude Shannon in 1951. By having partic…

Read Paper →

Computer Science Preprint PDF DOI

Examining discontinuance of AI-mediated informal digital learning of English (AI-IDLE) among university students: Evidence from SEM and fsQCA

Yiran Du, Huimin He · 2026

This study examined university students' discontinuance intention towards AI-mediated informal digital learning of English (AI-IDLE). Drawing on the cognition-affect-conation framework, the study inve…

Read Paper →

Computer Science Preprint PDF DOI

Why Learners Drift In and Out: Examining Intermittent Discontinuance in AI-Mediated Informal Digital English Learning (AI-IDLE) Using SEM and fsQCA

Yiran Du, Huimin He · 2026

This study examined intermittent discontinuance in AI-mediated informal digital learning of English (AI-IDLE) through the cognition-affect-conation framework. Survey data were collected from 632 Chine…

Read Paper →

AI & Data Science Preprint PDF DOI

Perturbation Probing: A Two-Pass-per-Prompt Diagnostic for FFN Behavioral Circuits in Aligned LLMs

Hongliang Liu, Tung-Ling Li, Yuhao Wu · 2026

Perturbation probing generates task-specific causal hypotheses for FFN neurons in large language models using two forward passes per prompt and no backpropagation, followed by a one-time intervention …

Read Paper →

Physics Preprint PDF DOI

Asymmetric freezing of a sliding droplet on an inclined surface

Sivanandan Kavuri, George Karapetsas, Chander Shekhar Sharma, Kirti Chandra Sahu · 2026

We investigate the asymmetric freezing of a liquid droplet sliding on an inclined cold surface using numerical simulations based on the lubrication approximation. The combined effects of gravity, capi…

Read Paper →

Computer Science Preprint PDF DOI

Cross-lingual Comparison of Research Funding Projects with Multilingual Sentence-BERT: Evidence from KAKENHI, NIH, NSF, and UKRI

Miki Kimura-Ida · 2026

Cross-national comparison of research funding projects is increasingly important for science policy and strategic planning, but language differences remain a major obstacle. In particular, KAKENHI pro…

Read Paper →

AI & Data Science Preprint PDF DOI

Targeted Linguistic Analysis of Sign Language Models with Minimal Translation Pairs

Serpil Karabuklu, Kanishka Misra, Shester Gueuwou, Diane Brentari, Greg Shakhnarovich, Karen Livescu · 2026

Models of sign language have historically lagged behind those for spoken language (text and speech). Recent work has greatly improved their performance on tasks like sign language translation and isol…

Read Paper →

AI & Data Science Preprint PDF DOI

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

Juergen Dietrich · 2026

Democratic discourse analysis systems increasingly rely on multi-agent LLM pipelines in which distinct evaluator models are assigned adversarial roles to generate structured, multi-perspective assessm…

Read Paper →

Physics Preprint PDF DOI

Dispersive Properties of Plasma Diffraction Gratings: Towards Plasma-Based Laser Pulse Compression

Victor M. Perez-Ramirez, Michelle M. Wang, Ke Ou, Sida Cao, Devdigvijay Singh, Nicholas M. Fasano, Vedin Dewan, Andreas M. Giakas, Arunava Das, Isabelle Tigges-Green, Pierre Michel, Julia M. Mikhailova, Matthew R. Edwards · 2026

The standard architecture for a high-peak-power femtosecond laser is chirped pulse amplification using diffraction gratings for compression; the damage threshold of the compression gratings limits cur…

Read Paper →

AI & Data Science Preprint PDF DOI

Cross-Lingual Response Consistency in Large Language Models: An ILR-Informed Evaluation of Claude Across Six Languages

Camelia Baluta · 2026

This paper introduces a systematic evaluation framework grounded in the Interagency Language Roundtable (ILR) Skill Level Descriptions and applies it to Claude (Sonnet 4.6) across six languages: Engli…

Read Paper →

Physics Preprint PDF DOI

Continuum contribution to charged-current absorption of low-energy $\nu_e$ on $^{40}$Ar

Steven Gardiner, Pablo Barham Alzas, Alexis Nikolakopoulos, Luca H. Abu El-Haj, Natalie Jachowicz, Vishvas Pandey · 2026

Accurate modeling of the absorption of tens-of-MeV $\nu_e$ on $^{40}$Ar is needed to enable measurements of astrophysical neutrinos using large liquid argon time projection chamber (LArTPC) detectors,…

Read Paper →

Physics Preprint PDF DOI

Tunable high-Chern-number Chern insulators in rhombohedral tetralayer graphene/hBN moir\'e superlattices

Chuanqi Zheng, Chushan Li, Ke Huang, Chenyu Zhang, Kenji Watanabe, Takashi Taniguchi, Hao Yang, Dandan Guan, Liang Liu, Shiyong Wang, Yaoyi Li, Hao Zheng, Canhua Liu, Jinfeng Jia, Xueyang Song, Zhiwen Shi, Guorui Chen, Xiao Li, Tingxin Li, Xiaoxue Liu · 2026

Moir\'e superlattices based on rhombohedral multilayer graphene have emerged as a highly tunable platform for engineering correlated topological phases. Here, we systematically investigate the transpo…

Read Paper →

AI & Data Science Preprint PDF DOI

Zero-Shot to Full-Resource: Cross-lingual Transfer Strategies for Aspect-Based Sentiment Analysis

Jakob Fehle, Nils Constantin Hellwig, Udo Kruschwitz, Christian Wolff · 2026

Aspect-based Sentiment Analysis (ABSA) extracts fine-grained opinions toward specific aspects within text but remains largely English-focused despite major advances in transformer-based and instructio…

Read Paper →

Browse Research Papers

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Circle Pattern Theorem for Quasi-simplicial Triangulated Surfaces

KellyBench: A Benchmark for Long-Horizon Sequential Decision Making

JaiTTS: A Thai Voice Cloning Model

Probing Near-Threshold $s$-Wave Components in Heavy Nuclei via Coulomb-Assisted Neutron Transfer

APPSI-139: A Parallel Corpus of English Application Privacy Policy Summarization and Interpretation

AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

Entropy of Ukrainian

Examining discontinuance of AI-mediated informal digital learning of English (AI-IDLE) among university students: Evidence from SEM and fsQCA

Why Learners Drift In and Out: Examining Intermittent Discontinuance in AI-Mediated Informal Digital English Learning (AI-IDLE) Using SEM and fsQCA

Perturbation Probing: A Two-Pass-per-Prompt Diagnostic for FFN Behavioral Circuits in Aligned LLMs

Asymmetric freezing of a sliding droplet on an inclined surface

Cross-lingual Comparison of Research Funding Projects with Multilingual Sentence-BERT: Evidence from KAKENHI, NIH, NSF, and UKRI

Targeted Linguistic Analysis of Sign Language Models with Minimal Translation Pairs

When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis

Dispersive Properties of Plasma Diffraction Gratings: Towards Plasma-Based Laser Pulse Compression

Cross-Lingual Response Consistency in Large Language Models: An ILR-Informed Evaluation of Claude Across Six Languages

Continuum contribution to charged-current absorption of low-energy $\nu_e$ on $^{40}$Ar

Tunable high-Chern-number Chern insulators in rhombohedral tetralayer graphene/hBN moir\'e superlattices

Zero-Shot to Full-Resource: Cross-lingual Transfer Strategies for Aspect-Based Sentiment Analysis

Browse by Category

Research Type

Publish Your Research