791+ open-access research outputs.
The quadratic computational complexity of the standard attention mechanism constitutes a fundamental bottleneck for large language models in long-context inference. While existing KV cache compression…
We present an end-to-end performance evaluation of MPEG-DASH video streaming over a Low-Earth Orbit (LEO) satellite-based 5G Integrated Access and Backhaul (IAB) network. Our objective is to investiga…
Recent studies have shown that distributed storage systems can achieve significant space savings by adapting redundancy levels to varying disk failure rates. This adaptation is performed via code conv…
Using the first data release of the Five-hundred-meter Aperture Spherical radio Telescope (FAST) All-Sky HI survey (FASHI), we compile a catalogue of 70 dark galaxy candidates (DGCs) within 50 Mpc. We…
We extracted a list of 662 nearby (within $\sim16$ Mpc) HI-detection sources from the Five-hundred-meter Aperture Spherical radio Telescope (FAST) All Sky HI Survey (FASHI) and made a visual identific…
Word error rate (WER) is the dominant metric for automatic speech recognition, yet it cannot detect a systematic failure mode: models that produce fluent output in the wrong writing system. We define …
Pashto is absent from Whisper's pre-training corpus despite being one of CommonVoice's largest language collections, leaving off-the-shelf models unusable: all Whisper sizes output Arabic, Dari, or Ur…
Physics-based character animation has become a fundamental approach for synthesizing realistic, physically plausible motions. While current data-driven deep reinforcement learning (DRL) methods can sy…
Pashto is spoken by approximately 60--80 million people but has no published benchmarks for multilingual automatic speech recognition (ASR) on any shared public test set. This paper reports the first …
We aim to investigate the feasibility of accurately determining the helium-to-metal enrichment ratio $\Delta Y/\Delta Z$ for open clusters using Gaia DR3 photometry. To test the reliability of this ca…
Given a root system $R$, two roots are said to be \emph{strongly orthogonal} if neither their sum nor difference is a root. Gashi defined a family of graphs with vertices labelled by sums of $k$-eleme…
We used observations obtained with the Ultraviolet Imaging Telescope on board the AstroSat satellite to measure the integrated far-ultraviolet (FUV) and optical (V) magnitudes of 30 Galactic globular …
We present the Pashto Common Voice corpus -- the first large-scale, openly licensed speech resource for Pashto, a language with over 60 million native speakers largely absent from open speech technolo…
Large language models produce em dashes at varying rates, and the observation that some models "overuse" them has become one of the most widely discussed markers of AI-generated text. Yet no mechanist…
We prove that the theory of the Farey graph is pseudofinite by constructing a sequence of finite structures that satisfy increasingly large subsets of its first-order axiomatization. This graph is an …
Atomic hydrogen (HI) plays a fundamental role in fueling star formation in galaxies. However, the behavior of HI gas in interacting systems, particularly galaxy pairs, remains elusive. In this work, w…
Let $\Gamma < G := \operatorname{SO}(d+1, 1)$ for $d \geq 1$ be a Zariski dense, geometrically finite, discrete subgroup with critical exponent strictly greater than $d/2$. We show that $L^2(\Gamma\ba…
We present PashtoCorp, a 1.25-billion-word corpus for Pashto, a language spoken by 60 million people that remains severely underrepresented in NLP. The corpus is assembled from 39 sources spanning sev…
Omnimodal large language models (OmniLLMs) jointly process audio and visual streams, but the resulting long multimodal token sequences make inference prohibitively expensive. Existing compression meth…
Sentence simplification aims to make complex text more accessible by reducing linguistic complexity while preserving the original meaning. However, progress in this area remains limited for mid-resour…
Free open-access publishing with Google Scholar indexing.
Submission Guide →