Expertini Research Research

Browse Research Papers

366+ open-access research outputs.

✕ Clear
🔍 mahdi bashiri
Showing 366 results for "mahdi bashiri"
Biology & Life Sciences Preprint PDF DOI

Polynomial-time completion of phylogenetic tree sets

Aleksandr Koshkarov, Nadia Tahiri · 2026

Comparative analyses of phylogenetic trees typically require identical taxon sets, however, in practice, trees often include distinct but overlapping taxa. Pruning non-shared leaves discards phylogene…

Read Paper →
AI & Data Science Preprint PDF DOI

ks-pret-5m: a 5 million word, 12 million token kashmiri pretraining dataset

Haq Nawaz Malik, Nahfid Nissar · 2026

We present KS-PRET-5M, the largest publicly available pretraining dataset for the Kashmiri language, comprising 5,090,244 (5.09M) words, 27,692,959 (27.6M) characters, and a vocabulary of 295,433 (295…

Read Paper →
AI & Data Science Preprint PDF DOI

Model Assisted Data Integration: An unbiased sampling strategy to use nonprobability data

Martin Hyllienmark, Gustaf Strandell · 2026

The aim of survey statistics is to produce estimates with a minimal bias and a corresponding acceptable variance given a specific budget, preferable with a minor response burden for the participants. …

Read Paper →
Mathematics Preprint PDF DOI

On Umbilical Real Hypersufaces of Products of Complex Space Forms

Iury Domingos, Ranilze da Silva, Alexandre de Sousa, Feliciano Vitorio · 2026

Tashiro and Tachibana proved that there exist no totally umbilical hypersurfaces in complex space forms with nonzero constant holomorphic sectional curvature, and it is also known that the shape opera…

Read Paper →
AI & Data Science Preprint PDF DOI

Towards Practical Multimodal Hospital Outbreak Detection

Chang Liu, Jieshi Chen, Alexander J. Sundermann, Kathleen Shutt, Marissa P. Griffith, Lora Lee Pless, Lee H. Harrison, Artur W. Dubrawski · 2026

Rapid identification of outbreaks in hospitals is essential for controlling pathogens with epidemic potential. Although whole genome sequencing (WGS) remains the gold standard in outbreak investigatio…

Read Paper →
AI & Data Science Preprint PDF DOI

Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech

Tajamul Ashraf, Burhaan Rasheed Zargar, Saeed Abdul Muizz, Ifrah Mushtaq, Nazima Mehdi, Iqra Altaf Gillani, Aadil Amin Kak, Janibul Bashir · 2026

Kashmiri is spoken by around 7 million people but remains critically underserved in speech technology, despite its official status and rich linguistic heritage. The lack of robust Text-to-Speech (TTS)…

Read Paper →
Mathematics Preprint PDF DOI

Barycenter technique for the higher order $Q$-curvature equation

Saikat Mazumdar, Cheikh Birahim Ndiaye · 2026

Let $k\ge1$ be an integer, and $(M,g)$ be a smooth, closed Riemannian manifold of dimension $2k+1\le n\le 2k+3$, or $(M,g)$ be locally conformally flat of dimension $n\ge 2k+1$. Applying the Bahri-C…

Read Paper →
Biology & Life Sciences Preprint PDF DOI

Exploring the Utility of MALDI-TOF Mass Spectrometry and Antimicrobial Resistance in Hospital Outbreak Detection

Chang Liu, Jieshi Chen, Alexander J. Sundermann, Kathleen Shutt, Marissa P. Griffith, Lora Lee Pless, Lee H. Harrison, Artur W. Dubrawski · 2026

Accurate and timely identification of hospital outbreak clusters is crucial for preventing the spread of infections that have epidemic potential. While assessing pathogen similarity through whole geno…

Read Paper →
AI & Data Science Preprint PDF DOI

NAAMSE: Framework for Evolutionary Security Evaluation of Agents

Kunal Pai, Parth Shah, Harshil Patel · 2026

AI agents are increasingly deployed in production, yet their security evaluations remain bottlenecked by manual red-teaming or static benchmarks that fail to model adaptive, multi-turn adversaries. We…

Read Paper →
AI & Data Science Preprint PDF DOI

A Human-in-the-Loop, LLM-Centered Architecture for Knowledge-Graph Question Answering

Larissa Pusch, Alexandre Courtiol, Tim Conrad · 2026

Large Language Models (LLMs) excel at language understanding but remain limited in knowledge-intensive domains due to hallucinations, outdated information, and limited explainability. Text-based retri…

Read Paper →
AI & Data Science Preprint PDF DOI

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Dmitry Karpov · 2026

We explore machine translation for five Turkic language pairs: Russian-Bashkir, Russian-Kazakh, Russian-Kyrgyz, English-Tatar, English-Chuvash. Fine-tuning nllb-200-distilled-600M with LoRA on synthet…

Read Paper →
AI & Data Science Preprint PDF DOI

From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning

Hang Ni, Weijia Zhang, Fei Wang, Zezhi Shao, Hao Liu · 2026

Advances in multi-modal large language models (MLLMs) have inspired time series understanding and reasoning tasks, that enable natural language querying over time series, producing textual analyses of…

Read Paper →
Physics Preprint PDF DOI

Prompt cusps in hierarchical dark matter halos: Implications for annihilation boost

Shin'ichiro Ando, Martin Moro, Youyou Li · 2026

Recent simulations have identified long-lived ``prompt cusps'' -- compact remnants of early density peaks with inner profiles $\rho\propto r^{-3/2}$. They can survive hierarchical assembly and potent…

Read Paper →
Computer Science Preprint PDF DOI

Overcoming Barriers to Computational Reproducibility

Roman Hornung, Laszlo Nemeth, Oleksandr Zadorozhny, Theresa Ullmann, Michael Kammer, Rebecca Killick, Christopher J. Paciorek, Julien Chiquet, Moritz Herrmann, Lucija Batinovic, Rickard Carlsson, Pierre Neuvial, Boris Hejblum, Julia Wrobel, Anne-Laure Boulesteix, Karsten Tabelow · 2026

Computational reproducibility, the possibility for independent researchers to exactly reproduce published empirical results, is fundamental to science. Despite its importance, the proportion of resear…

Read Paper →
AI & Data Science Preprint PDF DOI

synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier

Haq Nawaz Malik, Kh Mohmad Shafi, Tanveer Ahmad Reshi · 2026

Optical Character Recognition (OCR) for low-resource languages remains a significant challenge due to the scarcity of large-scale annotated training datasets. Languages such as Kashmiri, with approxim…

Read Paper →
Sociology & Anthropology Preprint PDF DOI

The Physics of the Dancing \emph{Deity}: Coupled Oscillators in Himalayan Processions

Nalin Dhiman · 2026

In parts of Himachal Pradesh (Kullu and Mandi) and the Western Himalaya, village deities (\emph{devt\=a}) are carried through the landscape on shoulder-borne palanquins or ``raths.'' Participants ofte…

Read Paper →
AI & Data Science Preprint PDF DOI

ks-lit-3m: A 3.1 million word kashmiri text dataset for large language model pretraining

Haq Nawaz Malik · 2026

Large Language Models (LLMs) demonstrate remarkable fluency across high-resource languages yet consistently fail to generate coherent text in Kashmiri, a language spoken by approximately seven million…

Read Paper →
AI & Data Science Preprint PDF DOI

600k-ks-ocr: a large-scale synthetic dataset for optical character recognition in kashmiri script

Haq Nawaz Malik · 2026

This technical report presents the 600K-KS-OCR Dataset, a large-scale synthetic corpus comprising approximately 602,000 word-level segmented images designed for training and evaluating optical charact…

Read Paper →
AI & Data Science Preprint PDF DOI

Spatial Analysis for AI-segmented Histopathology Images: Methods and Implementation

Yoolkyu Park, Fangjiang Wu, Xin Feng, Shengjie Yang, Elizabeth H. Wang, Bo Yao, Chul Moon, Guanghua Xiao, Qiwei Li · 2025

Quantitative characterization of cellular spatial organization is critical for understanding tumor progression and immune response. Recent advances in artificial intelligence (AI) enable large-scale s…

Read Paper →
AI & Data Science Preprint PDF DOI

Unmasking Airborne Threats: Guided-Transformers for Portable Aerosol Mass Spectrometry

Kyle M. Regan, Michael McLoughlin, Wayne A. Bryden, Gonzalo R. Arce · 2025

Matrix Assisted Laser Desorption/Ionization Mass Spectrometry (MALDI-MS) is a cornerstone in biomolecular analysis, offering precise identification of pathogens through unique mass spectral signatures…

Read Paper →
Page 1 of 19 Next →