David Bengali in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Analysis of AWW (Anganwadi Workers) Training Content, ILA (Incremental Learning Approach) Modules Following CDT (Component Display Theory)

Arka Majhi, Satish B. Agnihotri · 2026

POSHAN Abhiyan envisages capacity building of AWWs or frontline health workers through 21 training modules of ILA (Incremental Learning Approach), modularising the net learning content into smaller le…

Read Paper →

Computer Science Preprint PDF DOI

AVID: A Benchmark for Omni-Modal Audio-Visual Inconsistency Understanding via Agent-Driven Construction

Zixuan Chen, Depeng Wang, Hao Lin, Li Luo, Ke Xu, Ya Guo, Huijia Zhu, Tanfeng Sun, Xinghao Jiang · 2026

We present AVID, the first large-scale benchmark for audio-visual inconsistency understanding in videos. While omni-modal large language models excel at temporally aligned tasks such as captioning and…

Read Paper →

Computer Science Preprint PDF DOI

A Non-Probabilistic Game-Theoretic Information Theory Which Subsumes Probabilistic Channel Coding

Cheuk Ting Li · 2026

Probabilistic settings (e.g., vanishing-error channel coding) and non-probabilistic settings (e.g., zero-error channel coding and adversarial channels) were considered two related but different branch…

Read Paper →

Computer Science Preprint PDF DOI

Script Collapse in Multilingual ASR: Defining and Measuring Script Fidelity Rate

Hanif Rahman · 2026

Word error rate (WER) is the dominant metric for automatic speech recognition, yet it cannot detect a systematic failure mode: models that produce fluent output in the wrong writing system. We define …

Read Paper →

Computer Science Preprint PDF DOI

The Theorems of Dr. David Blackwell and Their Contributions to Artificial Intelligence

Napoleon Paxton · 2026

Dr. David Blackwell was a mathematician and statistician of the first rank, whose contributions to statistical theory, game theory, and decision theory predated many of the algorithmic breakthroughs t…

Read Paper →

Computer Science Preprint PDF DOI

WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech

Aurchi Chowdhury, Rubaiyat -E-Zaman, Sk. Ashrafuzzaman Nafees · 2026

This paper presents our solution for the DL Sprint 4.0, addressing the dual challenges of Bengali Long-Form Speech Recognition (Task 1) and Speaker Diarization (Task 2). Processing long-form, multi-sp…

Read Paper →

Computer Science Preprint PDF DOI

When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper

Akif Islam, Raufun Nahar, Md. Ekramul Hamid · 2026

Recent advances in automatic speech recognition (ASR) and speech enhancement have led to a widespread assumption that improving perceptual audio quality should directly benefit recognition accuracy. I…

Read Paper →

Computer Science Preprint PDF DOI

LLY Ricci Reweighting in Stochastic Block Models: Uniform Curvature Concentration and Finite-Horizon Tracking

Varun Kotharkar · 2026

We study curvature-driven edge reweighting for community recovery in the balanced two-block stochastic block model. Given a graph G with initial weights equal to the adjacency matrix, we iteratively u…

Read Paper →

Computer Science Preprint PDF DOI

An Investigation Into Various Approaches For Bengali Long-Form Speech Transcription and Bengali Speaker Diarization

Epshita Jahan, Khandoker Md Tanjinul Islam, Pritom Biswas, Tafsir Al Nafin · 2026

Bengali remains a low-resource language in speech technology, especially for complex tasks like long-form transcription and speaker diarization. This paper presents a multistage approach developed for…

Read Paper →

Computer Science Preprint PDF DOI

Teen Vigilance: Navigating Risky Social Interactions on Discord

Elena Koung, Yunhan Liu, Zinan Zhang, Xinning Gui, Yubo Kou · 2026

Teenagers are avid users of Discord, a fast growing platform for synchronous communication where they often interact with strangers. Because Discord combines private DMs, semi-private voice channels, …

Read Paper →

Computer Science Preprint PDF DOI

Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

Sanjid Hasan, Risalat Labib, A H M Fuad, Bayazid Hasan · 2026

Although Automatic Speech Recognition (ASR) in Bengali has seen significant progress, processing long-duration audio and performing robust speaker diarization remain critical research gaps. To address…

Read Paper →

Computer Science Preprint PDF DOI

823-OLT @ BUET DL Sprint 4.0: Context-Aware Windowing for ASR and Fine-Tuned Speaker Diarization in Bengali Long Form Audio

Ratnajit Dhar, Arpita Mallik · 2026

Bengali, despite being one of the most widely spoken languages globally, remains underrepresented in long form speech technology, particularly in systems addressing transcription and speaker attributi…

Read Paper →

Computer Science Preprint PDF DOI

Bengali-Loop: Community Benchmarks for Long-Form Bangla ASR and Speaker Diarization

H.M. Shadman Tabib, Istiak Ahmmed Rifti, Abdullah Muhammed Amimul Ehsan, Somik Dasgupta, Md Zim Mim Siddiqee Sowdha, Abrar Jahin Sarker, Md. Rafiul Islam Nijamy, Tanvir Hossain, Mst. Metaly Khatun, Munzer Mahmood, Rakesh Debnath, Gourab Biswas, Asif Karim, Wahid Al Azad Navid, Masnoon Muztahid, Fuad Ahmed Udoy, Shahad Shahriar Rahman, Md. Tashdiqur Rahman Shifat, Most. Sonia Khatun, Mushfiqur Rahman, Md. Miraj Hasan, Anik Saha, Mohammad Ninad Mahmud Nobo, Soumik Bhattacharjee, Tusher Bhomik, Ahmmad Nur Swapnil, Shahriar Kabir · 2026

Bengali (Bangla) remains under-resourced in long-form speech technology despite its wide use. We present Bengali-Loop, two community benchmarks to address this gap: (1) a long-form ASR corpus of 191 r…

Read Paper →

Computer Science Preprint PDF DOI

Remarks on Algebraic Reconstruction of Types and Effects

Patrycja Balik, Szymon Jedras, Piotr Polesiuk · 2026

In their 1991 paper "Algebraic Reconstruction of Types and Effects," Pierre Jouvelot and David Gifford presented a type-and-effect reconstruction algorithm based on an algebraic structure of effects. …

Read Paper →

Computer Science Preprint PDF DOI

Re-educating Educated Ones: A Case Study on Chakma Language Revitalization in Chittagong Hill Tracts

Avijoy Chakma, Adity Khisa, Soham Khisa, Jannatun Noor, Sharifa Sultana · 2026

Indigenous languages face significant cultural oppression from official state languages, particularly in the Global South. We investigate the Bangladeshi Chakma language revitalization movement, a com…

Read Paper →

Computer Science Preprint PDF DOI

Zero-Shot to Zero-Lies: Detecting Bengali Deepfake Audio through Transfer Learning

Most. Sharmin Sultana Samu, Md. Rakibul Islam, Md. Zahid Hossain, Md. Kamrozzaman Bhuiyan, Farhad Uz Zaman · 2025

The rapid growth of speech synthesis and voice conversion systems has made deepfake audio a major security concern. Bengali deepfake detection remains largely unexplored. In this work, we study automa…

Read Paper →

Computer Science Preprint PDF DOI

Polynomial-Time Algorithms for Computing the Nucleolus: An Assessment

Holger I. Meinhardt · 2025

Recently, Maggiorano et al. (2025) claimed that they have developed a strongly polynomial-time combinatorial algorithm for the nucleolus in convex games that is based on the reduced game approach and …

Read Paper →

Computer Science Preprint PDF DOI

The Future of Food: How Artificial Intelligence is Transforming Food Manufacturing

Xu Zhou, Ivor Prado, AIFPDS participants, Ilias Tagkopoulos · 2025

Artificial intelligence is accelerating a new era of food innovation, connecting data from farm to consumer to improve formulation, processing, and health outcomes. Recent advances in deep learning, n…

Read Paper →

Computer Science Preprint PDF DOI

Enhancing LLM Code Generation Capabilities through Test-Driven Development and Code Interpreter

Sajed Jalil, Shuvo Saha, Hossain Mohammad Seym · 2025

Over the past few years, improving LLM code generation capabilities has been a key focus in NLP research. Despite Bengali having 242 million native speakers worldwide, it receives little attention whe…

Read Paper →

Computer Science Preprint PDF DOI

Assessing the Reliability of Large Language Models in the Bengali Legal Context: A Comparative Evaluation Using LLM-as-Judge and Legal Experts

Sabik Aftahee, A.F.M. Farhad, Arpita Mallik, Ratnajit Dhar, Jawadul Karim, Nahiyan Bin Noor, Ishmam Ahmed Solaiman · 2025

Accessing legal help in Bangladesh is hard. People face high fees, complex legal language, a shortage of lawyers, and millions of unresolved court cases. Generative AI models like OpenAI GPT-4.1 Mini,…

Read Paper →

Browse Research Papers

Analysis of AWW (Anganwadi Workers) Training Content, ILA (Incremental Learning Approach) Modules Following CDT (Component Display Theory)

AVID: A Benchmark for Omni-Modal Audio-Visual Inconsistency Understanding via Agent-Driven Construction

A Non-Probabilistic Game-Theoretic Information Theory Which Subsumes Probabilistic Channel Coding

Script Collapse in Multilingual ASR: Defining and Measuring Script Fidelity Rate

The Theorems of Dr. David Blackwell and Their Contributions to Artificial Intelligence

WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech

When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper

LLY Ricci Reweighting in Stochastic Block Models: Uniform Curvature Concentration and Finite-Horizon Tracking

An Investigation Into Various Approaches For Bengali Long-Form Speech Transcription and Bengali Speaker Diarization

Teen Vigilance: Navigating Risky Social Interactions on Discord

Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment

823-OLT @ BUET DL Sprint 4.0: Context-Aware Windowing for ASR and Fine-Tuned Speaker Diarization in Bengali Long Form Audio

Bengali-Loop: Community Benchmarks for Long-Form Bangla ASR and Speaker Diarization

Remarks on Algebraic Reconstruction of Types and Effects

Re-educating Educated Ones: A Case Study on Chakma Language Revitalization in Chittagong Hill Tracts

Zero-Shot to Zero-Lies: Detecting Bengali Deepfake Audio through Transfer Learning

Polynomial-Time Algorithms for Computing the Nucleolus: An Assessment

The Future of Food: How Artificial Intelligence is Transforming Food Manufacturing

Enhancing LLM Code Generation Capabilities through Test-Driven Development and Code Interpreter

Assessing the Reliability of Large Language Models in the Bengali Legal Context: A Comparative Evaluation Using LLM-as-Judge and Legal Experts

Browse by Category

Research Type

Publish Your Research