Osbert Bastani in Computer Science — Research Repository

Computer Science Preprint PDF DOI

FGDM: Reasoning Aware Multi-Agentic Framework for Software Bug Detection using Chain of Thought and Tree of Thought Prompting

Srita Padmanabhuni, Bhargavi Karuturi, Jerusha Karen Indupalli, Santhan Reddy Chilla, Vivek Yelleti · 2026

Deep Learning methods are becoming prominent in automated software bug detection; however, they lack the global understanding of the given code. Consequently, their performance tends to degrade, espec…

Computer Science Preprint PDF DOI

RAS: a Reliability Oriented Metric for Automatic Speech Recognition

Wenbin Huang, Yuhang Qiu, Bohan Li, Yiwei Guo, Jing Peng, Hankun Wang, Xie Chen, Kai Yu · 2026

Automatic speech recognition systems often produce confident yet incorrect transcriptions under noisy or ambiguous conditions, which can be misleading for both users and downstream applications. Stand…

Computer Science Preprint PDF DOI

FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion

Heman Shakeri, Behnaz Moradi-Jamei, Aram Vajdi, Ehsan Ardjmand · 2026

Non-Markovian (renewal) epidemic simulation on multi-million-node contact networks is essential for realistic forecasting under general age-dependent holding-time distributions (log-normal, Weibull, E…

Computer Science Preprint PDF DOI

Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

Francois Remy · 2026

Reliable biomedical and clinical retrieval requires more than strong ranking performance: it requires a practical way to find systematic model failures and curate the training evidence needed to corre…

Computer Science Preprint PDF DOI

Calibrated Abstention for Reliable TCR–pMHC Binding Prediction under Epitope Shift

Arman Bekov, Timur Bekzhanov, Bekzat Sadykov · 2026

Predicting T-cell receptor (TCR)–peptide-MHC (pMHC) binding is central to vaccine design and T-cell therapy, yet deployed models frequently encounter epitopes unseen during training, causing silent o…

Computer Science Preprint PDF DOI

Participatory, not Punitive: Student-Driven AI Policy Recommendations in a Design Classroom

Kaoru Seki, Manisha Vijay, Yasmine Kotturi · 2026

Generative AI is reshaping education, yet most university AI policies are written without students and focus on penalizing misuse. This top-down approach sidelines those most affected from decisions t…

Computer Science Preprint PDF DOI

Reproduction Beyond Benchmarks: ConstBERT and ColBERT-v2 Across Backends and Query Distributions

Utshab Kumar Ghosh, Ashish David, Shubham Chatterjee · 2026

Reproducibility must validate architectural robustness, not just numerical accuracy. We evaluate ColBERT-v2 and ConstBERT across five dimensions, finding that while ConstBERT reproduces within 0.05% M…

Computer Science Preprint PDF DOI

HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation

Jian Zhu, Jianwei Cui, Shihao Chen, Yubang Zhang, Cheng Luo · 2026

We present HAFM, a system that generates instrumental music audio to accompany input vocals. Given isolated singing voice, HAFM produces a coherent instrumental accompaniment that can be directly mixe…

Computer Science Preprint PDF DOI

BRASP: Boolean Range Queries over Encrypted Spatial Data with Access and Search Pattern Privacy

Jing Zhang, Ganxuan Yang, Yifei Yang, Siqi Wen, Zhengyang Qiu · 2026

Searchable Encryption (SE) enables users to query outsourced encrypted data while preserving data confidentiality. However, most efficient schemes still leak the search pattern and access pattern, whi…

Computer Science Preprint PDF DOI

Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision

Hong Yi Lin, Chunhua Liu, Haoyu Gao, Patanamon Thongtanunam, Christoph Treude · 2026

In today's AI-assisted software engineering landscape, developers increasingly depend on LLMs that are highly capable, yet inherently imperfect. The tendency of these models to produce incorrect outpu…

Computer Science Preprint PDF DOI

Arch: An AI-Native Hardware Description Language for Register-Transfer Clocked Hardware Design

Shuqing Zhao · 2026

We present Arch (AI-native Register-transfer Clocked Hardware), a hardware description language for micro-architecture specification and AI-assisted code generation. Arch provides first-class construc…

Computer Science Preprint PDF DOI

SemLink: A Semantic-Aware Automated Test Oracle for Hyperlink Verification using Siamese Sentence-BERT

Guan-Yan Yang, Wei-Ling Wen, Shu-Yuan Ku, Farn Wang, Kuo-Hui Yeh · 2026

Web applications rely heavily on hyperlinks to connect disparate information resources. However, the dynamic nature of the web leads to link rot, where targets become unavailable, and more insidiously…

Computer Science Preprint PDF DOI

Systematic Integration of Digital Twins and Constrained LLMs for Interpretable Cyber-Physical Anomaly Detection

Konstantinos E. Kampourakis, Vasileios Gkioulos, Sokratis Katsikas · 2026

Cyber attacks targeting Industrial Control Systems (ICS) have become increasingly sophisticated and hard to identify. Detecting such attacks requires integrating low-level behavioral cues with high-le…

Computer Science Preprint PDF DOI

APEX: Agent Payment Execution with Policy for Autonomous Agent API Access

Mohd Safwan Uddin, Mohammed Mouzam, Mohammed Imran, Syed Badar Uddin Faizan · 2026

Autonomous agents are moving beyond simple retrieval tasks to become economic actors that invoke APIs, sequence workflows, and make real-time decisions. As this shift accelerates, API providers need r…

Computer Science Preprint PDF DOI

FGR-ColBERT: Identifying Fine-Grained Relevance Tokens During Retrieval

Antonin Jarolim, Martin Fajcik · 2026

Document retrieval identifies relevant documents but does not provide fine-grained evidence cues, such as specific relevant spans. A possible solution is to apply an LLM after retrieval; however, this…

Computer Science Preprint PDF DOI

SAGAI-MID: A Generative AI-Driven Middleware for Dynamic Runtime Interoperability

Oliver Aleksander Larsen, Mahyar T. Moghaddam · 2026

Modern distributed systems integrate heterogeneous services, REST APIs with different schema versions, GraphQL endpoints, and IoT devices with proprietary payloads that suffer from persistent schema m…

Computer Science Preprint PDF DOI

Investigation on the Robustness of Acoustic Foundation Models on Post Exercise Speech

Xiangyuan Xue, Yuyu Wang, Ruijie Yao, Xiaoyue Ni, Xiaofan Jiang, Jingping Nie · 2026

Automatic speech recognition (ASR) has been extensively studied on neutral and stationary speech, yet its robustness under post-exercise physiological shift remains underexplored. Compared with restin…

Computer Science Preprint PDF DOI

Context-Aware Phishing Email Detection Using Machine Learning and NLP

Amitabh Chakravorty, Matthew Price, Nelly Elsayed, Zag ElSayed · 2026

Phishing attacks remain among the most prevalent cybersecurity threats, causing significant financial losses for individuals and organizations worldwide. This paper presents a machine learning-based p…

Computer Science Preprint PDF DOI

Sovereign Context Protocol: An Open Attribution Layer for Human-Generated Content in the Age of Large Language Models

Praneel Panchigar, Torlach Rush, Matthew Canabarro · 2026

Large Language Models (LLMs) consume vast quantities of human-generated content for both training and real-time inference, yet the creators of that content remain largely invisible in the value chain.…

Computer Science Preprint PDF DOI

ColBERT-Att: Late-Interaction Meets Attention for Enhanced Retrieval

Raj Nath Patel, Sourav Dutta · 2026

Vector embeddings from pre-trained language models form a core component in Neural Information Retrieval systems across a multitude of knowledge extraction tasks. The paradigm of late interaction, int…
