Stefan Otten in Computer Science — Research Repository

Computer Science Preprint PDF DOI

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

Yanting Wang, Chenlong Yin, Ying Chen, Jinyuan Jia · 2026

Long-context large language models (LLMs)-for example, Gemini-3.1-Pro and Qwen-3.5-are widely used to empower many real-world applications, such as retrieval-augmented generation, autonomous agents, a…

Read Paper →

Computer Science Preprint PDF DOI

Unsafe and Unused? A History of Utility Code in Mature Open Source Projects

Brandon Keller, Kaitlin Yandik, Angela Ngo, Andy Meneely · 2026

Filenames are a concise means of conveying information about source code to fellow developers. One such convention is util. Commonly understood to stand for "utility", filenames with the letters util …

Read Paper →

Computer Science Preprint PDF DOI

Index-Assisted Stratified Sampling for Online Aggregation

Yunnan Yu, Zhuoyue Zhao · 2026

Ad-hoc queries over frequently updated data in a flat schema are common in real-time data analysis applications and often require very low latency. Online aggregation can achieve so by providing appro…

Read Paper →

Computer Science Preprint PDF DOI

DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures

Sigma Jahan, Saurabh Singh Rajput, Tushar Sharma, Mohammad Masudur Rahman · 2026

Transformer models are widely deployed in critical AI applications, yet faults in their attention mechanisms, projections, and other internal components often degrade behavior silently without raising…

Read Paper →

Computer Science Preprint PDF DOI

Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles

Zainab Rehan, Christian Medeiros Adriano, Sona Ghahremani, Holger Giese · 2026

Rule-based systems remain central in safety-critical domains but often struggle with scalability, brittleness, and goal misspecification. These limitations can lead to reward hacking and failures in f…

Read Paper →

Computer Science Preprint PDF DOI

Tailwind: A Practical Framework for Query Accelerators

Geoffrey X. Yu, Ryan Marcus, Tim Kraska · 2026

Relational database management systems (RDBMSes) can process general-purpose queries, but often have lower performance compared to custom-built solutions for specific queries. For example, consider a …

Read Paper →

Computer Science Preprint PDF DOI

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

Shreya Chappidi, Jatinder Singh · 2026

Responsible AI research typically focuses on examining the use and impacts of deployed AI systems. Yet, there is currently limited visibility into the pre-deployment decisions to pursue building such …

Read Paper →

Computer Science Preprint PDF DOI

A Logic of Inability

Shanxia Wang · 2026

Coalition Logic is primarily concerned with what coalitions can achieve, whereas what coalitions cannot achieve -- their \emph{inability} -- has received comparatively little explicit attention. Thi…

Read Paper →

Computer Science Preprint PDF DOI

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

Jin Xin Ng, Ori Livneh, Richard O'Grady, Josh Don, Peng Ding, Samuel Grossman, Luis Otero, Chris Kennelly, David Lo, Carlos Villavieja · 2026

Modern large multicore systems often run multiple workloads that share CPUs under schedulers such as Linux CFS. To keep CPUs busy, these schedulers load-balance runnable work, causing each workload to…

Read Paper →

Computer Science Preprint PDF DOI

An Empirical Evaluation of Code Smell Detection in Angular Applications

Maykon Nunes, Emanuel Coutinho, Carla Bezerra, Ivan Machado · 2026

Angular is one of the most widely adopted frameworks for developing large-scale, dynamic web applications. As projects increase in scope and complexity, developers face growing challenges in managing …

Read Paper →

Computer Science Preprint PDF DOI

SimEval-IR: A Unified Toolkit and Benchmark Suite for Evaluating User Simulators and Search Sessions

Saber Zerhoudi · 2026

User simulators are increasingly central to interactive information retrieval, yet the community lacks standardized evaluation tools. Simulators serve two objectives, behavioral realism (matching real…

Read Paper →

Computer Science Preprint PDF DOI

NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

Shiyao Peng, Qianhe Zheng, Zhuodi Hao, Zichen Tang, Rongjin Li, Qing Huang, Jiayu Huang, Jiacheng Liu, Yifan Zhu, Haihong E · 2026

Although precise recall is a core objective in Retrieval-Augmented Generation (RAG), a critical oversight persists in the field: improvements in retrieval performance do not consistently translate to …

Read Paper →

Computer Science Preprint PDF DOI

ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training

Wenxiang Lin, Xinglin Pan, Ruibo Fan, Shaohuai Shi, Xiaowen Chu · 2026

Communication has emerged as a critical bottleneck in the distributed training of large language models (LLMs). While numerous approaches have been proposed to reduce communication overhead, the poten…

Read Paper →

Computer Science Preprint PDF DOI

Maximally Diverse Stable Matchings: Optimizing Arbitrary Institutional Objectives

Gergely Csaji, Zhaohong Sun · 2026

Stable matching theory is the foundation of centralized clearinghouses worldwide, from school choice programs to medical residency allocations. However, incorporating complex distributional goals-such…

Read Paper →

Computer Science Preprint PDF DOI

Feature-Centric Methodology for Analyzing Cross-Chain NFT Migration Compatibility

Mohd Sameen Chishti, Damilare Peter Oyinloye, Jingyue Li · 2026

Cross-chain NFT migration refers to the process of transferring digital assets along with their associated functionalities and guarantees between distinct blockchain platforms. However, architectural …

Read Paper →

Computer Science Preprint PDF DOI

Multifaceted Hero Developers and Bug-Fixing Outcomes Across Severity

Amit Kumar, Mahen Gandhi, Meher Bhardwaj, Hrishikesh Ethari, Sonali Agarwal · 2026

Open-source projects often rely on a small group of highly active contributors known as hero developers. Prior work shows that hero developers are common in many OSS and enterprise projects, yet who q…

Read Paper →

Computer Science Preprint PDF DOI

Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

Jiaju Chen, Chongming Gao, Chenxiao Fan, Haoyan Liu, Qingpeng Cai, Peng Jiang, Xiangnan He · 2026

Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target d…

Read Paper →

Computer Science Preprint PDF DOI

Social Media Data Toolkit: Standardization and Anonymization of Social Network Datasets

Ali Najafi, Letizia Iannucci, Mikko Kivela, Onur Varol · 2026

The rapid diversification of social media platforms and the increasing restrictions on official APIs have significantly complicated cross-platform analysis. Researchers are often forced to rely on het…

Read Paper →

Computer Science Preprint PDF DOI

Understanding Bugs in Template Engine-Based Applications: Symptoms, Root Causes, and Fix Patterns

Kai Gao, Yu Sun, Chang-ai Sun · 2026

Template engines are indispensable components in modern software ecosystems, enabling the generation of structured documents and scripts across domains such as web development, Infrastructure as Code,…

Read Paper →

Computer Science Preprint PDF DOI

Tail-aware N-version Machine Learning Models for Reliable API Recommendation

Aoi Matsuda, Fumio Machida, David Lo · 2026

Machine learning (ML)-based API recommendation helps developers efficiently identify suitable APIs to complement the application code. However, code datasets used to train ML models often exhibit a lo…

Read Paper →

Browse Research Papers

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

Unsafe and Unused? A History of Utility Code in Mature Open Source Projects

Index-Assisted Stratified Sampling for Online Aggregation

DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures

Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles

Tailwind: A Practical Framework for Query Accelerators

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

A Logic of Inability

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

An Empirical Evaluation of Code Smell Detection in Angular Applications

SimEval-IR: A Unified Toolkit and Benchmark Suite for Evaluating User Simulators and Search Sessions

NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training

Maximally Diverse Stable Matchings: Optimizing Arbitrary Institutional Objectives

Feature-Centric Methodology for Analyzing Cross-Chain NFT Migration Compatibility

Multifaceted Hero Developers and Bug-Fixing Outcomes Across Severity

Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

Social Media Data Toolkit: Standardization and Anonymization of Social Network Datasets

Understanding Bugs in Template Engine-Based Applications: Symptoms, Root Causes, and Fix Patterns

Tail-aware N-version Machine Learning Models for Reliable API Recommendation

Browse by Category

Research Type

Publish Your Research