Stephen Styles in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing

Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini · 2026

Multivector retrieval models achieve state-of-the-art effectiveness through fine-grained token-level representations, but their deployment incurs substantial computational and memory costs. Current so…

Read Paper →

Computer Science Preprint PDF DOI

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

Shreya Chappidi, Jatinder Singh · 2026

Responsible AI research typically focuses on examining the use and impacts of deployed AI systems. Yet, there is currently limited visibility into the pre-deployment decisions to pursue building such …

Read Paper →

Computer Science Preprint PDF DOI

SimEval-IR: A Unified Toolkit and Benchmark Suite for Evaluating User Simulators and Search Sessions

Saber Zerhoudi · 2026

User simulators are increasingly central to interactive information retrieval, yet the community lacks standardized evaluation tools. Simulators serve two objectives, behavioral realism (matching real…

Read Paper →

Computer Science Preprint PDF DOI

WOOTdroid: Whole-system Online On-device Tracing for Android

Simon Althaus, Nikolaos Alexopoulos, Max Muhlhauser, Christian Reuter, Ephraim Zimmer · 2026

System auditing on Android faces two problems. First, existing syscall tracers lose events under load, silently overwriting entries faster than a user space reader can drain them. Second, security-rel…

Read Paper →

Computer Science Preprint PDF DOI

LLM-as-a-Judge for Human-AI Co-Creation: A Reliability-Aware Evaluation Framework for Coding

Md Faizul Ibne Amin, Yutaka Watanobe, Daniel M. Muepu, Haruto Suzuki, Kenta Nanaumi, Md Mostafizer Rahman · 2026

LLMs are increasingly employed both as judges for evaluating open-ended outputs and as co-creation partners in AI-assisted programming; yet rigorous evaluation in human-AI co-creation settings remains…

Read Paper →

Computer Science Preprint PDF DOI

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments

Jiaju Chen, Jinghua Piao, Xia Xu, Songwei Li, Tong Xia, Xiangnan He, Yong Li · 2026

A long-standing challenge in economics lies not in the lack of intuition, but in the difficulty of translating intuitive insights into verifiable research. To address this challenge, we introduce Agen…

Read Paper →

Computer Science Preprint PDF DOI

Reproducing Adaptive Reranking for Reasoning-Intensive IR

Mandeep Rathee, V Venktesh, Sean MacAvaney, Avishek Anand · 2026

The classical cascading pipeline of retrieve--rerank suffers from a bounded recall problem, stemming from limitations of the first-stage retriever. Most current approaches address the bounded recall p…

Read Paper →

Computer Science Preprint PDF DOI

A Reproducibility Study of LLM-Based Query Reformulation

Amin Bigdeli, Radin Hamidi Rad, Hai Son Le, Mert Incesu, Negar Arabzadeh, Charles L. A. Clarke, Ebrahim Bagheri · 2026

Large Language Models (LLMs) are now widely used for query reformulation and expansion in Information Retrieval, with many studies reporting substantial effectiveness gains. However, these results are…

Read Paper →

Computer Science Preprint PDF DOI

VitaLLM: A Versatile, Ultra-Compact Ternary LLM Accelerator with Dependency-Aware Scheduling

Zi-Wei Lin, Tian-Sheuan Chang · 2026

Deploying Large Language Models (LLMs) on resource-constrained edge devices faces critical bottlenecks in memory bandwidth and power consumption. While ternary quantization (e.g., BitNet b1.58) signif…

Read Paper →

Computer Science Preprint PDF DOI

From Notepad AI to Social Media: How Can Text Style Transformation Mitigate Social Harm?

Syed Mhamudul Hasan, Mohd. Farhan Israk Soumik, Abdur R. Shahid · 2026

The rapid proliferation of harmful and emotionally damaging content on social media platforms has intensified concerns regarding societal harm. While content moderation efforts primarily focus on dete…

Read Paper →

Computer Science Preprint PDF DOI

SynSQL: Synthesizing Relational Databases for Robust Evaluation of Text-to-SQL Systems

Mohammadamin Habibollah, Davood Rafiei · 2026

Evaluating text-to-SQL systems remains largely fragile: correctness is typically judged by executing predicted and gold SQL queries on a single static database, even though the same queries may behave…

Read Paper →

Computer Science Preprint PDF DOI

Efficient Training on Multiple Consumer GPUs with RoundPipe

Yibin Luo, Shiwei Gao, Huichuan Zheng, Youyou Lu, Jiwu Shu · 2026

Fine-tuning Large Language Models (LLMs) on consumer-grade GPUs is highly cost-effective, yet constrained by limited GPU memory and slow PCIe interconnects. Pipeline parallelism combined with CPU offl…

Read Paper →

Computer Science Preprint PDF DOI

Resume-ing Control: (Mis)Perceptions of Agency Around GenAI Use in Recruiting Workflows

Sajel Surati, Rosanna Bellini, Emily Black · 2026

When generative AI (genAI) systems are used in high-stakes decision-making, its recommended role is to aid, rather than replace, human decision-making. However, there is little empirical exploration o…

Read Paper →

Computer Science Preprint PDF DOI

A Semantic Quantum Circuit Cache for Scalable and Distributed Quantum-Classical Workflows

Mar Tejedor, Javier Conejero, Rosa M. Badia · 2026

Hybrid quantum--classical workflows often execute large ensembles of circuits that differ syntactically but implement identical operations, leading to substantial redundant computation. To address thi…

Read Paper →

Computer Science Preprint PDF DOI

Catching the Fly: Practical Challenges in Making Blockchain FlyClient Real

Pericle Perazzo, Dario Capecchi · 2026

FlyClient is a lightweight blockchain verification protocol that enables proof-of-work validation using minimal data, making it ideal for resource-constrained environments like mobile wallets, Interne…

Read Paper →

Computer Science Preprint PDF DOI

A Toolkit for Detecting Spurious Correlations in Speech Datasets

Lara Gauder, Pablo Riera, Andrea Slachevsky, Gonzalo Forno, Adolfo M. Garcia, Luciana Ferrer · 2026

We introduce a toolkit for uncovering spurious correlations between recording characteristics and target class in speech datasets. Spurious correlations may arise due to heterogeneous recording condit…

Read Paper →

Computer Science Preprint PDF DOI

TDD Governance for Multi-Agent Code Generation via Prompt Engineering

Tarlan Hasanli, Shahbaz Siddeeq, Bishwash Khanal, Pyry Kotilainen, Tommi Mikkonen, Pekka Abrahamsson · 2026

Large language models (LLMs) accelerate software development but often exhibit instability, non-determinism, and weak adherence to development discipline in unconstrained workflows. While test-driven …

Read Paper →

Computer Science Preprint PDF DOI

Graph Construction and Matching for Imperative Programs using Neural and Structural Methods

Arshad Beg, Diarmuid O'Donoghue, Rosemary Monahan · 2026

Reusing verification artefacts requires identifying structural and semantic similarities across programs and their specifications. In this paper, we focus on graph construction as a foundational step …

Read Paper →

Computer Science Preprint PDF DOI

PICKLES: a Natural Language Framework for Requirement Specification and Model-Based Testing

Maria Belen Rodriguez, Petra van den Bos · 2026

This paper combines methods from the fields of Model-Based Testing (MBT) and Behaviour-Driven Development (BDD) to define a testing approach with human-readable specifications and test cases, as in BD…

Read Paper →

Computer Science Preprint PDF DOI

RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates

Dong Xu, Mingwei Liu, Xiwen Wang, Jianfeng Zhong, Zibin Zheng · 2026

Maintaining up-to-date, comprehensive documentation for large codebases is a persistent challenge. Recent progress in automated documentation has moved from template-based rules to large language mode…

Read Paper →

Browse Research Papers

Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing

To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems

SimEval-IR: A Unified Toolkit and Benchmark Suite for Evaluating User Simulators and Search Sessions

WOOTdroid: Whole-system Online On-device Tracing for Android

LLM-as-a-Judge for Human-AI Co-Creation: A Reliability-Aware Evaluation Framework for Coding

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments

Reproducing Adaptive Reranking for Reasoning-Intensive IR

A Reproducibility Study of LLM-Based Query Reformulation

VitaLLM: A Versatile, Ultra-Compact Ternary LLM Accelerator with Dependency-Aware Scheduling

From Notepad AI to Social Media: How Can Text Style Transformation Mitigate Social Harm?

SynSQL: Synthesizing Relational Databases for Robust Evaluation of Text-to-SQL Systems

Efficient Training on Multiple Consumer GPUs with RoundPipe

Resume-ing Control: (Mis)Perceptions of Agency Around GenAI Use in Recruiting Workflows

A Semantic Quantum Circuit Cache for Scalable and Distributed Quantum-Classical Workflows

Catching the Fly: Practical Challenges in Making Blockchain FlyClient Real

A Toolkit for Detecting Spurious Correlations in Speech Datasets

TDD Governance for Multi-Agent Code Generation via Prompt Engineering

Graph Construction and Matching for Imperative Programs using Neural and Structural Methods

PICKLES: a Natural Language Framework for Requirement Specification and Model-Based Testing

RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates

Browse by Category

Research Type

Publish Your Research