Robert Fraser in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing

Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini · 2026

Multivector retrieval models achieve state-of-the-art effectiveness through fine-grained token-level representations, but their deployment incurs substantial computational and memory costs. Current so…

Computer Science Preprint PDF DOI

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

Prashant Kulkarni · 2026

Multi-turn prompt injection follows a known attack path -- trust-building, pivoting, escalation -- but text-level defenses miss covert attacks where individual turns appear benign. We show this attack pa…

Computer Science Preprint PDF DOI

Tailwind: A Practical Framework for Query Accelerators

Geoffrey X. Yu, Ryan Marcus, Tim Kraska · 2026

Relational database management systems (RDBMSes) can process general-purpose queries, but often have lower performance compared to custom-built solutions for specific queries. For example, consider a …

Computer Science Preprint PDF DOI

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

Muhammad Ihsan Al Hafiz, Artur Podobas · 2026

Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchron…

Computer Science Preprint PDF DOI

From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation

Guang Yang, Xing Hu, Xiang Chen, Xin Xi · 2026

Multimodal large language models (MLLMs) are increasingly used to translate visual artifacts into code, from UI mockups into HTML to scientific plots into Python scripts. A circuit diagram can be view…

Computer Science Preprint PDF DOI

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

Jin Xin Ng, Ori Livneh, Richard O'Grady, Josh Don, Peng Ding, Samuel Grossman, Luis Otero, Chris Kennelly, David Lo, Carlos Villavieja · 2026

Modern large multicore systems often run multiple workloads that share CPUs under schedulers such as Linux CFS. To keep CPUs busy, these schedulers load-balance runnable work, causing each workload to…

Computer Science Preprint PDF DOI

Can We Volunteer Out of the Peer Review Crisis?

Theo Tang, Toby Handfield, Julian Garcia · 2026

The volume of scientific manuscripts is growing faster than the capacity to evaluate them, yet the institutions that govern peer review have remained largely unchanged. The result is a widening mismat…

Computer Science Preprint PDF DOI

WOOTdroid: Whole-system Online On-device Tracing for Android

Simon Althaus, Nikolaos Alexopoulos, Max Muhlhauser, Christian Reuter, Ephraim Zimmer · 2026

System auditing on Android faces two problems. First, existing syscall tracers lose events under load, silently overwriting entries faster than a user space reader can drain them. Second, security-rel…

Computer Science Preprint PDF DOI

Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval

Arne Eichholtz, Yongkang Li, Jutte Vijverberg, Tobias Groot, Mohammad Aliannejadi · 2026

The Hypencoder, proposed by Killingback et al., is a retrieval framework that replaces the fixed inner-product scoring function used in standard bi-encoders with a query-specific neural network (the $…

Computer Science Preprint PDF DOI

What Is the Cost of Energy Monitoring? An Empirical Study on the Overhead of RAPL-Based Tools

Jeremy Diamond, Vincenzo Stoico · 2026

The Running Average Power Limit (RAPL) interface is widely used to estimate software energy consumption via CPU and DRAM counters, but tool design differences and high-frequency polling can introduce …

Computer Science Preprint PDF DOI

LLM-Guided Runtime Parameter Optimization for Energy-Efficient Model Inference

Katelyn Crumpacker, Dimitrios Nikolopoulos · 2026

Large Language Models (LLMs) have become an integral part of many real-world workflows. However, LLMs consume a lot of energy, which becomes a major concern at the scale of demand for these tools.…

Computer Science Preprint PDF DOI

RepoDoc: A Knowledge Graph-Based Framework to Automatic Documentation Generation and Incremental Updates

Dong Xu, Mingwei Liu, Xiwen Wang, Jianfeng Zhong, Zibin Zheng · 2026

Maintaining up-to-date, comprehensive documentation for large codebases is a persistent challenge. Recent progress in automated documentation has moved from template-based rules to large language mode…

Computer Science Preprint PDF DOI

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch

Hanna Foerster, Ilia Shumailov, Cheng Zhang, Yiren Zhao, Jamie Hayes, Robert Mullins · 2026

Dynamic quantization emerged as a practical approach to increase the utilization and efficiency of the machine learning serving flow. Unlike static quantization, which applies quantization offline, dy…

Computer Science Preprint PDF DOI

Efficient Listwise Reranking with Compressed Document Representations

Herve Dejean, Stephane Clinchant · 2026

Reranking, the process of refining the output from a first-stage retriever, is often considered computationally expensive, especially when using Large Language Models (LLMs). A common approach to miti…

Computer Science Preprint PDF DOI

Incremental Strongly Connected Components with Predictions

Ronald Deng, Samuel McCauley, Aidin Niaparast, Helia Niaparast, Bennett Ptak, Shirel Quintanilla, Shikha Singh, Nathan Vosburg · 2026

Algorithms with predictions is a growing area that aims to leverage machine-learned predictions to design faster beyond-worst-case algorithms. In this paper, we use this framework to design a learned …

Computer Science Preprint PDF DOI

Converting an Integer to a Decimal String in Under Two Nanoseconds

Jael Champagne Gareau, Daniel Lemire · 2026

Converting binary integers to variable-length decimal strings is a fundamental operation in computing. Conventional fast approaches rely on recursive division and small lookup tables. We propose a SIM…

Computer Science Preprint PDF DOI

Prime-Field PINI: Machine-Checked Composition Theorems for Post-Quantum NTT Masking

Ray Iskander, Khaled Kirah · 2026

This is Paper 6 of a series of formally-verified analyses of masked NTT hardware for post-quantum cryptography; Paper 1 [1] established structural dependency analysis of the QANARY platform, and Paper…

Computer Science Preprint PDF DOI

Bug-Report-Driven Fault Localization: Industrial Benchmarking and Lesson Learned at ABB Robotics

Pernilla Hall, Anton Ununger, Riccardo Rubei, Alessio Bucaioni · 2026

Software quality assurance remains a major challenge in industrial environments, where large-scale and long-lived systems inevitably accumulate defects. Identifying the location of a fault is often ti…

Computer Science Preprint PDF DOI

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents

Mengyao Du, Han Fang, Haokai Ma, Jiahao Chen, Kai Xu, Quanjun Yin, Ee-Chien Chang · 2026

Web agents have emerged as an effective paradigm for automating interactions with complex web environments, yet remain vulnerable to prompt injection attacks that embed malicious instructions into web…

Computer Science Preprint PDF DOI

ReTokSync: Self-Synchronizing Tokenization Disambiguation for Generative Linguistic Steganography

Yaofei Wang, Rui Wang, Weilong Pang, JiaLiang Han, Yuan Qi, Donghui Hu, Kejiang Chen · 2026

Generative linguistic steganography (GLS) enables covert communication by embedding secret messages into the natural language generation process. In practical deployment, however, GLS is vulnerable to…
