Toby Kenney in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Why Self-Supervised Encoders Want to Be Normal

Yuval Domb · 2026

We develop a geometric and information-theoretic framework for encoder-decoder learning built on the Information Bottleneck (IB) principle. Recasting IB as a rate-distortion problem with Kullback-Leib…

Read Paper →

Computer Science Preprint PDF DOI

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow

Sina Heidari, Dimitrios S. Nikolopoulos · 2026

Deep learning compilers and vendor libraries deliver strong baseline performance but are bounded by finite, engineer-curated catalogs. When these omit needed optimizations, practitioners substitute ha…

Read Paper →

Computer Science Preprint PDF DOI

RAG-Enhanced Kernel-Based Heuristic Synthesis (RKHS): A Structured Methodology Using Large Language Models for Hardware Design

Shiva Ahir, Alex Doboli · 2026

Heuristic design upholds modern electronic design automation (EDA) tools, yet crafting effective placement, routing, and scheduling strategies entails substantial expertise. We study how large languag…

Read Paper →

Computer Science Preprint PDF DOI

CUDA Kernel Optimization and Counter-Free Performance Analysis for Depthwise Convolution in Cloud Environments

Huriyeh Babak, Melanie Schaller · 2026

Efficient GPU execution of convolution operators is governed by memory-access efficiency, on-chip data reuse, and execution mapping rather than arithmetic throughput alone. This paper presents a contr…

Read Paper →

Computer Science Preprint PDF DOI

Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents

Phat T. Tran-Truong, Xuan-Bach Le · 2026

Large language model (LLM) agents increasingly operate as sequential software systems, but their reliability is often summarized by scalar benchmark metrics. Metrics such as pass$@k$, pass$^k$, and th…

Read Paper →

Computer Science Preprint PDF DOI

#MakeBeefGreatAgain: A Cross-Platform Analysis of Early #MAHA Discourse

Haoning Xue, Yue Li, Benjamin A. Lyons, Andy J. King · 2026

Make America Healthy Again (MAHA) is a health-related campaign slogan proposed by Robert F. Kennedy Jr. and later incorporated into the political coalition of President Trump. While #MAHA quickly circ…

Read Paper →

Computer Science Preprint PDF DOI

FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion

Heman Shakeri, Behnaz Moradi-Jamei, Aram Vajdi, Ehsan Ardjmand · 2026

Non-Markovian (renewal) epidemic simulation on multi-million-node contact networks is essential for realistic forecasting under general age-dependent holding-time distributions (log-normal, Weibull, E…

Read Paper →

Computer Science Preprint PDF DOI

Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI

Alessandro G. Buda, Giuseppe Primiero, Leonardo Ceragioli, Melissa Antonelli · 2026

Generative AI systems are known to amplify biases present in their training data. While several inference-time mitigation strategies have been proposed, they remain largely empirical and lack formal g…

Read Paper →

Computer Science Preprint PDF DOI

Near-Codewords Aware Bit Flipping Decoding of QC-MDPC Codes

Alessio Baldelli, Marco Baldi, Davide De Zuane, Paolo Santini · 2026

Bit-Flipping (BF) decoders are a family of decoders widely employed in post-quantum cryptographic schemes based on Quasi-Cyclic Moderate-Density Parity-Check (QC-MDPC) codes, such as BIKE. BF decoders…

Read Paper →

Computer Science Preprint PDF DOI

From Craft to Kernel: A Governance-First Execution Architecture and Semantic ISA for Agentic Computers

Xiangyu Wen, Yuang Zhao, Xiaoyu Xu, Lingjun Chen, Changran Xu, Shu Chi, Jianrong Ding, Zeju Li, Haomin Li, Li Jiang, Fangxin Liu, Qiang Xu · 2026

The transition of agentic AI from brittle prototypes to production systems is stalled by a pervasive crisis of craft. We suggest that the prevailing orchestration paradigm-delegating the system contro…

Read Paper →

Computer Science Preprint PDF DOI

Governed MCP: Kernel-Level Tool Governance for AI Agents via Logit-Based Safety Primitives

Daeyeon Son · 2026

AI agents increasingly call external tools (file system, network, APIs) through the Model Context Protocol (MCP). These tool calls are the agent's syscalls -- privileged operations with side effects o…

Read Paper →

Computer Science Preprint PDF DOI

Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU

Jevin Jiang, Ying Chen, Blake A. Hechtman, Fenghui Zhang, Yarong Mu · 2026

Large Language Model (LLM) deployment is increasingly shifting to cost-efficient accelerators like Google's Tensor Processing Units (TPUs), prioritizing both performance and total cost of ownership (T…

Read Paper →

Computer Science Preprint PDF DOI

ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems

Daeyeon Son · 2026

An OS kernel that runs LLM inference internally can read logit distributions before any text is generated and act on them as a governance primitive. This paper presents ProbeLogits, a kernel-level ope…

Read Paper →

Computer Science Preprint PDF DOI

Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary Search

Daniel Nichols, Konstantinos Parasyris, Caetano Melone, Tal Ben-Nun, Giorgis Georgakoudis, Harshitha Menon · 2026

As high-performance computing and AI workloads become increasingly dependent on GPUs, maintaining high performance across rapidly evolving hardware generations has become a major challenge. Developers…

Read Paper →

Computer Science Preprint PDF DOI

WaveTune: Wave-aware Bilinear Modeling for Efficient GPU Kernel Auto-tuning

Kaixuan Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang · 2026

The rapid adoption of Large Language Models (LLMs) has made GPU inference efficiency an increasingly critical system concern. The runtime of LLM workloads is largely dominated by tile-based kernels, p…

Read Paper →

Computer Science Preprint PDF DOI

Tessera: Unlocking Heterogeneous GPUs through Kernel-Granularity Disaggregation

Tiancheng Hu, Jin Qin, Zheng Wang, Junhao Hu, Yuzheng Wang, Lei Chen, Yizhou Shan, Mingxing Zhang, Ting Cao, Chunwei Xia, Huimin Cui, Tao Xie, Chenxi Wang · 2026

Disaggregation maps parts of an AI workload to different types of GPUs, offering a path to utilize modern heterogeneous GPU clusters. However, existing solutions operate at a coarse granularity and ar…

Read Paper →

Computer Science Preprint PDF DOI

eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming

Mahdi Alizadeh · 2026

User-space Adaptive Bitrate (ABR) algorithms cannot see the transport layer signals that matter most, such as minimum RTT and instantaneous delivery rate, and they respond to network changes only afte…

Read Paper →

Computer Science Preprint PDF DOI

Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair

Luyao Bai, Kenan Alghythee, Hang Zhang, Xiaoguang Wang · 2026

Linux kernel bug repair is typically approached as a direct mapping from crash reports to code patches. In practice, however, kernel fixes undergo iterative revision on mailing lists before acceptance…

Read Paper →

Computer Science Preprint PDF DOI

From 8 Seconds to 370ms: Kernel-Fused SAR Imaging on Apple Silicon via Single-Dispatch FFT Pipelines

Mohamed Amine Bergach · 2026

We present the first kernel-fused SAR Range Doppler pipeline on any GPU platform. By fusing FFT, matched-filter multiply, and IFFT into a single Metal compute dispatch -- keeping all intermediate data…

Read Paper →

Computer Science Preprint PDF DOI

Proceedings of the 7th Workshop on Models for Formal Analysis of Real Systems

Maurice H. ter Beek (CNR-ISTI, Pisa, Italy), Gregor Gossler (INRIA, Univ. Grenoble Alpes, Grenoble, France) · 2026

These proceedings contain the papers that were presented at the 7th Workshop on Models for Formal Analysis of Real Systems (MARS 2026), which took place on 12 April 2026 in Turin, Italy, as a satellit…

Read Paper →

Browse Research Papers

Why Self-Supervised Encoders Want to Be Normal

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow

RAG-Enhanced Kernel-Based Heuristic Synthesis (RKHS): A Structured Methodology Using Large Language Models for Hardware Design

CUDA Kernel Optimization and Counter-Free Performance Analysis for Depthwise Convolution in Cloud Environments

Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents

#MakeBeefGreatAgain: A Cross-Platform Analysis of Early #MAHA Discourse

FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion

Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI

Near-Codewords Aware Bit Flipping Decoding of QC-MDPC Codes

From Craft to Kernel: A Governance-First Execution Architecture and Semantic ISA for Agentic Computers

Governed MCP: Kernel-Level Tool Governance for AI Agents via Logit-Based Safety Primitives

Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU

ProbeLogits: Kernel-Level LLM Inference Primitives for AI-Native Operating Systems

Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary Search

WaveTune: Wave-aware Bilinear Modeling for Efficient GPU Kernel Auto-tuning

Tessera: Unlocking Heterogeneous GPUs through Kernel-Granularity Disaggregation

eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming

Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair

From 8 Seconds to 370ms: Kernel-Fused SAR Imaging on Apple Silicon via Single-Dispatch FFT Pipelines

Proceedings of the 7th Workshop on Models for Formal Analysis of Real Systems

Browse by Category

Research Type

Publish Your Research