Memory in Computer Science — Research Repository

Computer Science Preprint PDF DOI

LLM4C2Rust: Large Language Models for Automated Memory-Safe Code Transpilation

Sarah Bedell, Nazanin Siavash, Armin Moin · 2026

Memory safety has long been a critical challenge in software engineering, particularly for legacy systems written in memory-unsafe languages such as C and C++. Rust, one of the youngest modern program…

Read Paper →

Computer Science Preprint PDF DOI

vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

Jayson Steffens · 2026

We present **vstash**, a local-first document memory system that combines vector similarity search with full-text keyword matching via Reciprocal Rank Fusion (RRF) and adaptive per-query IDF weighting…

Read Paper →

Computer Science Preprint PDF DOI

Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU

Jevin Jiang, Ying Chen, Blake A. Hechtman, Fenghui Zhang, Yarong Mu · 2026

Large Language Model (LLM) deployment is increasingly shifting to cost-efficient accelerators like Google's Tensor Processing Units (TPUs), prioritizing both performance and total cost of ownership (T…

Read Paper →

Computer Science Preprint PDF DOI

Agentic Microphysics: A Manifesto for Generative AI Safety

Federico Pierucci, Matteo Prandi, Marcantonio Bracale Syrnikov, Marcello Galisai, Piercosma Bisconti · 2026

This paper advances a methodological proposal for safety research in agentic AI. As systems acquire planning, memory, tool use, persistent identity, and sustained interaction, safety can no longer be …

Read Paper →

Computer Science Preprint PDF DOI

ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants

Haohui Mai, Xiaoyan Guo, Xiangyun Ding, Daifeng Li, Qiuchu Yu, Chenzhun Guo, Cong Wang, Jiacheng Zhao, Christos Kozyrakis, Binhang Yuan · 2026

LLM-based coding agents can generate functionally correct GPU kernels, yet their performance remains far below hand-optimized libraries on critical computations such as matrix multiplication, attentio…

Read Paper →

Computer Science Preprint PDF DOI

Applying SHAPR in AI-Assisted Research Software Development: Lessons Learnt from Building a Share Trading System

Ka Ching Chan · 2026

Generative AI is changing how research software is developed, but rapid AI-assisted development can weaken continuity, traceability, and methodological clarity. SHAPR (Solo, Human-centred, AI-assisted…

Read Paper →

Computer Science Preprint PDF DOI

Serving Chain-structured Jobs with Large Memory Footprints with Application to Large Foundation Model Serving

Tingyang Sun, Ting He, I-Hong Hou · 2026

As a current trend in Artificial Intelligence (AI), large foundation models are increasingly employed as the core of AI services. However, even after training, serving such models at scale remains a c…

Read Paper →

Computer Science Preprint PDF DOI

Sublinear Spectral Clustering Oracle with Little Memory

Ranran Shen, Xiaoyi Zhu, Pan Peng, Zengfeng Huang · 2026

We study the problem of designing \emph{sublinear spectral clustering oracles} for well-clusterable graphs. Such an oracle is an algorithm that, given query access to the adjacency list of a graph $G$…

Read Paper →

Computer Science Preprint PDF DOI

SAGER: Self-Evolving User Policy Skills for Recommendation Agent

Zhen Tao, Riwei Lai, Chenyun Yu, Weixin Chen, Li Chen, Beibei Kong, Lei Cheng, Chengxiang Zhuo, Zang Li, Qingqiang Sun · 2026

Large language model (LLM) based recommendation agents personalize what they know through evolving per-user semantic memory, yet how they reason remains a universal, static system prompt shared identi…

Read Paper →

Computer Science Preprint PDF DOI

Accelerating CRONet on AMD Versal AIE-ML Engines

Kaustubh Mhatre, Vedant Tewari, Aditya Ray, Farhan Khan, Ridwan Olabiyi, Ashif Iquebal, Aman Arora · 2026

Topology optimization is a computational method used to determine the optimal material distribution within a prescribed design domain, aiming to minimize structural weight while satisfying load and bo…

Read Paper →

Computer Science Preprint PDF DOI

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model

Adelekun Oluwademilade, Ademola Adedamola, Abiola Abdulhakeem, Akinpelu Azeezat, Eraiyetan Israel, Omotosho Oluwadunsin, Ibenye Ikechukwu, Ayuba Muhammad, Olusanya Olamide, Kamorudeen Amuda · 2026

Speech Emotion Recognition (SER) is the use of machines to detect the emotional state of humans based on the speech, which is gaining importance in natural human-computer interaction. Speech is a very…

Read Paper →

Computer Science Preprint PDF DOI

EdgeDetect: Importance-Aware Gradient Compression with Homomorphic Aggregation for Federated Intrusion Detection

Noor Islam S. Mohammad · 2026

Federated learning (FL) enables collaborative intrusion detection without raw data exchange, but conventional FL incurs high communication overhead from full-precision gradient transmission and remain…

Read Paper →

Computer Science Preprint PDF DOI

PlanB: Efficient Software IPv6 Lookup with Linearized $B^+$-Tree

Zhihao Zhang, Lanzheng Liu, Chen Chen, Huiba Li, Jiwu Shu, Windsor Hsu, Yiming Zhang · 2026

IP lookup via Longest Prefix Match (LPM) is critical for packet forwarding. Unfortunately, conventional lookup algorithms are inefficient for IPv6 Forwarding Information Bases (FIBs), which are charac…

Read Paper →

Computer Science Preprint PDF DOI

DEEP-GAP: Deep-learning Evaluation of Execution Parallelism in GPU Architectural Performance

Kathiravan Palaniappan · 2026

Modern datacenters increasingly rely on low-power, single-slot inference accelerators to balance performance, energy efficiency, and rack density constraints. The NVIDIA T4 GPU has become widely deplo…

Read Paper →

Computer Science Preprint PDF DOI

Fast Concurrent Primitives Despite Contention

Michael A. Bender, Guy E. Blelloch, Martin Farach-Colton, Yang Hu, Rob Johnson, Rotem Oshman, Renfei Zhou · 2026

We study the problem of constructing concurrent objects in a setting where $P$ processes run in parallel and interact through a shared memory that is subject to write contention. Our goal is to transf…

Read Paper →

Computer Science Preprint PDF DOI

Fleet: Hierarchical Task-based Abstraction for Megakernels on Multi-Die GPUs

Sangeeta Chowdhary, Ryan Swann, Sean Siddens, Muhammad Osama, Stephen Neuendorffer, Alexandru Dutu, Karthik Sangaiah, Sandeepa Bhuyan, Samuel Bayliss, Ganesh Dasika · 2026

Modern GPUs adopt chiplet-based designs with multiple private cache hierarchies, but current programming models (CUDA/HIP) expose a flat execution hierarchy that cannot express chiplet-level locality …

Read Paper →

Computer Science Preprint PDF DOI

Parallel R-tree-based Spatial Query Processing on a Commercial Processing-in-Memory System

Tasmia Jannat, Michael Gowanlock, Satish Puri · 2026

The growing volume of data in scientific domains has made spatial query processing increasingly challenging due to high data transfer costs across the memory hierarchy and limited memory bandwidth. To…

Read Paper →

Computer Science Preprint PDF DOI

Incidence Constraints in Hypergraph Partitioning on GPU

Marco Ronzani, Cristina Silvano · 2026

Hypergraph partitioning is a pervasive NP-hard problem, and accelerating its computation on GPU can both slice time-to-solution and raise quality of results. In this work, we implement a multi-level h…

Read Paper →

Computer Science Preprint PDF DOI

A Unified Model and Document Representation for On-Device Retrieval-Augmented Generation

Julian Killingback, Ofer Meshi, Henry Li, Hamed Zamani, Maryam Karimzadehgan · 2026

Traditional Retrieval-Augmented Generation (RAG) approaches generally assume that retrieval and generation occur on powerful servers removed from the end user. While this reduces local hardware constr…

Read Paper →

Computer Science Preprint PDF DOI

Interactive Exploration of Large-scale Streamlines of Vector Fields via a Curve Segment Neighborhood Graph

Nguyen Phan, Brian Kim, Adeel Zafar, Guoning Chen · 2026

Streamlines have been widely used to represent and analyze various steady vector fields. To sufficiently represent important features in complex vector fields (like flow), a large number of streamline…

Read Paper →

Browse Research Papers

LLM4C2Rust: Large Language Models for Automated Memory-Safe Code Transpilation

vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU

Agentic Microphysics: A Manifesto for Generative AI Safety

ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants

Applying SHAPR in AI-Assisted Research Software Development: Lessons Learnt from Building a Share Trading System

Serving Chain-structured Jobs with Large Memory Footprints with Application to Large Foundation Model Serving

Sublinear Spectral Clustering Oracle with Little Memory

SAGER: Self-Evolving User Policy Skills for Recommendation Agent

Accelerating CRONet on AMD Versal AIE-ML Engines

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model

EdgeDetect: Importance-Aware Gradient Compression with Homomorphic Aggregation for Federated Intrusion Detection

PlanB: Efficient Software IPv6 Lookup with Linearized $B^+$-Tree

DEEP-GAP: Deep-learning Evaluation of Execution Parallelism in GPU Architectural Performance

Fast Concurrent Primitives Despite Contention

Fleet: Hierarchical Task-based Abstraction for Megakernels on Multi-Die GPUs

Parallel R-tree-based Spatial Query Processing on a Commercial Processing-in-Memory System

Incidence Constraints in Hypergraph Partitioning on GPU

A Unified Model and Document Representation for On-Device Retrieval-Augmented Generation

Interactive Exploration of Large-scale Streamlines of Vector Fields via a Curve Segment Neighborhood Graph

Browse by Category

Research Type

Publish Your Research