Memory in Computer Science — Research Repository

Computer Science Preprint PDF DOI

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Parteek Jamwal, Minghao Shao, Boyuan Chen, Achyuta Muthuvelan, Asini Subanya, Boubacar Ballo, Kashish Satija, Mariam Shafey, Mohamed Mahmoud, Moncif Dahaji Bouffi, Pasindu Wickramasinghe, Siyona Goel, Yaakulya Sabbani, Hakim Hacid, Mthandazo Ndhlovu, Eleanna Kafeza, Sanjay Rawat, Muhammad Shafique · 2026

Large Language Models (LLMs) have demonstrated remarkable capabilities across various cybersecurity tasks, including vulnerability classification, detection, and patching. However, their potential in …

Read Paper →

Computer Science Preprint PDF DOI

Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM

Sravanth Kodavanti, Sowmya Vajrala, Srinivas Miriyala, Utsav Tiwari, Uttam Kumar, Utkarsh Kumar Mahawar, Achal Pratap Singh, Arya D, Narendra Mutyala, Vikram Nelvoy Rajendiran, Sharan Kumar Allur, Euntaik Lee, Dohyoung Kim, HyeonSu Lee, Gyusung Cho, JungBae Kim · 2026

Deploying large language models (LLMs) on smartphones poses significant engineering challenges due to stringent constraints on memory, latency, and runtime flexibility. In this work, we present a hard…

Read Paper →

Computer Science Preprint PDF DOI

Empowering Vocabulary Learning Through Teaching AI: Using LLMs as a Student to Perform Learning by Teaching in Vocabulary Acquisition

Tokio Uchida, Ko Watanabe, Andrew Vargo, Shoya Ishimaru, Ralph L. Rose, Ayaka Sugawara, Andreas Dengel, Koichi Kise · 2026

"Learning by Teaching (LbT)" helps learners deepen their understanding by explaining concepts to others, with questions playing a vital role in identifying knowledge gaps and reinforcing comprehension…

Read Paper →

Computer Science Preprint PDF DOI

LLM-Codec: Neural Audio Codec Meets Language Model Objectives

Ho-Lam Chung, Yiming Chen, Hung-yi Lee · 2026

Neural audio codecs are widely used as tokenizers for spoken language models, but they are optimized for waveform reconstruction rather than autoregressive prediction. This mismatch injects acoustical…

Read Paper →

Computer Science Preprint PDF DOI

AsyncSparse: Accelerating Sparse Matrix-Matrix Multiplication on Asynchronous GPU Architectures

Jie Liu, Huanzhi Pu, Zhiru Zhang · 2026

Sparse Matrix-Matrix Multiplication (SpMM) is a fundamental kernel across scientific computing and machine learning. While prior work accelerates SpMM using Tensor Cores, no existing sparse kernel exp…

Read Paper →

Computer Science Preprint PDF DOI

A novel LSTM music generator based on the fractional time-frequency feature extraction

Li Ya, Chen Wei, Li Xiulai, Yu Lei, Deng Xinyi, Chen Chaofan · 2026

In this paper, we propose a novel approach for generating music based on an artificial intelligence (AI) system. We analyze the features of music and use them to fit and predict the music. The fractio…

Read Paper →

Computer Science Preprint PDF DOI

Enabling AI ASICs for Zero Knowledge Proof

Jianming Tong, Jingtian Dang, Simon Langowski, Tianhao Huang, Asra Ali, Jeremy Kun, Jevin Jiang, Srinivas Devadas, Tushar Krishna · 2026

Zero-knowledge proof (ZKP) provers remain costly because multi-scalar multiplication (MSM) and number-theoretic transforms (NTTs) dominate runtime as they need significant computation. AI ASICs such a…

Read Paper →

Computer Science Preprint PDF DOI

AccelCIM: Systematic Dataflow Exploration for SRAM Compute-in-Memory Accelerator

Chenhao Xue, Yukun Wang, An Guo, Yuhui Shi, Jinwei Zhou, Xiping Dong, Yihan Yin, Yuanpeng Zhang, Tianyu Jia, Wei Gao, Qiang Wu, Xin Si, Jun Yang, Guangyu Sun · 2026

SRAM-based compute-in-memory (CIM) offers high computational density and energy efficiency for deep neural network (DNN) accelerators, but its limited capacity causes on/off-chip data movement overhea…

Read Paper →

Computer Science Preprint PDF DOI

Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference

Sanjeev Rao Ganjihal · 2026

Key-value (KV) cache memory management is the primary bottleneck limiting throughput and cost-efficiency in large-scale GPU inference serving. Current systems suffer from three compounding inefficienc…

Read Paper →

Computer Science Preprint PDF DOI

Explainable Attention-Based LSTM Framework for Early Detection of AI-Assisted Ransomware via File System Behavioral Analysis

Prabhudarshi Nayak, Gogulakrishnan Thiyagarajan, Debashree Priyadarshini, Vinay Bist, Rohan Swain · 2026

Ransomware continues to evolve as one of the most disruptive cyber threats, with recent variants increasingly leveraging automated and AI-assisted techniques to evade traditional signature-based defen…

Read Paper →

Computer Science Preprint PDF DOI

KnowPilot: Your Knowledge-Driven Copilot for Domain Tasks

Zekun Xi, Yichen Nie, Ziyan Jiang, Yujie Bao, Zhenqian Xu, Zhisong Qiu, Ziwen Xu, Shumin Deng · 2026

Despite the rapid advancement of generative agents, their deployment in real-world industry scenarios often encounters significant challenges due to a lack of domain-specific knowledge. To address thi…

Read Paper →

Computer Science Preprint PDF DOI

MemSearch-o1: Empowering Large Language Models with Reasoning-Aligned Memory Growth in Agentic Search

Sheng Zhang, Junyi Li, Yingyi Zhang, Pengyue Jia, Yichao Wang, Xiaowei Qian, Wenlin Zhang, Maolin Wang, Yong Liu, Xiangyu Zhao · 2026

Recent advances in large language models (LLMs) have scaled the potential for reasoning and agentic search, wherein models autonomously plan, retrieve, and reason over external knowledge to answer com…

Read Paper →

Computer Science Preprint PDF DOI

UCCL-Zip: Lossless Compression Supercharged GPU Communication

Shuang Ma, Chon Lam Lao, Zhiying Xu, Zhuang Wang, Ziming Mao, Delong Meng, Jia Zhen, Jun Wu, Ion Stoica, Yida Wang, Yang Zhou · 2026

The rapid growth of large language models (LLMs) has made GPU communication a critical bottleneck. While prior work reduces communication volume via quantization or lossy compression, these approaches…

Read Paper →

Computer Science Preprint PDF DOI

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

Justice Owusu Agyemang, Jerry John Kponyo, Obed Kwasi Somuah, Elliot Amponsah, Godfred Manu Addo Boakye, Kwame Opuni-Boachie Obour Agyekum · 2026

When multiple LLM coding agents share a rate-limited API endpoint, they exhibit resource contention patterns analogous to unscheduled OS processes competing for CPU, memory, and I/O. In a motivating i…

Read Paper →

Computer Science Preprint PDF DOI

From Necklaces to Coalitions: Fair and Self-Interested Distribution of Coalition Value Calculations

Terry R. Payne, Luke Riley · 2026

A key challenge in distributed coalition formation within characteristic function games is determining how to allocate the calculation of coalition values across a set of agents. The number of possibl…

Read Paper →

Computer Science Preprint PDF DOI

Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

Jiachen Qian · 2026

The evolution from static ranking models to Agentic Recommender Systems (Agentic RecSys) empowers AI agents to maintain long-term user profiles and autonomously plan service tasks. While this paradigm…

Read Paper →

Computer Science Preprint PDF DOI

Different Perspectives of Memory System Simulation

Pouya Esmaili-Dokht, Arash Yadegari, Victor Xirau, Julian Pavon, Adrian Cristal, Eduard Ayguade, Petar Radojkovic · 2026

Memory simulators are used to estimate application performance on advanced memory systems, yet they may exhibit significant discrepancies compared to real hardware. This paper investigates two key que…

Read Paper →

Computer Science Preprint PDF DOI

MEMRES: A Memory-Augmented Resolver with Confidence Cascade for Agentic Python Dependency Resolution

Dao Sy Duy Minh, Tran Chi Nguyen, Trung Kiet Huynh, Pham Phu Hoa, Nguyen Lam Phu Quy, Vu Nguyen · 2026

We present MEMRES, an agentic system for Python dependency resolution that introduces a multi-level confidence cascade where the LLM serves as the last resort. Our system combines: (1) a Self-Evolving…

Read Paper →

Computer Science Preprint PDF DOI

HieraSparse: Hierarchical Semi-Structured Sparse KV Attention

Haoxuan Wang, Chen Wang · 2026

The deployment of long-context Large Language Models (LLMs) poses significant challenges due to the intense computational cost of self-attention and the substantial memory overhead of the Key-Value Ca…

Read Paper →

Computer Science Preprint PDF DOI

enclawed: A Configurable, Sector-Neutral Hardening Framework for Single-User AI Assistant Gateways

Alfredo Metere · 2026

We present enclawed, a hard-fork hardening framework built on top of the OpenClaw single-user personal artificial intelligence (AI) assistant gateway. enclawed targets deployments that need attestable…

Read Paper →

Browse Research Papers

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM

Empowering Vocabulary Learning Through Teaching AI: Using LLMs as a Student to Perform Learning by Teaching in Vocabulary Acquisition

LLM-Codec: Neural Audio Codec Meets Language Model Objectives

AsyncSparse: Accelerating Sparse Matrix-Matrix Multiplication on Asynchronous GPU Architectures

A novel LSTM music generator based on the fractional time-frequency feature extraction

Enabling AI ASICs for Zero Knowledge Proof

AccelCIM: Systematic Dataflow Exploration for SRAM Compute-in-Memory Accelerator

Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference

Explainable Attention-Based LSTM Framework for Early Detection of AI-Assisted Ransomware via File System Behavioral Analysis

KnowPilot: Your Knowledge-Driven Copilot for Domain Tasks

MemSearch-o1: Empowering Large Language Models with Reasoning-Aligned Memory Growth in Agentic Search

UCCL-Zip: Lossless Compression Supercharged GPU Communication

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

From Necklaces to Coalitions: Fair and Self-Interested Distribution of Coalition Value Calculations

Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

Different Perspectives of Memory System Simulation

MEMRES: A Memory-Augmented Resolver with Confidence Cascade for Agentic Python Dependency Resolution

HieraSparse: Hierarchical Semi-Structured Sparse KV Attention

enclawed: A Configurable, Sector-Neutral Hardening Framework for Single-User AI Assistant Gateways

Browse by Category

Research Type

Publish Your Research