Expertini Research Research

Browse Research Papers

16,353+ open-access research outputs.

โœ• Clear
๐Ÿ” memory ๐Ÿ“‚ Computer Science
Showing 16353 results for "memory" in Computer Science
Computer Science Preprint PDF DOI

RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs

Parteek Jamwal, Minghao Shao, Boyuan Chen, Achyuta Muthuvelan, Asini Subanya, Boubacar Ballo, Kashish Satija, Mariam Shafey, Mohamed Mahmoud, Moncif Dahaji Bouffi, Pasindu Wickramasinghe, Siyona Goel, Yaakulya Sabbani, Hakim Hacid, Mthandazo Ndhlovu, Eleanna Kafeza, Sanjay Rawat, Muhammad Shafique ยท 2026

Large Language Models (LLMs) have demonstrated remarkable capabilities across various cybersecurity tasks, including vulnerability classification, detection, and patching. However, their potential in โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM

Sravanth Kodavanti, Sowmya Vajrala, Srinivas Miriyala, Utsav Tiwari, Uttam Kumar, Utkarsh Kumar Mahawar, Achal Pratap Singh, Arya D, Narendra Mutyala, Vikram Nelvoy Rajendiran, Sharan Kumar Allur, Euntaik Lee, Dohyoung Kim, HyeonSu Lee, Gyusung Cho, JungBae Kim ยท 2026

Deploying large language models (LLMs) on smartphones poses significant engineering challenges due to stringent constraints on memory, latency, and runtime flexibility. In this work, we present a hardโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Empowering Vocabulary Learning Through Teaching AI: Using LLMs as a Student to Perform Learning by Teaching in Vocabulary Acquisition

Tokio Uchida, Ko Watanabe, Andrew Vargo, Shoya Ishimaru, Ralph L. Rose, Ayaka Sugawara, Andreas Dengel, Koichi Kise ยท 2026

"Learning by Teaching (LbT)" helps learners deepen their understanding by explaining concepts to others, with questions playing a vital role in identifying knowledge gaps and reinforcing comprehensionโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

LLM-Codec: Neural Audio Codec Meets Language Model Objectives

Ho-Lam Chung, Yiming Chen, Hung-yi Lee ยท 2026

Neural audio codecs are widely used as tokenizers for spoken language models, but they are optimized for waveform reconstruction rather than autoregressive prediction. This mismatch injects acousticalโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

AsyncSparse: Accelerating Sparse Matrix-Matrix Multiplication on Asynchronous GPU Architectures

Jie Liu, Huanzhi Pu, Zhiru Zhang ยท 2026

Sparse Matrix-Matrix Multiplication (SpMM) is a fundamental kernel across scientific computing and machine learning. While prior work accelerates SpMM using Tensor Cores, no existing sparse kernel expโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

A novel LSTM music generator based on the fractional time-frequency feature extraction

Li Ya, Chen Wei, Li Xiulai, Yu Lei, Deng Xinyi, Chen Chaofan ยท 2026

In this paper, we propose a novel approach for generating music based on an artificial intelligence (AI) system. We analyze the features of music and use them to fit and predict the music. The fractioโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Enabling AI ASICs for Zero Knowledge Proof

Jianming Tong, Jingtian Dang, Simon Langowski, Tianhao Huang, Asra Ali, Jeremy Kun, Jevin Jiang, Srinivas Devadas, Tushar Krishna ยท 2026

Zero-knowledge proof (ZKP) provers remain costly because multi-scalar multiplication (MSM) and number-theoretic transforms (NTTs) dominate runtime as they need significant computation. AI ASICs such aโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

AccelCIM: Systematic Dataflow Exploration for SRAM Compute-in-Memory Accelerator

Chenhao Xue, Yukun Wang, An Guo, Yuhui Shi, Jinwei Zhou, Xiping Dong, Yihan Yin, Yuanpeng Zhang, Tianyu Jia, Wei Gao, Qiang Wu, Xin Si, Jun Yang, Guangyu Sun ยท 2026

SRAM-based compute-in-memory (CIM) offers high computational density and energy efficiency for deep neural network (DNN) accelerators, but its limited capacity causes on/off-chip data movement overheaโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference

Sanjeev Rao Ganjihal ยท 2026

Key-value (KV) cache memory management is the primary bottleneck limiting throughput and cost-efficiency in large-scale GPU inference serving. Current systems suffer from three compounding inefficiencโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Explainable Attention-Based LSTM Framework for Early Detection of AI-Assisted Ransomware via File System Behavioral Analysis

Prabhudarshi Nayak, Gogulakrishnan Thiyagarajan, Debashree Priyadarshini, Vinay Bist, Rohan Swain ยท 2026

Ransomware continues to evolve as one of the most disruptive cyber threats, with recent variants increasingly leveraging automated and AI-assisted techniques to evade traditional signature-based defenโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

KnowPilot: Your Knowledge-Driven Copilot for Domain Tasks

Zekun Xi, Yichen Nie, Ziyan Jiang, Yujie Bao, Zhenqian Xu, Zhisong Qiu, Ziwen Xu, Shumin Deng ยท 2026

Despite the rapid advancement of generative agents, their deployment in real-world industry scenarios often encounters significant challenges due to a lack of domain-specific knowledge. To address thiโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

MemSearch-o1: Empowering Large Language Models with Reasoning-Aligned Memory Growth in Agentic Search

Sheng Zhang, Junyi Li, Yingyi Zhang, Pengyue Jia, Yichao Wang, Xiaowei Qian, Wenlin Zhang, Maolin Wang, Yong Liu, Xiangyu Zhao ยท 2026

Recent advances in large language models (LLMs) have scaled the potential for reasoning and agentic search, wherein models autonomously plan, retrieve, and reason over external knowledge to answer comโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

UCCL-Zip: Lossless Compression Supercharged GPU Communication

Shuang Ma, Chon Lam Lao, Zhiying Xu, Zhuang Wang, Ziming Mao, Delong Meng, Jia Zhen, Jun Wu, Ion Stoica, Yida Wang, Yang Zhou ยท 2026

The rapid growth of large language models (LLMs) has made GPU communication a critical bottleneck. While prior work reduces communication volume via quantization or lossy compression, these approachesโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

Justice Owusu Agyemang, Jerry John Kponyo, Obed Kwasi Somuah, Elliot Amponsah, Godfred Manu Addo Boakye, Kwame Opuni-Boachie Obour Agyekum ยท 2026

When multiple LLM coding agents share a rate-limited API endpoint, they exhibit resource contention patterns analogous to unscheduled OS processes competing for CPU, memory, and I/O. In a motivating iโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

From Necklaces to Coalitions: Fair and Self-Interested Distribution of Coalition Value Calculations

Terry R. Payne, Luke Riley ยท 2026

A key challenge in distributed coalition formation within characteristic function games is determining how to allocate the calculation of coalition values across a set of agents. The number of possiblโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

Jiachen Qian ยท 2026

The evolution from static ranking models to Agentic Recommender Systems (Agentic RecSys) empowers AI agents to maintain long-term user profiles and autonomously plan service tasks. While this paradigmโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Different Perspectives of Memory System Simulation

Pouya Esmaili-Dokht, Arash Yadegari, Victor Xirau, Julian Pavon, Adrian Cristal, Eduard Ayguade, Petar Radojkovic ยท 2026

Memory simulators are used to estimate application performance on advanced memory systems, yet they may exhibit significant discrepancies compared to real hardware. This paper investigates two key queโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

MEMRES: A Memory-Augmented Resolver with Confidence Cascade for Agentic Python Dependency Resolution

Dao Sy Duy Minh, Tran Chi Nguyen, Trung Kiet Huynh, Pham Phu Hoa, Nguyen Lam Phu Quy, Vu Nguyen ยท 2026

We present MEMRES, an agentic system for Python dependency resolution that introduces a multi-level confidence cascade where the LLM serves as the last resort. Our system combines: (1) a Self-Evolvingโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

HieraSparse: Hierarchical Semi-Structured Sparse KV Attention

Haoxuan Wang, Chen Wang ยท 2026

The deployment of long-context Large Language Models (LLMs) poses significant challenges due to the intense computational cost of self-attention and the substantial memory overhead of the Key-Value Caโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

enclawed: A Configurable, Sector-Neutral Hardening Framework for Single-User AI Assistant Gateways

Alfredo Metere ยท 2026

We present enclawed, a hard-fork hardening framework built on top of the OpenClaw single-user personal artificial intelligence (AI) assistant gateway. enclawed targets deployments that need attestableโ€ฆ

Read Paper โ†’
โ† Prev Page 7 of 818 Next โ†’