Memory in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Remotely programming the weights of a spintronic neural network by a radiofrequency broadcast signal

M. Menshawy, D. Sanz-Hernandez, L. Mazza, V. Puliafito, G. Finocchio, A. Jenkins, R. Ferreira, L. Benetti, J. Grollier, F.A. Mizrahi · 2026

Selectively programming a large number of non-volatile synaptic weights without compromising scalability is a key challenge for in-memory computing. Here, we demonstrate remote programming of synaptic w…

Computer Science Preprint PDF DOI

DECOFFEE: Decentralized Reinforcement Learning for Time-critical Workload Offloading and Energy Efficiency across the Computing Continuum

Anastasios Giannopoulos, Sotirios Spantideas, Panagiotis Trakadas · 2026

The rapid proliferation of latency-sensitive and battery-constrained Internet-of-Things (IoT) applications has intensified the need for intelligent workload placement mechanisms across the Edge-Cloud …

Computer Science Preprint PDF DOI

Salca: A Sparsity-Aware Hardware Accelerator for Efficient Long-Context Attention Decoding

Wang Fan, Wei Cao, Xi Zha, Kedi Ma, MingQian Sun, Jialin Chen, Fengzhe Zhang, Fan Zhang · 2026

Long contexts improve capabilities of large language models but pose serious hardware challenges: compute and memory footprints grow linearly with sequence length. Particularly, the decoding phase con…

Computer Science Preprint PDF DOI

Compilation and Execution of an Embeddable YOLO-NAS on the VTA

Anthony Faure-Gignoux, Kevin Delmas, Adrien Gauffriau, Claire Pagetti · 2026

Deploying complex Convolutional Neural Networks (CNNs) on FPGA-based accelerators is a promising way forward for safety-critical domains such as aeronautics. In a previous work, we have explored the V…

Computer Science Preprint PDF DOI

Exact, Efficient, and Reliable Multi-Objective and Multi-Constrained IoT Workflow Scheduling in Edge-Hub-Cloud Cyber-Physical Systems

Andreas Kouloumpris, Georgios L. Stavrinides, Maria K. Michael, Theocharis Theocharides · 2026

Emerging IoT-enabled cyber-physical applications demand low-latency, energy-efficient, and reliable execution across resource-constrained edge devices with heterogeneous multicore processors and diver…

Computer Science Preprint PDF DOI

MEMCoder: Multi-dimensional Evolving Memory for Private-Library-Oriented Code Generation

Mofei Li, Taozhi Chen, Guowei Yang, Jia Li · 2026

Large Language Models (LLMs) excel at general code generation, but their performance drops sharply in enterprise settings that rely on internal private libraries absent from public pre-training corpor…

Computer Science Preprint PDF DOI

TACO: Efficient Communication Compression of Intermediate Tensors for Scalable Tensor-Parallel LLM Training

Man Liu, Xingchen Liu, Xingjian Tian, Bing Lu, Shengkay Lyu, Shengquan Yin, Wenjing Huang, Zheng Wei, Hairui Zhao, Guangming Tan, Dingwen Tao · 2026

Handling communication overhead in large-scale tensor-parallel training remains a critical challenge due to the dense, near-zero distributions of intermediate tensors, which exacerbate errors under fr…

Computer Science Preprint PDF DOI

DataClaw: An Autonomous Data Agent with Instant Messaging Integration

Huahang Li, Wentao Hu, Zhuoyue Wan, Chen Jason Zhang, Haoyang Li, Xiaoyong Wei · 2026

In daily life, there are many scenarios in which people need to tackle data-related tasks, such as filling out forms, analyzing Excel files, and visualizing data reports. However, the tools available for the…

Computer Science Preprint PDF DOI

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Jiaqi Li, Yang Zhao, Bin Sun, Yang Yu, Jian Chang, Lidong Zhai · 2026

Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perim…

Computer Science Preprint PDF DOI

SDSL-Solver: Scalable Distributed Sparse Linear Solvers for Large-Scale Interior Point Methods

Shaofeng Yang, Yunting Wang, Yingying Cheng, Fan Zhang, Xin He, Guangming Tan · 2026

The solution of sparse linear systems constitutes the dominant computational bottleneck in interior point methods (IPMs), frequently consuming over 70% of the total solution time. As optimization prob…

Computer Science Preprint PDF DOI

S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA

Minghan Li, Junjie Zou, Xinxuan Lv, Chao Zhang, Guodong Zhou · 2026

Retrieval-Augmented Generation (RAG) grounds language models in external evidence, but multi-hop question answering remains difficult because iterative pipelines must control what to retrieve next and…

Computer Science Preprint PDF DOI

StateScribe: Towards Accessible Change Awareness Across Real-World Revisits

Ruei-Che Chang, Xirui Jiang, Rosiana Natalie, Hao Chen, Vlad Roznyatovskiy, Jianzhong Zhang, Kang G. Shin, Ke Sun, Anhong Guo · 2026

Real-world environments evolve continuously, yet blind and low-vision (BLV) individuals often have limited access to understanding how they change over time. Unexpected or relocated objects, layout mo…

Computer Science Preprint PDF DOI

Spore: Efficient and Training-Free Privacy Extraction Attack on LLMs via Inference-Time Hybrid Probing

Yu Cui, Ruiqing Yue, Hang Fu, Sicheng Pan, Zhuoyu Sun, Baohan Huang, Haibin Zhang, Cong Zuo, Licheng Wang · 2026

With the wide adoption of personal AI assistants such as OpenClaw, privacy leakage in user interaction contexts with large language model (LLM) agents has become a critical issue. Existing privacy att…

Computer Science Preprint PDF DOI

ClusterFusion++: Expanding Cluster-Level Fusion to Full Transformer-Block Decoding

ChiHeng Jin, Hongche Yu, Xihui Chen · 2026

Large language model (LLM) decoding is latency-sensitive and often bottlenecked by fragmented operator execution and repeated off-chip materialization of intermediate tensors. Prior work expands fusio…

Computer Science Preprint PDF DOI

A Parametric Memory Head for Continual Generative Retrieval

Kidist Amde Mekonnen, Yubao Tang, Maarten de Rijke · 2026

Generative information retrieval (GenIR) consolidates retrieval into a single neural model that decodes document identifiers (docids) directly from queries. While this model-as-index paradigm offers a…

Computer Science Preprint PDF DOI

Ghost in the Agent: Redefining Information Flow Tracking for LLM Agents

Yuandao Cai, Wensheng Tang, Cheng Wen, Shengchao Qin · 2026

Autonomous Large Language Model (LLM) agents are increasingly deployed to conduct complex tasks by interacting with external tools, APIs, and memory stores. However, processing untrusted external data…

Computer Science Preprint PDF DOI

From Stateless Queries to Autonomous Actions: A Layered Security Framework for Agentic AI Systems

Kexin Chu · 2026

Agentic AI systems face security challenges that stateless large language models do not. They plan across extended horizons, maintain persistent memory, invoke external tools, and coordinate with peer…

Computer Science Preprint PDF DOI

An Agentic Framework for Intent Co-Creation in 6G NaaS: Architecture and Open-Source Model Evaluation

Kostis Trantzas, Besiana Agko, Christos Tranoris, Irene Denazi · 2026

6G network complexity necessitates high levels of autonomy, yet current intent-based systems struggle with ambiguous or incomplete human requests. This paper introduces an agent-based, intent-driven e…

Computer Science Preprint PDF DOI

Tessera: Secure, Near-Line-Rate Weight Streaming for UMA Edge Accelerators

Animan Naskar · 2026

Deploying proprietary Deep Neural Networks (DNNs) on commodity edge devices demands hardware-backed Digital Rights Management (DRM) capable of withstanding both software-level and physical adversaries…

Computer Science Preprint PDF DOI

Maximizing Memory-Level Parallelism via Integrated Stochastic Logic-in-Memory Architectures

Farzad Razi, Mehran Moghadam, Sercan Aygun, M. Hassan Najafi, Marc Riedel · 2026

Today's high-performance architectures are increasingly constrained by data movement latency and energy overhead, as the slowdown of single-core performance scaling coincides with the rise of highly d…
