Expertini Research Research

Browse Research Papers

16,353+ open-access research outputs.

โœ• Clear
๐Ÿ” memory ๐Ÿ“‚ Computer Science
Showing 16353 results for "memory" in Computer Science
Computer Science Preprint PDF DOI

Remotely programming the weights of a spintronic neural network by a radiofrequency broadcast signal

M. Menshawy, D. Sanz-Hernandez, L. Mazza, V. Puliafito, G. Finocchio, A. Jenkins, R. Ferreira, L. Benetti, J. Grollier, F.A. Mizrahi ยท 2026

Selectively programming large number of non-volatile synaptic weights without compromising scalability is a key challenge for in-memory computing. Here, we demonstrate remote programming of synaptic wโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

DECOFFEE: Decentralized Reinforcement Learning for Time-critical Workload Offloading and Energy Efficiency across the Computing Continuum

Anastasios Giannopoulos, Sotirios Spantideas, Panagiotis Trakadas ยท 2026

The rapid proliferation of latency-sensitive and battery-constrained Internet-of-Things (IoT) applications has intensified the need for intelligent workload placement mechanisms across the Edge-Cloud โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Salca: A Sparsity-Aware Hardware Accelerator for Efficient Long-Context Attention Decoding

Wang Fan, Wei Cao, Xi Zha, Kedi Ma, MingQian Sun, Jialin Chen, Fengzhe Zhang, Fan Zhang ยท 2026

Long contexts improve capabilities of large language models but pose serious hardware challenges: compute and memory footprints grow linearly with sequence length. Particularly, the decoding phase conโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Compilation and Execution of an Embeddable YOLO-NAS on the VTA

Anthony Faure-Gignoux, Kevin Delmas, Adrien Gauffriau, Claire Pagetti ยท 2026

Deploying complex Convolutional Neural Networks (CNNs) on FPGA-based accelerators is a promising way forward for safety-critical domains such as aeronautics. In a previous work, we have explored the Vโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Exact, Efficient, and Reliable Multi-Objective and Multi-Constrained IoT Workflow Scheduling in Edge-Hub-Cloud Cyber-Physical Systems

Andreas Kouloumpris, Georgios L. Stavrinides, Maria K. Michael, Theocharis Theocharides ยท 2026

Emerging IoT-enabled cyber-physical applications demand low-latency, energy-efficient, and reliable execution across resource-constrained edge devices with heterogeneous multicore processors and diverโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

MEMCoder: Multi-dimensional Evolving Memory for Private-Library-Oriented Code Generation

Mofei Li, Taozhi Chen, Guowei Yang, Jia Li ยท 2026

Large Language Models (LLMs) excel at general code generation, but their performance drops sharply in enterprise settings that rely on internal private libraries absent from public pre-training corporโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

TACO: Efficient Communication Compression of Intermediate Tensors for Scalable Tensor-Parallel LLM Training

Man Liu, Xingchen Liu, Xingjian Tian, Bing Lu, Shengkay Lyu, Shengquan Yin, Wenjing Huang, Zheng Wei, Hairui Zhao, Guangming Tan, Dingwen Tao ยท 2026

Handling communication overhead in large-scale tensor-parallel training remains a critical challenge due to the dense, near-zero distributions of intermediate tensors, which exacerbate errors under frโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

DataClaw: An Autonomous Data Agent with Instant Messaging Integration

Huahang Li, Wentao Hu, Zhuoyue Wan, Chen Jason Zhang, Haoyang Li, Xiaoyong Wei ยท 2026

In daily life, there are many scenarios that people need to tackle data-related tasks, such as filling out forms, analyzing Excel files, and visualize data report. However, the tools available for theโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Jiaqi Li, Yang Zhao, Bin Sun, Yang Yu, Jian Chang, Lidong Zhai ยท 2026

Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perimโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

SDSL-Solver: Scalable Distributed Sparse Linear Solvers for Large-Scale Interior Point Methods

Shaofeng Yang, Yunting Wang, Yingying Cheng, Fan Zhang, Xin He, Guangming Tan ยท 2026

The solution of sparse linear systems constitutes the dominant computational bottleneck in interior point methods (IPMs), frequently consuming over 70% of the total solution time. As optimization probโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA

Minghan Li, Junjie Zou, Xinxuan Lv, Chao Zhang, Guodong Zhou ยท 2026

Retrieval-Augmented Generation (RAG) grounds language models in external evidence, but multi-hop question answering remains difficult because iterative pipelines must control what to retrieve next andโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

StateScribe: Towards Accessible Change Awareness Across Real-World Revisits

Ruei-Che Chang, Xirui Jiang, Rosiana Natalie, Hao Chen, Vlad Roznyatovskiy, Jianzhong Zhang, Kang G. Shin, Ke Sun, Anhong Guo ยท 2026

Real-world environments evolve continuously, yet blind and low-vision (BLV) individuals often have limited access to understanding how they change over time. Unexpected or relocated objects, layout moโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Spore: Efficient and Training-Free Privacy Extraction Attack on LLMs via Inference-Time Hybrid Probing

Yu Cui, Ruiqing Yue, Hang Fu, Sicheng Pan, Zhuoyu Sun, Baohan Huang, Haibin Zhang, Cong Zuo, Licheng Wang ยท 2026

With the wide adoption of personal AI assistants such as OpenClaw, privacy leakage in user interaction contexts with large language model (LLM) agents has become a critical issue. Existing privacy attโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

ClusterFusion++: Expanding Cluster-Level Fusion to Full Transformer-Block Decoding

ChiHeng Jin, Hongche Yu, Xihui Chen ยท 2026

Large language model (LLM) decoding is latency-sensitive and often bottlenecked by fragmented operator execution and repeated off-chip materialization of intermediate tensors. Prior work expands fusioโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

A Parametric Memory Head for Continual Generative Retrieval

Kidist Amde Mekonnen, Yubao Tang, Maarten de Rijke ยท 2026

Generative information retrieval (GenIR) consolidates retrieval into a single neural model that decodes document identifiers (docids) directly from queries. While this model-as-index paradigm offers aโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Ghost in the Agent: Redefining Information Flow Tracking for LLM Agents

Yuandao Cai, Wensheng Tang, Cheng Wen, Shengchao Qin ยท 2026

Autonomous Large Language Model (LLM) agents are increasingly deployed to conduct complex tasks by interacting with external tools, APIs, and memory stores. However, processing untrusted external dataโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

From Stateless Queries to Autonomous Actions: A Layered Security Framework for Agentic AI Systems

Kexin Chu ยท 2026

Agentic AI systems face security challenges that stateless large language models do not. They plan across extended horizons, maintain persistent memory, invoke external tools, and coordinate with peerโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

An Agentic Framework for Intent Co-Creation in 6G NaaS: Architecture and Open-Source Model Evaluation

Kostis Trantzas, Besiana Agko, Christos Tranoris, Irene Denazi ยท 2026

6G network complexity necessitates high levels of autonomy, yet current intent-based systems struggle with ambiguous or incomplete human requests. This paper introduces an agent-based, intent-driven eโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Tessera: Secure, Near-Line-Rate Weight Streaming for UMA Edge Accelerators

Animan Naskar ยท 2026

Deploying proprietary Deep Neural Networks (DNNs) on commodity edge devices demands hardware-backed Digital Rights Management (DRM) capable of withstanding both software-level and physical adversariesโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Maximizing Memory-Level Parallelism via Integrated Stochastic Logic-in-Memory Architectures

Farzad Razi, Mehran Moghadam, Sercan Aygun, M. Hassan Najafi, Marc Riedel ยท 2026

Today's high-performance architectures are increasingly constrained by data movement latency and energy overhead, as the slowdown of single-core performance scaling coincides with the rise of highly dโ€ฆ

Read Paper โ†’
โ† Prev Page 3 of 818 Next โ†’