Expertini Research Research

Browse Research Papers

16,353+ open-access research outputs.

โœ• Clear
๐Ÿ” memory ๐Ÿ“‚ Computer Science
Showing 16353 results for "memory" in Computer Science
Computer Science Preprint PDF DOI

DPC: A Distributed Page Cache over CXL

Shai Bergman, Zhe Yang, Julien Eudine, Giorgio Negro, Onur Mutlu, Arash Tavakkol, Ji Zhang ยท 2026

Modern distributed file systems rely on uncoordinated, per node page caches that replicate hot data locally across the cluster. While ensuring fast local access, this architecture underutilizes aggregโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

CROWDio: A Practical Mobile Crowd Computing Framework with Developer-Oriented Design, Adaptive Scheduling, and Fault Resilience

Lakshani Manamperi, Disumi Pathirana, Thiwanka Pathirana, Nipun Premarathna, Kutila Gunasekara ยท 2026

Mobile Crowd Computing (MCdC) leverages the idle computational capacity of consumer smartphones to enable distributed task processing at scale; however, widespread real-world adoption remains constraiโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

POLAR-PIC: A Holistic Framework for Matrixized PIC with Co-Designed Compute, Layout, and Communication

Yizhuo Rao, Xingjian Cui, Shangzhi Pang, Jiabin Xie, Guangnan Feng, Jinhui Wei, Ziyan Zhang, Languang Gao, Zhenyu Wang, Zhiguang Chen, Yutong Lu ยท 2026

Particle-in-Cell (PIC) simulations are fundamental to plasma physics but often suffer from limited scalability due to particle-grid interaction bottlenecks and particle redistribution costs. Specificaโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Energy Efficient LSTM Accelerators for Embedded FPGAs through Parameterised Architecture Design

Chao Qian, Tianheng Ling, Gregor Schiele ยท 2026

Long Short-term Memory Networks (LSTMs) are a vital Deep Learning technique suitable for performing on-device time series analysis on local sensor data streams of embedded devices. In this paper, we pโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

A Simple Communication Scheme for Distributed Fast Multipole Methods

Srinath Kailasa ยท 2026

We present a simple hierarchical communication scheme for distributed Fast Multipole Methods (FMMs) based on MPI neighborhood collectives and uniform trees. The method targets the common case of extenโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Design Rules for Extreme-Edge Scientific Computing on AI Engines

Zhenghua Ma, G Abarajithan, Dimitrios Danopoulos, Olivia Weng, Francesco Restuccia, Ryan Kastner ยท 2026

Extreme-edge scientific applications use machine learning models to analyze sensor data and make real-time decisions. Their stringent latency and throughput requirements demand small batch sizes and rโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Heuristic Search Space Partitioning for Low-Latency Multi-Tenant Cloud Queries

Prashant Kumar Pathak, Chandra Biksheswaran Mouleeswaran, Rama Teja Repaka ยท 2026

Large-scale cloud security platforms must continuously query millions of structured cloud resource records distributed across thousands of tenant accounts. Broad, account-spanning queries saturate datโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

CHRONOS: A Hardware-Assisted Phase-Decoupled Framework for Secure Federated Learning in IoT

Hung Dang ยท 2026

We propose CHRONOS, a hardware-assisted framework that decouples the cryptographic setup required for private gradient aggregation from the active training phase. CHRONOS executes a once-per-epoch serโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Ocean: Fast Estimation-Based Sparse General Matrix-Matrix Multiplication on GPU

Yifan Li, Giulia Guidi ยท 2026

In computational science and data analytics, many workloads involve irregular and sparse computations that are inherently difficult to optimize for modern hardware. A key kernel is Sparse General Matrโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

A Comparative Analysis of ARM and x86-64 Laptop-Class Processors: Architecture, Assembly-Level Performance, and Energy Efficiency

Mustafa Mert Ozy{i}lmaz ยท 2026

ARM-based and x86-64 laptop processors differ not only in instruction-set design, but also in memory hierarchy, core organization, system integration, and power-management mechanisms. This study preseโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

High-Fidelity 3D Gaussian Human Reconstruction via Region-Aware Initialization and Geometric Priors

Yang Liu, Zhiyong Zhang ยท 2026

Real-time, high-fidelity 3D human reconstruction from RGB images is essential for interactive applications such as virtual reality and gaming, yet remains challenging due to the complex non-rigid defoโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Optimizing Branch Predictor for Graph Applications

Upasna, Venkata Kalyan Tavva ยท 2026

Real-world graph applications are generally larger than the size of the cache itself. Due to this reason, the memory hierarchy was identified as a key bottleneck by the earlier works. Undoubtedly, theโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Beyond Indistinguishability: Measuring Extraction Risk in LLM APIs

Ruixuan Liu, David Evans, Li Xiong ยท 2026

Indistinguishability properties such as differential privacy bounds or low empirically measured membership inference are widely treated as proxies to show a model is sufficiently protected against broโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

HybridGen: Efficient LLM Generative Inference via CPU-GPU Hybrid Computing

Mao Lin, Xi Wang, Guilherme Cox, Dong Li, Hyeran Jeon ยท 2026

As modern LLMs support thousands to millions of tokens, KV caches grow to hundreds of gigabytes, stressing memory capacity and bandwidth. Existing solutions, such as KV cache pruning and offloading, aโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Aligning Language Models for Lyric-to-Melody Generation with Rule-Based Musical Constraints

Hao Meng, Siyuan Zheng, Shuran Zhou, Qiangqiang Wang, Yang Song ยท 2026

Large Language Models (LLMs) show promise in lyric-to-melody generation, but models trained with Supervised Fine-Tuning (SFT) often produce musically implausible melodies with issues like poor rhythm โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems

Runhao Jiang, Renchi Yang, Donghao Wu ยท 2026

Recommender systems have advanced markedly over the past decade by transforming each user/item into a dense embedding vector with deep learning models. At industrial scale, embedding tables constituteโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization

Kosuke Matsushima, Yasuyuki Okoshi, Masato Motomura, Daichi Fujiki ยท 2026

Processing-in-Memory (PIM) architectures offer a promising solution to the memory bottlenecks in data-intensive machine learning, yet often overlook the growing challenge of activation memory footprinโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Proxics: an efficient programming model for far memory accelerators

Zikai Liu, Niels Pressel, Jasmin Schult, Roman Meier, Pengcheng Xu, Timothy Roscoe ยท 2026

The use of disaggregated or far memory systems such as CXL memory pools has renewed interest in Near-Data Processing (NDP): situating cores close to memory to reduce bandwidth requirements to and fromโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Optimizing Memory Allocation in Distributed Clusters with Predictive Modeling

Jonathan Bader, Edgar Blumenthal, Marten Eckardt, Justus Krebs, Joel Witzke, Xemena Wysokinska, Haci Ismail Aslan, Odej Kao ยท 2026

In modern distributed systems, efficient resource allocation is a vital aspect to maintain scalability, reduce operational costs, and ensure fast execution even across heterogeneous workloads. Predictโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints

Jianan Liu, Jing Yang, Xianyou Li, Weiran Yan, Yichao Wu, Penghao Liang, Mengwei Yuan ยท 2026

The rapid adoption of artificial intelligence (AI) and large language models (LLMs) is transforming financial analytics by enabling natural language interfaces for reporting, decision support, and autโ€ฆ

Read Paper โ†’
โ† Prev Page 6 of 818 Next โ†’