Per Alexandersson in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes

Tianyuan Wu, Chaokun Chang, Lunxi Cao, Wei Gao, Wei Wang · 2026

Autonomous agents act through sandboxed containers and microVMs whose state spans filesystems, processes, and runtime artifacts. Checkpoint and restore (C/R) of this state is needed for fault toleranc…

Read Paper →

Computer Science Preprint PDF DOI

Energy-Aware Quantum-Enhanced Computing Continuum

Carlos J. Barrios H., Frederic Le Mouel, Oscar Carrillo · 2026

We discuss a Quantum-Enhanced Computing Continuum, a heterogeneous, hybrid architecture that integrates quantum processing units (QPUs) within an Edge-Cloud-HPC fabric. Promote sustainability by shift…

Read Paper →

Computer Science Preprint PDF DOI

From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation

Guang Yang, Xing Hu, Xiang Chen, Xin Xi · 2026

Multimodal large language models (MLLMs) are increasingly used to translate visual artifacts into code, from UI mockups into HTML to scientific plots into Python scripts. A circuit diagram can be view…

Read Paper →

Computer Science Preprint PDF DOI

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

Jin Xin Ng, Ori Livneh, Richard O'Grady, Josh Don, Peng Ding, Samuel Grossman, Luis Otero, Chris Kennelly, David Lo, Carlos Villavieja · 2026

Modern large multicore systems often run multiple workloads that share CPUs under schedulers such as Linux CFS. To keep CPUs busy, these schedulers load-balance runnable work, causing each workload to…

Read Paper →

Computer Science Preprint PDF DOI

Can We Volunteer Out of the Peer Review Crisis?

Theo Tang, Toby Handfield, Julian Garcia · 2026

The volume of scientific manuscripts is growing faster than the capacity to evaluate them, yet the institutions that govern peer review have remained largely unchanged. The result is a widening mismat…

Read Paper →

Computer Science Preprint PDF DOI

Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

Jiaju Chen, Chongming Gao, Chenxiao Fan, Haoyan Liu, Qingpeng Cai, Peng Jiang, Xiangnan He · 2026

Large language model (LLM)-based generative list-wise recommendation has advanced rapidly, but decoding remains sequential and thus latency-prone. To accelerate inference without changing the target d…

Read Paper →

Computer Science Preprint PDF DOI

Towards an Ethical AI Curriculum: A Pan-African, Culturally Contextualized Framework for Primary and Secondary Education

Abidemi Kuburat Adedeji, Franklin Tchakounte, Sulaiman Oluwasegun Yusuff · 2026

Artificial intelligence (AI) is now embedded in educational, civic, and economic systems worldwide. For African primary and secondary education, this creates a double imperative: to prepare a young po…

Read Paper →

Computer Science Preprint PDF DOI

Solving Hypergraph Laplacian Systems in Almost-Linear Time

Yuichi Yoshida · 2026

For a connected weighted hypergraph, we give a randomized almost-linear-time solver for the Poisson problem for the cut-based hypergraph Laplacian in the natural input size $P=\sum_{e\in E}|e|$, the s…

Read Paper →

Computer Science Preprint PDF DOI

From Unstructured to Structured: LLM-Guided Attribute Graphs for Entity Search and Ranking

Yilun Zhu, Nikhita Vedula, Shervin Malmasi · 2026

Entity search, i.e., finding the most similar entities to a query entity, faces unique challenges in e-commerce, where product similarity varies across categories and contexts. Traditional embedding-b…

Read Paper →

Computer Science Preprint PDF DOI

RCW-CIM: A Digital CIM-based LLM Accelerator with Read-Compute/Write

Yan-Cheng Guo, Tian-Sheuan Chang, Jian-Wei Su · 2026

Digital computing-in-memory (DCIM) has emerged as a promising solution for large language model (LLM) acceleration by minimizing data transfers between external DRAM and on-chip accelerators while mai…

Read Paper →

Computer Science Preprint PDF DOI

Static Attribution of Android Residential Proxy Malware Using Graph Kernels

Peter Clark, Yong Guan, Zhonghao Liao · 2026

Android residential proxy applications represent a growing class of potentially-unwanted programs (PUPs) that covertly route third-party traffic through end-user devices, enabling ad fraud, credential…

Read Paper →

Computer Science Preprint PDF DOI

Predicting Upcoming Stuttering Events from Three-Second Audio: Stratified Evaluation Reveals Severity-Selective Precursors, and the Model Deploys Fully On-Device

Nazar Kozak · 2026

Audio-based stuttering systems to date have been trained for detection -- what disfluency is present now -- leaving prediction, the capability needed for closed-loop intervention, unstudied at deploya…

Read Paper →

Computer Science Preprint PDF DOI

From Prompt to Physical Actuation: Holistic Threat Modeling of LLM-Enabled Robotic Systems

Neha Nagaraja, Hayretdin Bahsi, Carlo R. da Cunha · 2026

As large language models are integrated into autonomous robotic systems for task planning and control, compromised inputs or unsafe model outputs can propagate through the planning pipeline to physica…

Read Paper →

Computer Science Preprint PDF DOI

FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving

Minghe Wang, Trever Schirmer, Mohammadreza Malekabbasi, David Bermbach · 2026

Mixture-of-Experts (MoE) models offer high capacity with efficient inference cost by activating a small subset of expert models per input. However, deploying MoE models requires all experts to reside …

Read Paper →

Computer Science Preprint PDF DOI

Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel

Yiqi Liu, Noelle Crawford, Michael Wang, Jilong Xue, Jian Huang · 2026

To overcome the well-known memory bottleneck of AI chips, 3D stacked architectures that employ advanced packaging technology with high-density through-silicon vias (TSVs) pins have proven to be a prom…

Read Paper →

Computer Science Preprint PDF DOI

MISES: Minimal Information Sufficiency for Effective Service

Joss Armstrong · 2026

Category-based coordination mechanisms allocate resources by mapping a declared service category to a fixed resource profile, without observing individual demand types. We establish three results for …

Read Paper →

Computer Science Preprint PDF DOI

LLM-Guided Runtime Parameter Optimization for Energy-Efficient Model Inference

Katelyn Crumpacker, Dimitrios Nikolopoulos · 2026

Large Language Models (LLMs) have become an integral part of many real-world workflows. However, LLMs consume a lot of energy, which becomes a large concern in the scale of the demand for these tools.…

Read Paper →

Computer Science Preprint PDF DOI

COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training

Akhmed Sakip, Erland Hilman Fuadi, Omar Sayedelahl, Zonghang Li, Jianshu She, Alham Fikri Aji, Steve Liu, Eric Xing, Qirong Ho · 2026

Training large language models requires jointly configuring two interdependent aspects of the system: the global batch size, which governs statistical efficiency, and the 3D parallelism strategy, whic…

Read Paper →

Computer Science Preprint PDF DOI

When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models

Dongxin Guo, Jikun Wu, Siu Ming Yiu · 2026

Large reasoning models such as DeepSeek-R1 and OpenAI o1 generate extended chains of thought spanning thousands of tokens, yet their integration with retrieval-augmented generation (RAG) remains funda…

Read Paper →

Computer Science Preprint PDF DOI

DMRlib: Easy-coding and Efficient Resource Management for Job Malleability

Sergio Iserte, Rafael Mayo, Enrique S. Quintana-Orti, Antonio J. Pena · 2026

Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. H…

Read Paper →

Browse Research Papers

Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes

Energy-Aware Quantum-Enhanced Computing Continuum

From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

Can We Volunteer Out of the Peer Review Crisis?

Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

Towards an Ethical AI Curriculum: A Pan-African, Culturally Contextualized Framework for Primary and Secondary Education

Solving Hypergraph Laplacian Systems in Almost-Linear Time

From Unstructured to Structured: LLM-Guided Attribute Graphs for Entity Search and Ranking

RCW-CIM: A Digital CIM-based LLM Accelerator with Read-Compute/Write

Static Attribution of Android Residential Proxy Malware Using Graph Kernels

Predicting Upcoming Stuttering Events from Three-Second Audio: Stratified Evaluation Reveals Severity-Selective Precursors, and the Model Deploys Fully On-Device

From Prompt to Physical Actuation: Holistic Threat Modeling of LLM-Enabled Robotic Systems

FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving

Exploring the Efficiency of 3D-Stacked AI Chip Architecture for LLM Inference with Voxel

MISES: Minimal Information Sufficiency for Effective Service

LLM-Guided Runtime Parameter Optimization for Energy-Efficient Model Inference

COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training

When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models

DMRlib: Easy-coding and Efficient Resource Management for Job Malleability

Browse by Category

Research Type

Publish Your Research