Boning Li in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

Prashant Kulkarni · 2026

Multi-turn prompt injection follows a known attack path -- trust-building, pivoting, escalation but text-level defenses miss covert attacks where individual turns appear benign. We show this attack pa…

Read Paper →

Computer Science Preprint PDF DOI

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Ugo Dal Lago, Guido Fiorillo, Paolo Pistone · 2026

The problem of determining whether a probabilistic program terminates almost surely (i.e.~with probability one) is undecidable, and actually $\Pi^0_2$-complete. For this reason, a growing literature h…

Read Paper →

Computer Science Preprint PDF DOI

AnTi-MiCS: Analytical Framework for Bounding Time in Embedded Mixed-Criticality Systems

Behnaz Ranjbar, Akash Kumar · 2026

In Mixed-Criticality (MC) systems, although the high Worst-Case Execution Time (WCET) serves as a conservative upper bound representing the task's maximum execution time under all conditions, obtainin…

Read Paper →

Computer Science Preprint PDF DOI

TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive Learning

Bowen Sun, Chaozhuo Li, Yaodong Yang, Yiwei Wang, Chaowei Xiao · 2026

Decompositional jailbreaks pose a critical threat to large language models (LLMs) by allowing adversaries to fragment a malicious objective into a sequence of individually benign queries that collecti…

Read Paper →

Computer Science Preprint PDF DOI

LLM-as-a-Judge for Human-AI Co-Creation: A Reliability-Aware Evaluation Framework for Coding

Md Faizul Ibne Amin, Yutaka Watanobe, Daniel M. Muepu, Haruto Suzuki, Kenta Nanaumi, Md Mostafizer Rahman · 2026

LLMs are increasingly employed both as judges for evaluating open-ended outputs and as co-creation partners in AI-assisted programming; yet rigorous evaluation in human-AI co-creation settings remains…

Read Paper →

Computer Science Preprint PDF DOI

The Synthetic Social Graph: Emergent Behavior in AI Agent Communities

Sungguk Cha, DongWook Kim · 2026

Large language model (LLM) agents are increasingly deployed in social settings, yet little is known about how they interact in open-ended environments. We present the first comprehensive sociological …

Read Paper →

Computer Science Preprint PDF DOI

On Coded Caching Systems with Decentralized Linear Coding Placement

Yinbin Ma, Daniela Tuninetti · 2026

Coded caching is a technique that leverages locally cached contents at the end users to reduce the network's peak-time communication load. Coded caching has been shown to achieve significant performan…

Read Paper →

Computer Science Preprint PDF DOI

Hot Fixing in the Wild

Carol Hanna, Karine Even-Mendoza, W.B. Langdon, Mar Zamorano Lopez, Justyna Petke, Federica Sarro · 2026

Despite the operational importance of hot fixes, large-scale evidence on how they reshape routine maintenance workflows, particularly in the era of autonomous coding agents, remains limited. We analys…

Read Paper →

Computer Science Preprint PDF DOI

DMRlib: Easy-coding and Efficient Resource Management for Job Malleability

Sergio Iserte, Rafael Mayo, Enrique S. Quintana-Orti, Antonio J. Pena · 2026

Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. H…

Read Paper →

Computer Science Preprint PDF DOI

Differentially Private Contrastive Learning via Bounding Group-level Contribution

Kecen Li, Chen Gong, Zinan Lin, Tianhao Wang, Xiaokui Xiao · 2026

Differentially private (DP) contrastive learning aims to learn general-purpose representations from sensitive data, alleviating the privacy leakage concerns of organizations deploying or sharing embed…

Read Paper →

Computer Science Preprint PDF DOI

SWE-Bench 5G: Benchmarking AI Coding Agents on Telecom Network Engineering Tasks

Jiao Chen, Jianhua Tang, Xiaotong Yang, Zuohong Lv · 2026

AI coding agents demonstrate strong performance on general-purpose software benchmarks. However, their ability to handle 5G network engineering tasks remains unexplored. We propose SWE-Bench~5G, the f…

Read Paper →

Computer Science Preprint PDF DOI

Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents

Hung Dang · 2026

Structured-workflow agents driven by large language models execute tool calls against sensitive external environments. We propose \codename, a telemetry-driven behavioral anomaly detection firewall. D…

Read Paper →

Computer Science Preprint PDF DOI

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents

Mengyao Du, Han Fang, Haokai Ma, Jiahao Chen, Kai Xu, Quanjun Yin, Ee-Chien Chang · 2026

Web agents have emerged as an effective paradigm for automating interactions with complex web environments, yet remain vulnerable to prompt injection attacks that embed malicious instructions into web…

Read Paper →

Computer Science Preprint PDF DOI

CoRE: A Fine-Grained Code Reasoning Benchmark Beyond Output Prediction

Jun Gao, Yun Peng, Qian Qiao, Changhai Zhou, Yuhua Zhou, Shiyang Zhang, Shichao Weng, Zhenchang Xing, Xiaoxue Ren · 2026

Despite strong performance on code generation tasks, it remains unclear whether large language models (LLMs) genuinely reason about code execution. Existing code reasoning benchmarks primarily evaluat…

Read Paper →

Computer Science Preprint PDF DOI

Safety Drift After Fine-Tuning: Evidence from High-Stakes Domains

Emaan Bilal Khan, Amy Winecoff, Miranda Bogen, Dylan Hadfield-Menell · 2026

Foundation models are routinely fine-tuned for use in particular domains, yet safety assessments are typically conducted only on base models, implicitly assuming that safety properties persist through…

Read Paper →

Computer Science Preprint PDF DOI

New Convex Programming Technique for Nash Social Welfare and Scheduling

Yuda Feng, Weijiang Hu, Shi Li · 2026

We propose a new convex programming relaxation for the weighted Nash social welfare (NSW) problem that achieves a matching $(e^{1/e}\approx 1.445)$-approximation via the rounding algorithm of Feng and…

Read Paper →

Computer Science Preprint PDF DOI

Jailbreaking Frontier Foundation Models Through Intention Deception

Xinhe Wang, Katia Sycara, Yaqi Xie · 2026

Large (vision-)language models exhibit remarkable capability but remain highly susceptible to jailbreaking. Existing safety training approaches aim to have the model learn a refusal boundary between s…

Read Paper →

Computer Science Preprint PDF DOI

Constructive Separations from Gate Elimination

Marco Carmosino, Ngu Dang, Tim Jackman · 2026

Gate elimination is the primary technique for proving explicit lower bounds against general Boolean circuits, including Li and Yang's state-of-the-art $3.1n - o(n)$ bound for affine dispersers (STOC 2…

Read Paper →

Computer Science Preprint PDF DOI

Scalable and Verifiable Federated Learning for Cross-Institution Financial Fraud Detection

Prajwal Panth, Nishant Nigam · 2026

The global financial ecosystem confronts a critical asymmetry: while fraud syndicates operate as borderless, distributed networks, banking institutions remain constrained by regulatory data silos, lim…

Read Paper →

Computer Science Preprint PDF DOI

Scalable LLM-based Coding of Dialogue in Healthcare Simulation: Balancing Coding Performance, Processing Time, and Environmental Impact

Kiyoshige Garces, Gloria Milena Fernandez-Nieto, Linxuan Zhao, Sachini Samaraweera, Dragan Gasevic, Roberto Martinez-Maldonado, Vanessa Echeverria · 2026

Research shows that dialogue, the interactive process through which participants articulate their thinking, plays a central role in constructing shared understanding, coordinating action, and shaping …

Read Paper →

Browse Research Papers

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

AnTi-MiCS: Analytical Framework for Bounding Time in Embedded Mixed-Criticality Systems

TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive Learning

LLM-as-a-Judge for Human-AI Co-Creation: A Reliability-Aware Evaluation Framework for Coding

The Synthetic Social Graph: Emergent Behavior in AI Agent Communities

On Coded Caching Systems with Decentralized Linear Coding Placement

Hot Fixing in the Wild

DMRlib: Easy-coding and Efficient Resource Management for Job Malleability

Differentially Private Contrastive Learning via Bounding Group-level Contribution

SWE-Bench 5G: Benchmarking AI Coding Agents on Telecom Network Engineering Tasks

Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents

CoRE: A Fine-Grained Code Reasoning Benchmark Beyond Output Prediction

Safety Drift After Fine-Tuning: Evidence from High-Stakes Domains

New Convex Programming Technique for Nash Social Welfare and Scheduling

Jailbreaking Frontier Foundation Models Through Intention Deception

Constructive Separations from Gate Elimination

Scalable and Verifiable Federated Learning for Cross-Institution Financial Fraud Detection

Scalable LLM-based Coding of Dialogue in Healthcare Simulation: Balancing Coding Performance, Processing Time, and Environmental Impact

Browse by Category

Research Type

Publish Your Research