Jia Wang in Computer Science — Research Repository

Computer Science Preprint PDF DOI

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

Yanting Wang, Chenlong Yin, Ying Chen, Jinyuan Jia · 2026

Long-context large language models (LLMs)-for example, Gemini-3.1-Pro and Qwen-3.5-are widely used to empower many real-world applications, such as retrieval-augmented generation, autonomous agents, a…

Read Paper →

Computer Science Preprint PDF DOI

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

Muhammad Ihsan Al Hafiz, Artur Podobas · 2026

Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchron…

Read Paper →

Computer Science Preprint PDF DOI

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Ugo Dal Lago, Guido Fiorillo, Paolo Pistone · 2026

The problem of determining whether a probabilistic program terminates almost surely (i.e.~with probability one) is undecidable, and actually $\Pi^0_2$-complete. For this reason, a growing literature h…

Read Paper →

Computer Science Preprint PDF DOI

Distributed Santa Claus via Global Rounding

Tijn de Vos, Leo Wennmann, Malte Baumecker, Yannic Maus, Florian Schager · 2026

In this paper, we consider the Santa Claus problem in the CONGEST model. This NP-hard problem can be modeled as a bipartite graph of children and gifts where an edge indicates that a child desires a g…

Read Paper →

Computer Science Preprint PDF DOI

TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive Learning

Bowen Sun, Chaozhuo Li, Yaodong Yang, Yiwei Wang, Chaowei Xiao · 2026

Decompositional jailbreaks pose a critical threat to large language models (LLMs) by allowing adversaries to fragment a malicious objective into a sequence of individually benign queries that collecti…

Read Paper →

Computer Science Preprint PDF DOI

NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

Shiyao Peng, Qianhe Zheng, Zhuodi Hao, Zichen Tang, Rongjin Li, Qing Huang, Jiayu Huang, Jiacheng Liu, Yifan Zhu, Haihong E · 2026

Although precise recall is a core objective in Retrieval-Augmented Generation (RAG), a critical oversight persists in the field: improvements in retrieval performance do not consistently translate to …

Read Paper →

Computer Science Preprint PDF DOI

MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks

Jona te Lintelo, Lichao Wu, Marina Krcek, Sengim Karayalcin, Stjepan Picek · 2026

Mixture-of-Experts (MoE) architectures in Large Language Models (LLMs) have significantly reduced inference costs through sparse activation. However, this sparse activation paradigm also introduces ne…

Read Paper →

Computer Science Preprint PDF DOI

Why Self-Supervised Encoders Want to Be Normal

Yuval Domb · 2026

We develop a geometric and information-theoretic framework for encoder-decoder learning built on the Information Bottleneck (IB) principle. Recasting IB as a rate-distortion problem with Kullback-Leib…

Read Paper →

Computer Science Preprint PDF DOI

Libra: Accelerating Socket I/O via Programmable Selective Data Copying

Kairui Zhou, Shengkai Lin, Wei Zhang, Shizhen Zhao · 2026

Layer-7 (L7) proxies are critical to modern cloud-native systems, yet their performance is increasingly bottlenecked by copying entire payloads across the kernel-user boundary. Existing approaches red…

Read Paper →

Computer Science Preprint PDF DOI

SQuadGen: Generating Simple Quad Layouts via Chart Distance Fields

Youkang Kong, Yang Liu, Yue Dong, Xin Tong, Heung-Yeung Shum · 2026

3D shapes from scanning, reconstruction, or AI-generated content often lack simple quad mesh layouts -- critical for efficient editing and modeling. Existing quad-remeshing techniques typically produc…

Read Paper →

Computer Science Preprint PDF DOI

Graphify: Automated Synthesis of Type-Safe Graph Backends via $O(S)$ GraphQL-to-Gremlin Transpilation

Johannes Graf · 2026

Graph databases offer unparalleled flexibility for managing interconnected data, yet the lack of strict schema enforcement often leads to runtime uncertainties and complex query development. This pape…

Read Paper →

Computer Science Preprint PDF DOI

CI-Repair-Bench: A Repository-Aware Benchmark for Automated Patch Validation via CI Workflows

Rabeya Khatun Muna, Md Nakhla Rafi, Tse-Hsun (Peter) Chen · 2026

Continuous Integration (CI) enforces repository-level correctness through multi-stage workflows and is central to modern software development, yet diagnosing and repairing CI failures remains challeng…

Read Paper →

Computer Science Preprint PDF DOI

New Diameter Approximations via Distance Oracle Techniques

Yael Kirkpatrick, Liam Roditty, Richard Qi, Virginia Vassilevska Williams · 2026

Computing the diameter of a graph is a problem of great interest both in general algorithms research and specifically within fine-grained complexity, where it is a cornerstone hard problem. Recent wor…

Read Paper →

Computer Science Preprint PDF DOI

The Nesting Bird Box Problem is ER-complete: Sharp Hardness Results for the Hidden Set Problem

Lucas Meijer, Till Miltzow, Johanna Ockenfels, Milos Stojakovic · 2026

In the (Nesting) Bird Box Problem we are given a polygonal domain P and a number k and we want to know if there is a set B of k points inside P such that no two points in B can see each other. The und…

Read Paper →

Computer Science Preprint PDF DOI

On (In)approximability of MaxMin Independent Set Reconfiguration

Hung P. Hoang, Naoto Ohsaka, Rin Saito, Yuma Tamura · 2026

In the Independent Set Reconfiguration problem under the Token Addition/Removal rule, given a graph $G$ and two independent sets $I$ and $J$ of $G$, we want to transform $I$ into $J$ by adding and rem…

Read Paper →

Computer Science Preprint PDF DOI

TDD Governance for Multi-Agent Code Generation via Prompt Engineering

Tarlan Hasanli, Shahbaz Siddeeq, Bishwash Khanal, Pyry Kotilainen, Tommi Mikkonen, Pekka Abrahamsson · 2026

Large language models (LLMs) accelerate software development but often exhibit instability, non-determinism, and weak adherence to development discipline in unconstrained workflows. While test-driven …

Read Paper →

Computer Science Preprint PDF DOI

Recommendations for Efficient and Responsible LLM Adoption within Industrial Software Development

Krishna Ronanki, Beatriz Cabrero-Daniel, Tomas Herda, Stefan Sitkovich, Jennifer Horkoff, Christian Berger · 2026

Context: Large language models (LLMs) are observed to have a significant positive impact on various software engineering (SE) activities. With improved accessibility, the adoption of powerful LLMs in …

Read Paper →

Computer Science Preprint PDF DOI

Tatemae: Detecting Alignment Faking via Tool Selection in LLMs

Matteo Leonesi, Francesco Belardinelli, Flavio Corradini, Marco Piangerelli · 2026

Alignment faking (AF) occurs when an LLM strategically complies with training objectives to avoid value modification, reverting to prior preferences once monitoring is lifted. Current detection method…

Read Paper →

Computer Science Preprint PDF DOI

Differentially Private Contrastive Learning via Bounding Group-level Contribution

Kecen Li, Chen Gong, Zinan Lin, Tianhao Wang, Xiaokui Xiao · 2026

Differentially private (DP) contrastive learning aims to learn general-purpose representations from sensitive data, alleviating the privacy leakage concerns of organizations deploying or sharing embed…

Read Paper →

Computer Science Preprint PDF DOI

PiLLar: Matching for Pivot Table Schema via LLM-guided Monte-Carlo Tree Search

Yunjun Gao, Chuangyu Ouyang, Congcong Ge, Yifan Zhu · 2026

Pivot tables are ubiquitous in data lakes of modern data ecosystems, making accurate schema matching over pivot tables a key prerequisite for data integration. In this paper, we focus on matching for …

Read Paper →

Browse Research Papers

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Distributed Santa Claus via Global Rounding

TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive Learning

NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks

Why Self-Supervised Encoders Want to Be Normal

Libra: Accelerating Socket I/O via Programmable Selective Data Copying

SQuadGen: Generating Simple Quad Layouts via Chart Distance Fields

Graphify: Automated Synthesis of Type-Safe Graph Backends via $O(S)$ GraphQL-to-Gremlin Transpilation

CI-Repair-Bench: A Repository-Aware Benchmark for Automated Patch Validation via CI Workflows

New Diameter Approximations via Distance Oracle Techniques

The Nesting Bird Box Problem is ER-complete: Sharp Hardness Results for the Hidden Set Problem

On (In)approximability of MaxMin Independent Set Reconfiguration

TDD Governance for Multi-Agent Code Generation via Prompt Engineering

Recommendations for Efficient and Responsible LLM Adoption within Industrial Software Development

Tatemae: Detecting Alignment Faking via Tool Selection in LLMs

Differentially Private Contrastive Learning via Bounding Group-level Contribution

PiLLar: Matching for Pivot Table Schema via LLM-guided Monte-Carlo Tree Search

Browse by Category

Research Type

Publish Your Research