Bei Zeng in Computer Science — Research Repository

Computer Science Preprint PDF DOI

A Reproducibility Study of LLM-Based Query Reformulation

Amin Bigdeli, Radin Hamidi Rad, Hai Son Le, Mert Incesu, Negar Arabzadeh, Charles L. A. Clarke, Ebrahim Bagheri · 2026

Large Language Models (LLMs) are now widely used for query reformulation and expansion in Information Retrieval, with many studies reporting substantial effectiveness gains. However, these results are…

Read Paper →

Computer Science Preprint PDF DOI

Self-Evolving Software Agents

Marco Robol, Paolo Giorgini · 2026

Autonomous agents can adapt their behaviour to changing environments, but remain bound to requirements, goals, and capabilities fixed at design time, preventing genuine software evolution. This paper …

Read Paper →

Computer Science Preprint PDF DOI

UnIte: Uncertainty-based Iterative Document Sampling for Domain Adaptation in Information Retrieval

Jongyoon Kim, Minseong Hwang, Seung-won Hwang · 2026

Unsupervised domain adaptation generalizes neural retrievers to an unseen domain by generating pseudo queries on target domain documents. The quality and efficiency of this adaptation critically depen…

Read Paper →

Computer Science Preprint PDF DOI

New Convex Programming Technique for Nash Social Welfare and Scheduling

Yuda Feng, Weijiang Hu, Shi Li · 2026

We propose a new convex programming relaxation for the weighted Nash social welfare (NSW) problem that achieves a matching $(e^{1/e}\approx 1.445)$-approximation via the rounding algorithm of Feng and…

Read Paper →

Computer Science Preprint PDF DOI

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Dun Zhang · 2026

Modern retrieval pipelines increasingly serve downstream consumers like retrieval-augmented generation (RAG) and autonomous agents that need more than a scalar relevance score. A reranker that only te…

Read Paper →

Computer Science Preprint PDF DOI

Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

Teng Chen, Sheng Xu, Feixiang Guo, Xiaoyu Wang, Qingqing Gu, Hongyan Li, Luo Ji · 2026

Unlike traditional fact-based retrieval, rationale-based retrieval typically necessitates cross-encoding of query-document pairs using large language models, incurring substantial computational costs.…

Read Paper →

Computer Science Preprint PDF DOI

Well-Conditioned Oblivious Perturbations in Linear Space

Shabarish Chenakkod, Micha{l} Derezinski, Xiaoyu Dong, Mark Rudelson · 2026

Perturbing a deterministic $n$-dimensional matrix with small Gaussian noise is a cornerstone of smoothed analysis of algorithms [Spielman and Teng, JACM 2004], as it reduces the condition number of th…

Read Paper →

Computer Science Preprint PDF DOI

ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression

Xiaojie Ke, Shuai Zhang, Liansheng Sun, Yongjin Wang, Hengjun Jiang, Xiangkun Liu, Cunxin Gu, Jian Xu, Guanjun Jiang · 2026

Large language model (LLM) based listwise reranking has emerged as the dominant paradigm for achieving state-of-the-art ranking effectiveness in information retrieval. However, its reliance on feeding…

Read Paper →

Computer Science Preprint PDF DOI

Three-Module SC-VAMP for LDPC-Coded Nonlinear Channels

Tadashi Wadayama, Takumi Takahashi · 2026

We propose a three-module extension of score-based VAMP (SC-VAMP) for signal recovery in nonlinear channels, where the received signal is obtained by applying a nonlinearity to a linear mixture of the…

Read Paper →

Computer Science Preprint PDF DOI

Bayesian experimental design: grouped geometric pooled posterior via ensemble Kalman methods

Huchen Yang, Xinghao Dong, Jinlong Wu · 2026

Bayesian experimental design (BED) for complex physical systems is often limited by the nested inference required to estimate the expected information gain (EIG) or its gradients. Each outer sample in…

Read Paper →

Computer Science Preprint PDF DOI

Federated Parameter-Efficient Adaptation for Interference Mitigation at the Wireless Edge

Evar Jones, Daniel J. Jakubisin, Sanmay Das · 2026

Dense wireless deployments face co-channel interference from heterogeneous sources that vary across base stations (gNBs in 5G). While centralized DNN-based approaches to interference mitigation have s…

Read Paper →

Computer Science Preprint PDF DOI

vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

Jayson Steffens · 2026

We present **vstash**, a local-first document memory system that combines vector similarity search with full-text keyword matching via Reciprocal Rank Fusion (RRF) and adaptive per-query IDF weighting…

Read Paper →

Computer Science Preprint PDF DOI

On the Information Velocity over a Tandem of Erasure Channels

Kai-Chun Chen, I-Hsiang Wang · 2026

Information velocity (IV) is a recently proposed notion to capture the speed of reliable information dissemination over a large-scale network. It is the speed at which reliable end-to-end communicatio…

Read Paper →

Computer Science Preprint PDF DOI

ARHN: Answer-Centric Relabeling of Hard Negatives with Open-Source LLMs for Dense Retrieval

Hyewon Choi, Jooyoung Choi, Hansol Jang, Hyun Kim, Chulmin Yun, ChangWook Jun, Stanley Jungkyu Choi · 2026

Neural retrievers are often trained on large-scale triplet data comprising a query, a positive passage, and a set of hard negatives. In practice, hard-negative mining can introduce false negatives and…

Read Paper →

Computer Science Preprint PDF DOI

Parameterized Algorithms and Complexity for Function Merging with Branch Reordering

Amir K. Goharshady, Kerim Kochekov, Tian Shu, Ahmed Khaled Zaher · 2026

Binary size reduction is an increasingly important optimization objective for compilers. One emerging technique is function merging, where multiple similar functions are merged into one, thereby elimi…

Read Paper →

Computer Science Preprint PDF DOI

Edge-Tilting Field Dynamics: Rapid Mixing at the Uniqueness Threshold and Optimal Mixing for Swendsen-Wang Dynamics

Xiaoyu Chen, Zhe Ju, Tianshun Miao, Yitong Yin, Xinyuan Zhang · 2026

We prove two results on the mixing times of Markov chains for two-spin systems. First, we show that the Glauber dynamics mixes in polynomial time for the Gibbs distributions of antiferromagnetic two-s…

Read Paper →

Computer Science Preprint PDF DOI

Subquadratic Counting via Perfect Marginal Sampling

Xiaoyu Chen, Zongchen Chen, Kuikui Liu, Xinyuan Zhang · 2026

We study the computational complexity of approximately computing the partition function of a spin system. Techniques based on standard counting-to-sampling reductions yield $\tilde{O}(n^2)$-time algor…

Read Paper →

Computer Science Preprint PDF DOI

Is RISC-V Ready for Machine Learning? Portable Gaussian Processes Using Asynchronous Tasks

Alexander Strack, Patrick Diehl, Dirk Pfluger · 2026

Gaussian processes are widely used in machine learning domains but remain computationally demanding, limiting their efficient scalability across diverse hardware platforms. The GPRat library targets t…

Read Paper →

Computer Science Preprint PDF DOI

Birdcast: Interest-aware BEV Multicasting for Infrastructure-assisted Collaborative Perception

Yanan Ma, Zhengru Fang, Yihang Tao, Yu Guo, Yiqin Deng, Xianhao Chen, Yuguang Fang · 2026

Vehicle-to-infrastructure collaborative perception (V2I-CP) leverages a high-vantage node to transmit supplementary information, i.e., bird's-eye-view (BEV) feature maps, to vehicles, effectively over…

Read Paper →

Computer Science Preprint PDF DOI

How unique are hallucinated citations offered by generative Artificial Intelligence models?

Dirk HR Spennemann · 2026

This paper investigates how generative AI produces and propagates hallucinated academic references, focusing on the recurring non-existent citation 'Education Governance and Datafication' attributed t…

Read Paper →

Browse Research Papers

A Reproducibility Study of LLM-Based Query Reformulation

Self-Evolving Software Agents

UnIte: Uncertainty-based Iterative Document Sampling for Domain Adaptation in Information Retrieval

New Convex Programming Technique for Nash Social Welfare and Scheduling

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

Well-Conditioned Oblivious Perturbations in Linear Space

ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression

Three-Module SC-VAMP for LDPC-Coded Nonlinear Channels

Bayesian experimental design: grouped geometric pooled posterior via ensemble Kalman methods

Federated Parameter-Efficient Adaptation for Interference Mitigation at the Wireless Edge

vstash: Local-First Hybrid Retrieval with Adaptive Fusion for LLM Agents

On the Information Velocity over a Tandem of Erasure Channels

ARHN: Answer-Centric Relabeling of Hard Negatives with Open-Source LLMs for Dense Retrieval

Parameterized Algorithms and Complexity for Function Merging with Branch Reordering

Edge-Tilting Field Dynamics: Rapid Mixing at the Uniqueness Threshold and Optimal Mixing for Swendsen-Wang Dynamics

Subquadratic Counting via Perfect Marginal Sampling

Is RISC-V Ready for Machine Learning? Portable Gaussian Processes Using Asynchronous Tasks

Birdcast: Interest-aware BEV Multicasting for Infrastructure-assisted Collaborative Perception

How unique are hallucinated citations offered by generative Artificial Intelligence models?

Browse by Category

Research Type

Publish Your Research