Expertini Research Research

Browse Research Papers

46,580+ open-access research outputs.

โœ• Clear
๐Ÿ” avoidance learning ๐Ÿ“‚ Computer Science
Showing 46580 results for "avoidance learning" in Computer Science
Computer Science Preprint PDF DOI

HARPO: Hierarchical Agentic Reasoning for User-Aligned Conversational Recommendation

Subham Raj, Aman Vaibhav Jha, Mayank Anand, Sriparna Saha ยท 2026

Conversational recommender systems (CRSs) operate under incremental preference revelation, requiring systems to make recommendation decisions under uncertainty. While recent approaches particularly thโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Self-Distilled Reinforcement Learning for Co-Evolving Agentic Recommender Systems

Zongwei Wang, Min Gao, Hongzhi Yin, Junliang Yu, Tong Chen, Quoc Viet Hung Nguyen, Shazia Sadiq, Tianrui Li ยท 2026

Large language model-empowered agentic recommender systems (ARS) reformulate recommendation as a multi-turn interaction between a recommender agent and a user agent, enabling iterative preference elicโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Deep Reinforcement Learning for Cognitive Time-Division Joint SAR and Secure Communications

Mohamed-Amine Lahmeri, Ata Khalili, Yujiao Liu, Anke Schmeink, Robert Schober ยท 2026

Synthetic aperture radar (SAR) imaging can be exploited to enhance wireless communication performance through high-precision environmental awareness. However, integrating sensing and communication funโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

EncFormer: Secure and Efficient Transformer Inference over Encrypted Data

Yufan Zhu, Chao Jin, Khin Mi Mi Aung, Xiaokui Xiao ยท 2026

Transformer inference in machine-learning-as-a-service (MLaaS) raises privacy concerns for sensitive user inputs. Prior secure solutions that combine fully homomorphic encryption (FHE) and secure multโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Neuro-Oracle: A Trajectory-Aware Agentic RAG Framework for Interpretable Epilepsy Surgical Prognosis

Aizierjiang Aiersilan, Mohamad Koubeissi ยท 2026

Predicting post-surgical seizure outcomes in pharmacoresistant epilepsy is a clinical challenge. Conventional deep-learning approaches operate on static, single-timepoint pre-operative scans, omittingโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Auditing automated research assessment: an interpretable machine learning approach to validate funding criteria

Rafael P. Gouveia, Thiago C. Silva, Diego R. Amancio ยท 2026

This paper empirically examines the practical validity of the official evaluation criteria underpinning the Research Productivity (PQ) Grant framework, as governed by the Brazilian National Council foโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

DeepTutor: Towards Agentic Personalized Tutoring

Bingxi Zhao, Jiahao Zhang, Xubin Ren, Zirui Guo, Tianzhe Chu, Yi Ma, Chao Huang ยท 2026

Education represents one of the most promising real-world applications for Large Language Models (LLMs). However, conventional tutoring systems rely on static pre-training knowledge that lacks adaptatโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Dynamic Ranked List Truncation for Reranking Pipelines via LLM-generated Reference-Documents

Nilanjan Sinhababu, Soumedhik Bharati, Debasis Ganguly, Pabitra Mitra ยท 2026

Large Language Models (LLM) have been widely used in reranking. Computational overhead and large context lengths remain a challenging issue for LLM rerankers. Efficient reranking usually involves seleโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

XFED: Non-Collusive Model Poisoning Attack Against Byzantine-Robust Federated Classifiers

Israt Jahan Mouri, Muhammad Ridowan, Muhammad Abdullah Adnan ยท 2026

Model poisoning attacks pose a significant security threat to Federated Learning (FL). Most existing model poisoning attacks rely on collusion, requiring adversarial clients to coordinate by exchanginโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Confidence Without Competence in AI-Assisted Knowledge Work

Elena Eleftheriou, George Pallis, Marios Constantinides ยท 2026

Large Language Models (LLMs) are widely used by students, yet their tendency to provide fast and complete answers may discourage reflection and foster overconfidence. We examined how alternative LLM iโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning

Guiyao Tie, Jiawen Shi, Pan Zhou, Lichao Sun ยท 2026

Agent ecosystems increasingly rely on installable skills to extend functionality, and some skills bundle learned model artifacts as part of their execution logic. This creates a supply-chain risk thatโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

DialogueSidon: Recovering Full-Duplex Dialogue Tracks from In-the-Wild Dialogue Audio

Wataru Nakata, Yuki Saito, Kazuki Yamauchi, Emiru Tsunoo, Hiroshi Saruwatari ยท 2026

Full-duplex dialogue audio, in which each speaker is recorded on a separate track, is an important resource for spoken dialogue research, but is difficult to collect at scale. Most in-the-wild two-speโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

CCCE: A Continuous Code Calibration Engine for Autonomous Enterprise Codebase Maintenance via Knowledge Graph Traversal and Adaptive Decision Gating

Santhosh Kusuma Kumar Parimi ยท 2026

Enterprise software organizations face an escalating challenge in maintaining the integrity, security, and freshness of codebases that span hundreds of repositories, multiple programming languages, anโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

A Catalog of Data Errors

Divya Bhadauria, Hazar Harmouch, Felix Naumann, Divesh Srivastava, Lisa Ehrlinger ยท 2026

Data errors are widespread in real-world databases and severely impact downstream applications, such as machine learning pipelines or business analytics reports. Causes of such errors are manifold andโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning

Fatma Betul Gures, Tanya Nazaretsky, Seyed Parsa Neshaei, Tanja Kaser ยท 2026

Supporting students in developing diagnostic reasoning is a key challenge across educational domains. Novices often face cognitive biases such as premature closure and over-reliance on heuristics, andโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward

Weiyang Guo, Zesheng Shi, Zeen Zhu, Yuan Zhou, Min Zhang, Jing Li ยท 2026

Reinforcement Learning with Verifiable Rewards (RLVR) is an emerging paradigm that significantly boosts a Large Language Model's (LLM's) reasoning abilities on complex logical tasks, such as mathematiโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training

Chenhao Ye, Huaizheng Zhang, Mingcong Han, Baoquan Zhong, Xiang Li, Qixiang Chen, Xinyi Zhang, Weidong Zhang, Kaihua Jiang, Wang Zhang, He Sun, Wencong Xiao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau ยท 2026

Modern LLM reinforcement learning (RL) workloads require a highly efficient weight transfer system to scale training across heterogeneous computational resources. However, existing weight transfer appโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion

Akshit Jindal, Saket Anand, Chetan Arora, Vikram Goyal ยท 2026

Organisations with limited data and computational resources increasingly outsource model training to Machine Learning as a Service (MLaaS) providers, who adapt vision-language models (VLMs) such as CLโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

DIAURec: Dual-Intent Space Representation Optimization for Recommendation

Yu Zhang, Yiwen Zhang, Yi Zhang, Lei Sang ยท 2026

General recommender systems deliver personalized services by learning user and item representations, with the central challenge being how to capture latent user preferences. However, representations dโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Noise-Aware In-Context Learning for Hallucination Mitigation in ALLMs

Qixuan Huang, Khalid Zaman, Masashi Unoki ยท 2026

Auditory large language models (ALLMs) have demonstrated strong general capabilities in audio understanding and reasoning tasks. However, their reliability is still undermined by hallucination issues.โ€ฆ

Read Paper โ†’
โ† Prev Page 23 of 2329 Next โ†’