Expertini Research Research

Browse Research Papers

346,661+ open-access research outputs.

โœ• Clear
๐Ÿ” avoidance learning
Showing 346661 results for "avoidance learning"
AI & Data Science Preprint PDF DOI

Biased Dreams: Limitations to Epistemic Uncertainty Quantification in Latent Space Models

Julia Berger, Bernd Frauenknecht, Sebastian Trimpe, Bastian Leibe ยท 2026

Model-Based Reinforcement Learning distinguishes between physical dynamics models operating on proprioceptive inputs and latent dynamics models operating on high-dimensional image observations. A promโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Beyond Fidelity: Semantic Similarity Assessment in Low-Level Image Processing

Runjie Wang, Weiling Chen, Tiesong Zhao, Chang Wen Chen ยท 2026

Low-level image processing has long been evaluated mainly from the perspective of visual fidelity. However, with the rise of deep learning and generative models, processed images may preserve perceptuโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Benchmarking PyCaret AutoML Against IndoBERT Fine-Tuning for Sentiment Analysis on Indonesian IKN Twitter Data

Mutia Alfi Mayzaroh, Dwi Fitria Ningsih, Nindi Destriani, Martin C.T. Manullang ยท 2026

This paper benchmarks a classical machine learning approach based on PyCaret AutoML against a deep learning approach based on IndoBERT fine-tuning for binary sentiment analysis of Indonesian-language โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Co-Writing with AI: An Empirical Study of Diverse Academic Writing Workflows

Silvia Bodei, Duncan P. Brumby, Katie Fisher, Jon Mella ยท 2026

Despite AI tools becoming increasingly embedded in academic practice, little is known about how university students integrate them into their writing processes. We examine how students engage with AI โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Wiki Dumps to Training Corpora: South Slavic Case

Mihailo Skoric ยท 2026

This paper presents a methodology for transforming raw Wikimedia dumps into quality textual corpora for seven South Slavic languages. The work is divided into two major phases. The first involves extrโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Benchmarking and Improving GUI Agents in High-Dynamic Environments

Enqi Liu, Liyuan Pan, Zhi Gao, Yan Yang, Chenrui Shi, Yang Liu, Jingrong Wu, Qing Li ยท 2026

Recent advancements in Graphical User Interface (GUI) agents have predominantly focused on training paradigms like supervised fine-tuning (SFT) and reinforcement learning (RL). However, the challenge โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Safe-Support Q-Learning: Learning without Unsafe Exploration

Yeeun Lim, Narim Jeong, Donghwan Lee ยท 2026

Ensuring safety during reinforcement learning (RL) training is critical in real-world applications where unsafe exploration can lead to devastating outcomes. While most safe RL methods mitigate risk tโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

CoRE: Concept-Reasoning Expansion for Continual Brain Lesion Segmentation

Qianqian Chen, Anglin Liu, Jingyang Zhang, Yudong Zhang ยท 2026

Accurate brain lesion segmentation in MRI is vital for effective clinical diagnosis and treatment planning. Due to high annotation costs and strict data privacy regulations, universal models require eโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Multi-action Tangled Program Graphs for Multi-task Reinforcement Learning with Continuous Control

Quentin Vacher (IETR), Nicolas Beuve (IETR), Mickael Dardaillon (IETR), Karol Desnos (IETR) ยท 2026

Over the past few decades, machine learning has been widely used to learn complex tasks. Reinforcement Learning (RL), inspired by human behavior, is a great example, as it involves developing specificโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Commit-Aware Learning-Based Test Case Prioritization for Continuous Integration

Lorenzo Abbondante, Gerardo Canfora ยท 2026

Regression testing in Continuous Integration (CI) pipelines is increasingly costly due to the growing size and execution frequency of test suites. Test Case Prioritization (TCP) mitigates this problemโ€ฆ

Read Paper โ†’
Medicine & Health Preprint PDF DOI

Unsupervised Physics-Informed Deep Learning for Dual-Energy CT Material Decomposition

Laura Hellwege, Johann Christopher Engster, Moritz Schaar, Thorsten M. Buzug, Maik Stille ยท 2026

Dual-energy computed tomography (DECT) enables material-specific imaging through acquisitions at two different X-ray energy spectra. Material decomposition from DECT data is an ill-posed inverse problโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

GraphPL: Leveraging GNN for Efficient and Robust Modalities Imputation in Patchwork Learning

Xingjian Hu, Zuoyu Yan, Jianhua Zhu, Liangcai Gao, Fei Wang, Tengfei Ma ยท 2026

Current research on distributed multi-modal learning typically assumes that clients can access complete information across all modalities, which may not hold in practice. In this paper, we explore patโ€ฆ

Read Paper โ†’
Physics Preprint PDF DOI

Proximity Ferroelectricity Driven by Mobile High-Miller-Index Domain Walls

Changming Ke, Shi Liu ยท 2026

Wurtzite ferroelectrics such as scandium-doped aluminum nitride (AlScN) are promising for next-generation memory because of their compatibility with semiconductor processes and strong spontaneous polaโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification

Hongfei Wu, Ruijian Han, Yancheng Yuan ยท 2026

Imbalanced classification remains a pervasive challenge in machine learning, particularly when minority samples are too scarce to provide a robust discriminative boundary. In such extreme scenarios, cโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

A Survey of Multi-Agent Deep Reinforcement Learning with Graph Neural Network-Based Communication

Valentin Cuzin-Rambaud (LIRIS, UCBL), Laetitia Matignon (LIRIS, UCBL), Maxime Morge (LIRIS, UCBL) ยท 2026

In multi-agent reinforcement learning (MARL), the integration of a communication mechanism, allowing agents to better learn to coordinate their actions and converge on their objectives by sharing infoโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Towards Robust Deep Learning-based Rumex Obtusifolius Detection from Drone Images

Fabian Dionys Schrag, Mehmet Ozgur Turkoglu, Konrad Schindler, Ralph Lukas Stoop ยท 2026

Domain adaptation (DA) addresses the challenge of transferring a machine learning model trained on a source domain to a target domain with a different data distribution. In this work, we study DA for โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

Li Ju, Junzhe Wang, Qi Zhang ยท 2026

Retrieval-Augmented Generation (RAG) models frequently produce answers grounded in parametric memory rather than the retrieved context, undermining the core promise of retrieval augmentation. A fundamโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

RCProb: Probabilistic Rule Extraction for Efficient Simplification of Tree Ensembles

Josue Obregon ยท 2026

Tree ensembles are widely used in industrial machine learning due to their strong predictive performance and efficient training procedures. However, as the number of trees in an ensemble grows, the reโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Learning from Medical Entity Trees: An Entity-Centric Medical Data Engineering Framework for MLLMs

Jianghang Lin, Haihua Yang, Deli Yu, Kai Wu, Kai Ye, Jinghao Lin, Zihan Wang, Yuhang Wu, Liujuan Cao ยท 2026

Multimodal Large Language Models (MLLMs) have shown transformative potential in medical applications, yet their performance is hindered by conventional data curation strategies that rely on coarse-graโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Optimization-Free Topological Sort for Causal Discovery via the Schur Complement of Score Jacobians

Rui Wu, Hong Xie ยท 2026

Continuous causal discovery typically couples representation learning with structural optimization via non-convex acyclicity penalties, which subjects solvers to local optima and restricts scalabilityโ€ฆ

Read Paper โ†’
โ† Prev Page 23 of 17334 Next โ†’