Expertini Research Research

Browse Research Papers

346,661+ open-access research outputs.

โœ• Clear
๐Ÿ” avoidance learning
Showing 346661 results for "avoidance learning"
AI & Data Science Preprint PDF DOI

Evaluating Risks in Weak-to-Strong Alignment: A Bias-Variance Perspective

Hamid Osooli, Kareema Batool, Rick Gentry, Tiasa Singha Roy, Ashwin Gupta, Anirudha Ramesh ยท 2026

Weak-to-strong alignment offers a promising route to scalable supervision, but it can fail when a strong model becomes confidently wrong on examples that lie in the weak teacher's blind spots. Understโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Zero Shot Coordination for Sparse Reward Tasks with Diverse Reward Shapings

Keenan Powell, Peihong Yu, Pratap Tokekar ยท 2026

Many Multi-Agent Reinforcement Learning (MARL) agents fail to adapt properly to cooperating with agents trained with the same objectives but different seeds, algorithms, or other training differences.โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Feasible-First Exploration for Constrained ML Deployment Optimization in Crash-Prone Hierarchical Search Spaces

Christian Lysenst{o}en ยท 2026

Deploying machine learning models under production constraints requires joint optimization over model family, quantization scheme, runtime backend, and serving configuration. This induces a hierarchicโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Beyond Accuracy: Benchmarking Cross-Task Consistency in Unified Multimodal Models

Weixing Wang, Liudvikas Zekas, Anton Hackl, Constantin Alexander Auga, Parisa Shahabinejad, Jona Otholt, Antonio Rueda-Toicen, Gerard de Melo ยท 2026

Unified Multimodal Models (uMMs) aim to support both visual understanding and visual generation within a shared representation. However, existing evaluation protocols assess these two capabilities indโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

Joshua Sherwood, Ben Aybar, Benjamin Kaplan ยท 2026

Forecasting when AI systems will become capable of meaningfully accelerating AI research is a central challenge for AI safety. Existing benchmarks measure broad capability growth, but may not provide โ€ฆ

Read Paper โ†’
Biology & Life Sciences Preprint PDF DOI

Learning biophysical models of gene regulation with probability flow matching

Suryanarayana Maddu, Victor Chardes, Michael J. Shelley ยท 2026

Cellular differentiation is governed by gene regulatory networks, the high-dimensional stochastic biochemical systems that determine the transcriptional landscape and mediate cellular responses to sigโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Spark Policy Toolkit: Semantic Contracts and Scalable Execution for Policy Learning in Spark

Zeyu Bai ยท 2026

Custom policy-learning pipelines in Spark fail for two coupled systems reasons: rowwise Python execution makes inference impractical, and driver-side candidate materialization makes split search fragiโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Adoption of TikTok as a Learning Tool in Physical Education: Evidence from the Philippines

Vanessa B. Sibug, Jan Henry B. Sunga, Emerson Q. Fernando, Roe Vincent S. Ovejas, Arjan Gil S. Mendoza, Trisha Anne A. Onofre, Agnes R. Regala, John Paul P. Miranda ยท 2026

This study examines the factors that influence the adoption of TikTok as a learning tool for physical education (PE)-related content among tertiary students in the Philippines. The study applies the Tโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Hierarchies of No-regret Algorithms

R. Xu, E. Yachbes, J. Zhang ยท 2026

Our paper studies the setting of players using no-regret algorithms in various two-player games. We address whether having stronger regret guarantees or playing against an opponent with weaker regret โ€ฆ

Read Paper โ†’
Biology & Life Sciences Preprint PDF DOI

Equation Learning for multiscale models of infectious diseases

James W. G. Doran, Cameron A. Smith, Christian A. Yates, Ruth Bowness ยท 2026

Tuberculosis (TB) is an airborne disease caused by the pathogen Mycobacterium tuberculosis. In 2023, according to the World Health Organization, it ''probably'' replaced COVID-19 as the leading cause โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Null Measurability at the Symmetrization Interface in VC Learning

Dhruv Gupta ยท 2026

Recent work revisiting measurability in the fundamental theorem of statistical learning imposes Borel measurability of ghost-gap suprema. We show that, at the one-sided ghost-gap interface actually usโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Coasting Through Class: Learning Opportunity Loss from Practice Avoidance During Individual Seatwork

Ashish Gurung, Jordan Gutterman, Danielle R. Thomas, Mingyu Feng, Vincent Aleven, Kenneth R. Koedinger ยท 2026

Measures of disengagement provide insights into unproductive use of learning opportunities. Although measures of active disengagement, such as gaming the system and mind-wandering, are well studied, lโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

Dan Shi, Zhuowen Han, Simon Ostermann, Renren Jin, Josef van Genabith, Deyi Xiong ยท 2026

Reinforcement learning (RL)-based post-training often improves the reasoning performance of large language models (LLMs) beyond the training domain, while supervised fine-tuning (SFT) frequently leadsโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Sparse Personalized Text Generation with Multi-Trajectory Reasoning

Bo Ni, Haowei Fu, Qinwen Ge, Franck Dernoncourt, Samyadeep Basu, Nedim Lipka, Seunghyun Yoon, Yu Wang, Nesreen K. Ahmed, Subhojyoti Mukherjee, Puneet Mathur, Ryan A. Rossi, Tyler Derr ยท 2026

As Large Language Models (LLMs) advance, personalization has become a key mechanism for tailoring outputs to individual user needs. However, most existing methods rely heavily on dense interaction hisโ€ฆ

Read Paper โ†’
Education Preprint PDF DOI

Bridging the Quantum Divide: A Learning-Centric Quantum Hackathon for Underrepresented Students (Extended Version)

Fahimeh Bayeh, Linh Dinh, Dongho Lee, Scott Wesley ยท 2026

This paper describes the design and implementation of a two-day quantum hackathon for underrepresented high school students in Nova Scotia, Canada. The first day of the hackathon is spent introducing โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases

Jun Li, Mingxuan Liu, Jiazhen Pan, Che Liu, Wenjia Bai, Cosmin I. Bercea, Julia A. Schnabel ยท 2026

Clinical abnormality grounding for rare diseases is often hindered by data scarcity, making supervised fine-tuning impractical and single-pass inference highly unstable. We propose Dynamic Decision Leโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

CoreFlow: Low-Rank Matrix Generative Models

Dongze Wu, Linglingzhi Zhu, Yao Xie ยท 2026

Learning matrix-valued distributions from high-dimensional and possibly incomplete training data is challenging: ambient-space generative modeling is computationally expensive and statistically fragilโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Compute Aligned Training: Optimizing for Test Time Inference

Adam Ousherovitch, Ambuj Tewari ยท 2026

Scaling test-time compute has emerged as a powerful mechanism for enhancing Large Language Model (LLM) performance. However, standard post-training paradigms, Supervised Fine-Tuning (SFT) and Reinforcโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

ViPO: Visual Preference Optimization at Scale

Ming Li, Jie Wu, Justin Cui, Xiaojie Li, Rui Wang, Chen Chen ยท 2026

While preference optimization is crucial for improving visual generative models, how to effectively scale this paradigm remains largely unexplored. Current open-source preference datasets contain confโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization

Xinxin Liu, Ming Li, Zonglin Lyu, Yuzhang Shang, Chen Chen ยท 2026

Human visual preferences are inherently multi-dimensional, encompassing aesthetics, detail fidelity, and semantic alignment. However, existing datasets provide only single, holistic annotations, resulโ€ฆ

Read Paper โ†’
โ† Prev Page 26 of 17334 Next โ†’