Expertini Research Research

Browse Research Papers

346,661+ open-access research outputs.

โœ• Clear
๐Ÿ” avoidance learning
Showing 346661 results for "avoidance learning"
AI & Data Science Preprint PDF DOI

Toward a Functional Geometric Algebra for Natural Language Semantics

James Pustejovsky ยท 2026

Distributional and neural approaches to natural language semantics have been built almost exclusively on conventional linear algebra: vectors, matrices, tensors, and the operations that accompany themโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

TSN-Affinity: Similarity-Driven Parameter Reuse for Continual Offline Reinforcement Learning

Dominik Zurek, Kamil Faber, Marcin Pietron, Pawe{l} Gajewski, Roberto Corizzo ยท 2026

Continual offline reinforcement learning (CORL) aims to learn a sequence of tasks from datasets collected over time while preserving performance on previously learned tasks. This setting corresponds tโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Three Models of RLHF Annotation: Extension, Evidence, and Authority

Steve Coyne ยท 2026

Preference-based alignment methods, most prominently Reinforcement Learning with Human Feedback (RLHF), use the judgments of human annotators to shape large language model behaviour. However, the normโ€ฆ

Read Paper โ†’
Earth & Environmental Sciences Preprint PDF DOI

Observation-Guided Neural Surrogate Learning for Scientific Simulation Emulation: A Single-Gauge Flood-Inundation Proof of Concept

Marzieh Alireza Mirhoseini ยท 2026

We present an observation-guided neural surrogate-learning framework for scientific simulation emulation, demonstrated on urban flood-inundation mapping. The framework combines LISFLOOD-FP hydrodynamiโ€ฆ

Read Paper โ†’
Physics Preprint PDF DOI

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud, Niyaz R. Beysengulov, Daniel C. Cole, Alejandro Gomez Frieiro, Elena O. Glen, Hao Hsu, Gang Huang, Raymond Jow, Greshma Shaji, Tom Lubowe, Ligeng Zhu, Luis Mantilla Calderon, Nicola Pancotti, Joel Pendleton, Brandon Severin, Charles Etienne Staub, Sara Sussman, Antti Vepsalainen, Neel Rajeshbhai Vora, Yilun Xu, Varinia Bernales, Daniel Bowring, Elica Kyoseva, Ivan Rungger, Giulia Semeghini, Sam Stanwyck, Timothy Costa, Alan Aspuru-Guzik, Krysta Svore ยท 2026

Quantum computing calibration depends on interpreting experimental data, and calibration plots provide the most universal human-readable representation for this task, yet no systematic evaluation exisโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient

Shuning Shang, Hubert Strauss, Stanley Wei, Sanjeev Arora, Noam Razin ยท 2026

Training language models via reinforcement learning often relies on imperfect proxy rewards, since ground truth rewards that precisely define the intended behavior are rarely available. Standard metriโ€ฆ

Read Paper โ†’
Physics Preprint PDF DOI

Learning Neural Operator Surrogates for the Black Hole Accretion Code

Matthias Nagele, Cedric Bos, Chester Tan, Christian M. Fromm, Ingo Scholtes, Karl Mannheim ยท 2026

General-relativistic magnetohydrodynamic (GR-MHD) simulations are essential for studying black hole accretion, relativistic jets, and magnetic reconnection, yet their computational cost severely limitโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Investigation into In-Context Learning Capabilities of Transformers

Rushil Chandrupatla, Leo Bangayan, Sebastian Leng, Arya Mazumdar ยท 2026

Transformers have demonstrated a strong ability for in-context learning (ICL), enabling models to solve previously unseen tasks using only example input output pairs provided at inference time. While โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions

An Nguyen, Hoang Nguyen, Phuong Le, Hung Pham, Cuong Do, Laurent El Ghaoui ยท 2026

We study city-scale control of electric-vehicle (EV) ride-hailing fleets where dispatch, repositioning, and charging decisions must respect charger and feeder limits under uncertain, spatially correlaโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Model-agnostic information transfer and fusion for classification with label noise

Zhu Guojun, Zhang Sanguo, Ren Mingyang ยท 2026

Label noise presents a fundamental challenge in modern machine learning, especially when large-scale datasets are generated via automated processes. An increasingly common and important data paradigm,โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Occam's Razor is Only as Sharp as Your ELBO

Ethan Harvey, Michael C. Hughes ยท 2026

The marginal likelihood, also known as the evidence, is regarded as a mathematical embodiment of Occam's razor, enabling model selection that avoids overfitting. The evidence lower bound (ELBO) objectโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

TrialCalibre: A Fully Automated Causal Engine for RCT Benchmarking and Observational Trial Calibration

Amir Habibdoust, Xing Song ยท 2026

Real-world evidence (RWE) studies that emulate target trials increasingly inform regulatory and clinical decisions, yet residual, hard-to-quantify biases still limit their credibility. The recently prโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Hands-on PDC in Undergraduate Computing Education

Hala ElAarag, Anas Gamal Aly ยท 2026

Parallel and Distributed Computing (PDC) is a critical yet conceptually challenging area of the undergraduate computer science curriculum. While students often encounter these concepts in theory, few โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

MAIC-UI: Making Interactive Courseware with Generative UI

Shangqing Tu, Yanjia Li, Keyu Chen, Sichen Zhang, Jifan Yu, Daniel Zhang-Li, Lei Hou, Juanzi Li, Yu Zhang, Huiqin Liu ยท 2026

Creating interactive STEM courseware traditionally requires HTML/CSS/JavaScript expertise, leaving barriers for educators. While generative AI can produce HTML codes, existing tools generate static prโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Improving Diversity in Black-box Few-shot Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Kien Do, Sunil Gupta ยท 2026

Knowledge distillation (KD) is a well-known technique to effectively compress a large network (teacher) to a smaller network (student) with little sacrifice in performance. However, most KD methods reโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Diverse Image Priors for Black-box Data-free Knowledge Distillation

Tri-Nhan Vo, Dang Nguyen, Trung Le, Kien Do, Sunil Gupta ยท 2026

Knowledge distillation (KD) represents a vital mechanism to transfer expertise from complex teacher networks to efficient student models. However, in decentralized or secure AI ecosystems, privacy regโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning

Yixuan Huang, Bowen Li, Vaibhav Saxena, Yichao Liang, Utkarsh Aashu Mishra, Liang Ji, Lihan Zha, Jimmy Wu, Nishanth Kumar, Sebastian Scherer, Danfei Xu, Tom Silver ยท 2026

Robotic systems that interact with the physical world must reason about kinematic and dynamic constraints imposed by their own embodiment, their environment, and the task at hand. We introduce KinDER,โ€ฆ

Read Paper โ†’
Physics Preprint PDF DOI

Homogeneous Stellar Parameters from Heterogeneous Spectra with Deep Learning

Jeff Shen, Joshua S. Speagle, Shirley Ho ยท 2026

Large-scale spectroscopic surveys have collectively observed millions of stars across the Milky Way, but each derives stellar labels using independent pipelines with distinct modelling assumptions, inโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Subliminal Steering: Stronger Encoding of Hidden Signals

George Morgulis, John Hewitt ยท 2026

Subliminal learning describes a student language model inheriting a behavioral bias by fine-tuning on seemingly innocuous data generated by a biased teacher model. Prior work has begun to characterizeโ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

EOS-Bench: A Comprehensive Benchmark for Earth Observation Satellite Scheduling

Qian Yin, Jiaxing Li, Jiaqi Cheng, Qizhang Luo, Annalisa Riccardi, Abhijit Chatterjee, Rafael Vazquez, Carlo Novara, Michalis Mavrovouniotis, Ponnuthurai Nagaratnam Suganthan, Shengzhou Bai, Xiaoxuan Hu, Lining Xing, Ming Xu, Shuang Li, Zixuan Zheng, Xin Shen, Xiaoyu Chen, Yi Gu, Yanjie Song, Witold Pedrycz, Evan L. Kramer, Laio Oriel Seman, Cletah Shoko, Guohua Wu, Xinwei Wang ยท 2026

Earth observation satellite imaging scheduling is a challenging NP-hard combinatorial optimisation problem central to space mission operations. While next-generation agile Earth observation satellitesโ€ฆ

Read Paper โ†’
โ† Prev Page 19 of 17334 Next โ†’