Brain in AI & Data Science — Research Repository

AI & Data Science Preprint PDF DOI

AesRM: Improving Video Aesthetics with Expert-Level Feedback

Yujin Han, Yujie Wei, Yefei He, Xinyu Liu, Tianle Li, Zichao Yu, Andi Han, Shiwei Zhang, Tingyu Weng, Difan Zou · 2026

Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. …

Read Paper →

AI & Data Science Preprint PDF DOI

Simulating clinical interventions with a generative multimodal model of human physiology

Guy Lutsker, Gal Sapir, Jordi Merino, Smadar Shilo, Anastasia Godneva, Eli Meirom, Shie Mannor, Hagai Rossman, Gal Chechik, Eran Segal · 2026

Understanding how human health changes over time, and why responses to interventions vary between individuals, remains a central challenge in medicine. Here we present HealthFormer, a decoder-only tra…

Read Paper →

AI & Data Science Preprint PDF DOI

GourNet: A CNN-Based Model for Mango Leaf Disease Detection

Ekram Alam, Jaydip Sanyal, Akhil Kumar Das, Arijit Bhattacharya, Farhana Sultana · 2026

Mango cultivation is crucial in the agricultural sector, significantly contributing to economic development and food security. However, diseases affecting mango leaves can significantly reduce both th…

Read Paper →

AI & Data Science Preprint PDF DOI

Privacy-Preserving Federated Learning via Differential Privacy and Homomorphic Encryption for Cardiovascular Disease Risk Modeling

Gaurang Sharma, Juha Pajula, Aada Illikainen, Markus Rautell, Noora Lipsonen, Petri Alhainen, Mika Hilvo · 2026

Protecting sensitive health data while enabling collaborative analysis is a central challenge in healthcare. Traditional machine learning (ML) requires institutions to pool anonymized patient records,…

Read Paper →

AI & Data Science Preprint PDF DOI

Beyond the Training Distribution: Mapping Generalization Boundaries in Neural Program Synthesis

Henrik Voigt, Michael Habeck, Joachim Giesen · 2026

Large-scale transformers achieve impressive results on program synthesis benchmarks, yet their true generalization capabilities remain obscured by data contamination and opaque training corpora. To ri…

Read Paper →

AI & Data Science Preprint PDF DOI

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Hanzhong Guo, Jie Wu, Jie Liu, Yu Gao, Zilyu Ye, Linxiao Yuan, Xionghui Wang, Yizhou Yu, Weilin Huang · 2026

While Reinforcement Learning from Human Feedback (RLHF) has become a pivotal paradigm for text-to-image generation, its application to image editing remains largely unexplored. A key bottleneck is the…

Read Paper →

AI & Data Science Preprint PDF DOI

From Coarse to Fine: Benchmarking and Reward Modeling for Writing-Centric Generation Tasks

Qingyu Ren, Tianjun Pan, Xingzhou Chen, Xuhong Wang · 2026

Large language models have achieved remarkable progress in text generation but still struggle with generative writing tasks. In terms of evaluation, existing benchmarks evaluate writing reward models …

Read Paper →

AI & Data Science Preprint PDF DOI

AdaBFL: Multi-Layer Defensive Adaptive Aggregation for Bzantine-Robust Federated Learning

Zehui Tang, Yuchen Liu, Feihu Huang · 2026

Federated learning (FL) is a popular distributed learning paradigm in machine learning, which enables multiple clients to collaboratively train models under the guidance of a server without exposing p…

Read Paper →

AI & Data Science Preprint PDF DOI

BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learning

Yizhou Wu, Shansong Wang, Yuheng Li, Mojtaba Safari, Mingzhe Hu, Chih-Wei Chang, Harini Veeraraghavan, Xiaofeng Yang · 2026

Brain MRI underpins a wide range of neuroscientific and clinical applications, yet most learning-based methods remain task-specific and require substantial labeled data. Here we show that a single sel…

Read Paper →

AI & Data Science Preprint PDF DOI

Energy-Efficient Plant Monitoring via Knowledge Distillation

Ilyass Moummad, Reda Bensaid, Kawtar Zaher, Herve Goeau, Jean-Christophe Lombardo, Joseph Salmon, Pierre Bonnet, Alexis Joly · 2026

Recent advances in large-scale visual representation learning have significantly improved performance in plant species and plant disease recognition tasks. However, state-of-the-art models, often base…

Read Paper →

AI & Data Science Preprint PDF DOI

Better Models, Faster Training: Sigmoid Attention for single-cell Foundation Models

Vijay Sadashivaiah, Georgios Dasoulas, Judith Mueller, Soumya Ghosh · 2026

Training stable biological foundation models requires rethinking attention mechanisms: we find that using sigmoid attention as a drop in replacement for softmax attention a) produces better learned re…

Read Paper →

AI & Data Science Preprint PDF DOI

InterPartAbility: Text-Guided Part Matching for Interpretable Person Re-Identification

Shakeeb Murtaza, Aryan Shukla, Rajarshi Bhattacharya, Maguelonne Heritier, Eric Granger · 2026

Text-to-image person re-identification (TI-ReID) relies on natural-language text description to retrieve top matching individuals from a large gallery of images. While recent large vision-language mod…

Read Paper →

AI & Data Science Preprint PDF DOI

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Wanyue Zhang, Wenxiang Wu, Wang Xu, Jiaxin Luo, Helu Zhi, Yibin Huang, Shuo Ren, Zitao Liu, Jiajun Zhang · 2026

Vision-language models (VLMs) have shown strong performance on static visual understanding, yet they still struggle with dynamic spatial reasoning that requires imagining how scenes evolve under egoce…

Read Paper →

AI & Data Science Preprint PDF DOI

ClawGym: A Scalable Framework for Building Effective Claw Agents

Fei Bai, Huatong Song, Shuang Sun, Daixuan Cheng, Yike Yang, Chuan Hao, Renyuan Li, Feng Chang, Yuan Wei, Ran Tao, Bryan Dai, Jian Yang, Wayne Xin Zhao · 2026

Claw-style environments support multi-step workflows over local files, tools, and persistent workspace states. However, scalable development around these environments remains constrained by the absenc…

Read Paper →

AI & Data Science Preprint PDF DOI

Improving Bias Correction Methods for Daily Rainfall Using a Markov Chain Approach

Danny Parsons, David Stern, Mouhamadou Bamba Sylla, James Musyoka, John Bagiliko, Lily Clements, John Mupuro, Denis Ndanguza · 2026

Accurate, localised rainfall information is essential for applications such as agricultural planning, climate risk assessment, and water resources management. Gridded climate products provide rainfall…

Read Paper →

AI & Data Science Preprint PDF DOI

Random Cloud: Finding Minimal Neural Architectures Without Training

Javier Gil Blazquez · 2026

I propose the \emph{Random Cloud} method, a training-free approach to neural architecture search that discovers minimal feedforward network topologies through stochastic exploration and progressive st…

Read Paper →

AI & Data Science Preprint PDF DOI

A Multi-Dataset Benchmark of Multiple Instance Learning for 3D Neuroimage Classification

Ethan Harvey, Dennis Johan Loevlie, Amir Ali Satani, Wansu Chen, David M. Kent, Michael C. Hughes · 2026

Despite being resource-intensive to train, 3D convolutional neural networks (CNNs) have been the standard approach to classify CT and MRI scans. Recent work suggests that deep multiple instance learni…

Read Paper →

AI & Data Science Preprint PDF DOI

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

Weihang Su, Hanwen Zhang, Qingyao Ai, Yiqun Liu · 2026

Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to i…

Read Paper →

AI & Data Science Preprint PDF DOI

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

Zhixin Han, Yanzhi Zhang, Chuyang Wei, Maohang Gao, Xiawei Yue, Kefei Chen, Yu Zhuang, Haoxiang Guan, Jiyan He, Jian Li, Yitong Duan, Yu Shi, Mengting Hu, Shuxin Zheng · 2026

Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is …

Read Paper →

AI & Data Science Preprint PDF DOI

FunFace: Feature Utility and Norm Estimation for Face Recognition

Ziga Babnik, Fadi Boutros, Naser Damer, Deepak Kumar Jain, Peter Peer, Vitomir Struc · 2026

Face Recognition (FR) is used in a variety of application domains, from entertainment and banking to security and surveillance. Such applications rely on the FR model to be robust and perform well in …

Read Paper →

Browse Research Papers

AesRM: Improving Video Aesthetics with Expert-Level Feedback

Simulating clinical interventions with a generative multimodal model of human physiology

GourNet: A CNN-Based Model for Mango Leaf Disease Detection

Privacy-Preserving Federated Learning via Differential Privacy and Homomorphic Encryption for Cardiovascular Disease Risk Modeling

Beyond the Training Distribution: Mapping Generalization Boundaries in Neural Program Synthesis

Leveraging Verifier-Based Reinforcement Learning in Image Editing

From Coarse to Fine: Benchmarking and Reward Modeling for Writing-Centric Generation Tasks

AdaBFL: Multi-Layer Defensive Adaptive Aggregation for Bzantine-Robust Federated Learning

BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learning

Energy-Efficient Plant Monitoring via Knowledge Distillation

Better Models, Faster Training: Sigmoid Attention for single-cell Foundation Models

InterPartAbility: Text-Guided Part Matching for Interpretable Person Re-Identification

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

ClawGym: A Scalable Framework for Building Effective Claw Agents

Improving Bias Correction Methods for Daily Rainfall Using a Markov Chain Approach

Random Cloud: Finding Minimal Neural Architectures Without Training

A Multi-Dataset Benchmark of Multiple Instance Learning for 3D Neuroimage Classification

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

FunFace: Feature Utility and Norm Estimation for Face Recognition

Browse by Category

Research Type

Publish Your Research