Felipe Belem — Research Repository

AI & Data Science Preprint PDF DOI

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin · 2026

The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…

Read Paper →

Sociology & Anthropology Preprint PDF DOI

Universal statistical laws governing culinary design

Ganesh Bagler, Gopal Krishna Tewari, Aditya Raj Yadav, Akshat Singh, Pranay Bansal, Ujjval Dargar, Mansi Goel, Madhvi Kumari Sinha · 2026

Cooking is a cultural expression of human creativity that transcends geography and time through the orchestration of ingredients and techniques, much like languages do through words and syntax. Yet, b…

Read Paper →

AI & Data Science Preprint PDF DOI

ZAYAN: Disentangled Contrastive Transformer for Tabular Remote Sensing Data

Al Zadid Sultan Bin Habib, Tanpia Tasnim, Md. Ekramul Islam, Muntasir Tabasum · 2026

Learning informative representations from tabular data in remote sensing and environmental science is challenging due to heterogeneity, scarce labels, and redundancy among features. We present ZAYAN (…

Read Paper →

AI & Data Science Preprint PDF DOI

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Wanyue Zhang, Wenxiang Wu, Wang Xu, Jiaxin Luo, Helu Zhi, Yibin Huang, Shuo Ren, Zitao Liu, Jiajun Zhang · 2026

Vision-language models (VLMs) have shown strong performance on static visual understanding, yet they still struggle with dynamic spatial reasoning that requires imagining how scenes evolve under egoce…

Read Paper →

AI & Data Science Preprint PDF DOI

Who Trains Matters: Federated Learning under Enrollment and Participation Selection Biases

Gota Morishita · 2026

Federated learning (FL) trains a shared model from updates contributed by distributed clients, often implicitly assuming that contributing clients are representative of the target population. In pract…

Read Paper →

Physics Preprint PDF DOI

Geometric-Phase (Pancharatnam-Berry) Correction for Time-Bin Photonic Qudits: A Calibration and Feed-Forward Algorithm

Ryan Rae-Cheng Wee, Josef Bruzzese · 2026

We develop a geometric-phase framework for time-bin photonic qudits and propose a practical calibration and feed-forward algorithm for separating and compensating geometric (Pancharatnam-Berry), dynam…

Read Paper →

Mathematics Preprint PDF DOI

Concurring reduction schemes for Dirac structures

Dan Aguero, Alessandro Arsie, Pedro Frejlich, Igor Mencattini · 2026

The notion of \emph{concurrence} was recently proposed as the natural compatibility relation between Dirac structures, generalizing the commutativity of two Poisson structures. We address the question…

Read Paper →

AI & Data Science Preprint PDF DOI

CGU-ILALab at FoodBench-QA 2026: Comparing Traditional and LLM-based Approaches for Recipe Nutrient Estimation

Wei-Chun Chen, Yu-Xuan Chen, I-Fang Chung, Ying-Jia Lin · 2026

Accurate nutrient estimation from unstructured recipe text is an important yet challenging problem in dietary monitoring, due to ambiguous ingredient terminology and highly variable quantity expressio…

Read Paper →

Computer Science Preprint PDF DOI

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Venkata Pushpak Teja Menta · 2026

Commercial TTS systems produce near-native Indic audio, but the best open-source bases (Chatterbox, Indic Parler-TTS, IndicF5) trail them on measured phonological dimensions, and the most widely adopt…

Read Paper →

AI & Data Science Preprint PDF DOI

Elite-Driven Support Vector Machines for Classification

Mohammad Jafari Jozani, Bahram Moeinianfar · 2026

Support vector machines (SVMs) are a standard tool for binary classification, but their classical formulations are purely data-driven and offer no direct way to encode trusted benchmark models or stru…

Read Paper →

AI & Data Science Preprint PDF DOI

Transformer Approximations from ReLUs

Jerry Yao-Chieh Hu, Mingcheng Lu, Yi-Chen Lee, Han Liu · 2026

We provide a systematic recipe for translating ReLU approximation results to softmax attention mechanism. This recipe covers many common approximation targets. Importantly, it yields target-specific, …

Read Paper →

AI & Data Science Preprint PDF DOI

Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling

Parsa Ashrafi Fashi, Utkarsh Saxena, Mehdi Rezagholizadeh, Aref Jafari, Akash Haridas, Mingyu Yang, Vansh Bhatia, Guihong Li, Vikram Appia, Emad Barsoum · 2026

Hybrid sequence models that combine efficient Transformer components with linear sequence modeling blocks are a promising alternative to pure Transformers, but most are still pretrained from scratch a…

Read Paper →

AI & Data Science Preprint PDF DOI

Zero-to-CAD: Agentic Synthesis of Interpretable CAD Programs at Million-Scale Without Real Data

Mohammadmehdi Ataei, Farzaneh Askari, Kamal Rahimi Malekshan, Pradeep Kumar Jayaraman · 2026

Computer-Aided Design (CAD) models are defined by their construction history: a parametric recipe that encodes design intent. However, existing large-scale 3D datasets predominantly consist of boundar…

Read Paper →

AI & Data Science Preprint PDF DOI

Architecture Determines Observability in Transformers

Thomas Carmichael · 2026

Autoregressive transformers make confident errors, but activation monitoring can catch them only if the model preserves an internal signal that output confidence does not expose. This preservation is …

Read Paper →

AI & Data Science Preprint PDF DOI

DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting

Naveen Mysore · 2026

Accurate time series forecasting in scientific domains such as climate modeling, physiological monitoring, and energy systems benefits from both competitive predictions and model transparency. This wo…

Read Paper →

Computer Science Preprint PDF DOI

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Dun Zhang · 2026

Modern retrieval pipelines increasingly serve downstream consumers like retrieval-augmented generation (RAG) and autonomous agents that need more than a scalar relevance score. A reranker that only te…

Read Paper →

AI & Data Science Preprint PDF DOI

Human-1 by Josh Talks: A Full-Duplex Conversational Modeling Framework in Hindi using Real-World Conversations

Bhaskar Singh, Shobhit Banga, Pranav Sharma · 2026

Full-duplex spoken dialogue systems can model natural conversational behaviours such as interruptions, overlaps, and backchannels, yet such systems remain largely unexplored for Indian languages. We p…

Read Paper →

AI & Data Science Preprint PDF DOI

A Scale-Adaptive Framework for Joint Spatiotemporal Super-Resolution with Diffusion Models

Max Defez, Filippo Quarenghi, Mathieu Vrac, Stephan Mandt, Tom Beucler · 2026

Deep-learning video super-resolution has progressed rapidly, but climate applications typically super-resolve (increase resolution) either space or time, and joint spatiotemporal models are often desi…

Read Paper →

Mathematics Preprint PDF DOI

A chain of $\mathbb{C}^{*}$-flips of the moduli spaces of $\mathcal{O}$-twisted rank 2 constrained framed Hitchin pairs on a smooth curve

YongJoo Shin, Sang-Bum Yoo · 2026

Let $X$ be a smooth complex projective curve. We prove that there exists a surjective commutative forgetful diagram from the chain of $\mathbb{C}^{*}$-flips of the moduli spaces of $\mathcal{O}_{X}$-t…

Read Paper →

AI & Data Science Preprint PDF DOI

Near-Future Policy Optimization

Chuanyu Qin, Chenxu Yang, Qingyi Si, Naibin Gu, Dingyu Yao, Zheng Lin, Peng Fu, Nan Duan, Jiaqi Wang · 2026

Reinforcement learning with verifiable rewards (RLVR) has become a core post-training recipe. Introducing suitable off-policy trajectories into on-policy exploration accelerates RLVR convergence and r…

Read Paper →

Browse Research Papers

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Universal statistical laws governing culinary design

ZAYAN: Disentangled Contrastive Transformer for Tabular Remote Sensing Data

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Who Trains Matters: Federated Learning under Enrollment and Participation Selection Biases

Geometric-Phase (Pancharatnam-Berry) Correction for Time-Bin Photonic Qudits: A Calibration and Feed-Forward Algorithm

Concurring reduction schemes for Dirac structures

CGU-ILALab at FoodBench-QA 2026: Comparing Traditional and LLM-based Approaches for Recipe Nutrient Estimation

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Elite-Driven Support Vector Machines for Classification

Transformer Approximations from ReLUs

Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling

Zero-to-CAD: Agentic Synthesis of Interpretable CAD Programs at Million-Scale Without Real Data

Architecture Determines Observability in Transformers

DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Human-1 by Josh Talks: A Full-Duplex Conversational Modeling Framework in Hindi using Real-World Conversations

A Scale-Adaptive Framework for Joint Spatiotemporal Super-Resolution with Diffusion Models

A chain of $\mathbb{C}^{*}$-flips of the moduli spaces of $\mathcal{O}$-twisted rank 2 constrained framed Hitchin pairs on a smooth curve

Near-Future Policy Optimization

Browse by Category

Research Type

Publish Your Research