Pedro Cabalar — Research Repository

AI & Data Science Preprint PDF DOI

TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering

An-Yang Ji, Jun-Peng Jiang, De-Chuan Zhan, Han-Jia Ye · 2026

Large Language Models (LLMs) have advanced Table Question Answering, where most queries can be answered by extracting information or simple aggregation. However, a common class of real-world queries i…

Read Paper →

AI & Data Science Preprint PDF DOI

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking

Qing Lyu, Jeremy Hudson, Mohammad Kawas, Yuming Jiang, Chenyu You, Christopher T Whitlow · 2026

Individualized Alzheimer's disease (AD) progression prediction requires models that use irregular visits, account for censoring, avoid diagnostic leakage, and provide calibrated horizon risks. We prop…

Read Paper →

AI & Data Science Preprint PDF DOI

MIFair: A Mutual-Information Framework for Intersectionality and Multiclass Fairness

Jeanne Monnier, Thomas George, Frederic Guyard, Christele Tarnec, Marios Kountouris · 2026

Fairness in machine learning remains challenging due to its ethical complexity, the absence of a universal definition, and the need for context-specific bias metrics. Existing methods still struggle w…

Read Paper →

AI & Data Science Preprint PDF DOI

Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and Why

Sascha Xu, Jilles Vreeken · 2026

We study the problem of understanding where two populations differ within a feature space, which we formalize in the concept of a differential subgroup: a subset of individuals from both populations w…

Read Paper →

Engineering Preprint PDF DOI

Can Tabular Foundation Models Guide Exploration in Robot Policy Learning?

Buqing Ou, Frederike Dumbgen · 2026

Policy optimization in high-dimensional continuous control for robotics remains a challenging problem. Predominant methods are inherently local and often require extensive tuning and carefully chosen …

Read Paper →

AI & Data Science Preprint PDF DOI

ZAYAN: Disentangled Contrastive Transformer for Tabular Remote Sensing Data

Al Zadid Sultan Bin Habib, Tanpia Tasnim, Md. Ekramul Islam, Muntasir Tabasum · 2026

Learning informative representations from tabular data in remote sensing and environmental science is challenging due to heterogeneity, scarce labels, and redundancy among features. We present ZAYAN (…

Read Paper →

AI & Data Science Preprint PDF DOI

CoAX: Cognitive-Oriented Attribution eXplanation User Model of Human Understanding of AI Explanations

Louth Bin Rawshan, Zhuoyu Wang, Brian Y. Lim · 2026

Explainable AI (XAI) aims to improve user understanding and decisions when using AI models. However, despite innovations in XAI, recent user evaluations reveal that this goal remains elusive. Understa…

Read Paper →

AI & Data Science Preprint PDF DOI

Evaluating TabPFN for Mild Cognitive Impairment to Alzheimer's Disease Conversion in Data Limited Settings

Brad Ye, Bulent Soykan, Gulsah Hancerliogullari Koksalmis, Hsin-Hsiung Huang, Laura J. Brattain · 2026

Accurate prediction of conversion from Mild Cognitive Impairment (MCI) to Alzheimers Diseases (AD) is essential for early intervention, however, developing reliable conversion predictive models is dif…

Read Paper →

AI & Data Science Preprint PDF DOI

SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering

Minhee Park, Seongyeon Son, Yonghyun Lee, Eunchan Kim · 2026

Automatic feature engineering is an effective approach for improving predictive performance in tabular learning. However, expand-and-reduce methods, such as OpenFE, become increasingly computationally…

Read Paper →

AI & Data Science Preprint PDF DOI

Efficient and Interpretable Transformer for Counterfactual Fairness

Panyi Dong, Zhiyu Quan · 2026

The growing reliance of machine learning models in high-stakes, highly regulated domains such as finance and insurance has created a growing tension between predictive performance, interpretability, a…

Read Paper →

AI & Data Science Preprint PDF DOI

Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts

Taylor Maxson, Roberto Corizzo, Yaning Wu, Nathalie Japkowicz, Colin Bellinger · 2026

Class-level evaluation can conceal substantial performance disparities across subconcepts within the same class, causing models that perform well on average to fail on specific subpopulations. Prior w…

Read Paper →

Mathematics Preprint PDF DOI

Enumerating Multi-Operator Monomials in Commutative and Noncommutative Settings

Yu Hin Au, Murray R. Bremner · 2026

We study enumeration problems for multi-operator monomials generated from one indeterminate by an associative multiplication together with finitely many unary operators. We consider four regimes, acco…

Read Paper →

AI & Data Science Preprint PDF DOI

Knowledge-Data Dually Driven Paradigm for Accurate Landslide Susceptibility Prediction under Data-Scarce Conditions Using Geomorphic Priors and Tabular Foundation Model

Yuting Yang, Gang Mei, Feng Chen, Yongshuang Zhang, Jianbing Peng · 2026

Landslide susceptibility prediction is critical for geohazard risk assessment and mitigation. Conventional data-driven paradigm achieves high predictive accuracy but require sufficient conditioning fa…

Read Paper →

AI & Data Science Preprint PDF DOI

Accurate and Robust Generative Approach for Overcoming Data Sparsity and Imbalance in Landslide Modeling with A Tabular Foundation Model

Kaixuan Shao, Gang Mei, Yinghan Wu, Nengxiong Xu, Jianbing Peng · 2026

Landslide investigation relies on sufficient and well-balanced observational data influenced by geological, hydrological, and anthropogenic factors. Available landslide inventories are often sparse an…

Read Paper →

AI & Data Science Preprint PDF DOI

Prior-Aligned Data Cleaning for Tabular Foundation Models

Laure Berti-Equille · 2026

Tabular Foundation Models (TFMs) achieve state-of-the-art zero-shot accuracy on small tabular datasets by meta-learning over synthetic data-generating processes -- making them highly attractive for pr…

Read Paper →

AI & Data Science Preprint PDF DOI

Zero Shot Coordination for Sparse Reward Tasks with Diverse Reward Shapings

Keenan Powell, Peihong Yu, Pratap Tokekar · 2026

Many Multi-Agent Reinforcement Learning (MARL) agents fail to adapt properly to cooperating with agents trained with the same objectives but different seeds, algorithms, or other training differences.…

Read Paper →

AI & Data Science Preprint PDF DOI

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

Seok Hwan Song, Azher Ahmed Efat, Wallapak Tavanapong · 2026

Chart-to-table translation converts chart images into structured tabular data. Accurate translation is crucial for Multimodal Language Model (MLM) to answer complex queries. We observe imbalances in t…

Read Paper →

Physics Preprint PDF DOI

Emergent Features in $U(N) \times U(\tilde{N})$ Bi-adjoint Cubic Theory

Lauren Smyth · 2026

This work investigates the role of the $U(N) \times U(\tilde{N})$ global symmetry in tree-level scattering amplitudes of the bi-adjoint $\phi^3$ theory from three perspectives: combinatorics, correlat…

Read Paper →

AI & Data Science Preprint PDF DOI

SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation

Shuo Yang, Zheyu Zhang, Bardh Prenkaj, Gjergji Kasneci · 2026

Generating high-fidelity synthetic tabular data remains a critical challenge for enhancing data availability in privacy-sensitive and low-resource domains. Recent approaches leverage LLMs by represent…

Read Paper →

Mathematics Preprint PDF DOI

Continued fractions related to Narayana polynomials

Johann Cigler · 2026

The generating functions of some sequences of Catalan numbers and Narayana polynomials have simple expansions as continued fractions of Jacobi type. We give an overview of these facts and prove analog…

Read Paper →

Browse Research Papers

TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering

PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer's Disease Progression and Dynamic Tracking

MIFair: A Mutual-Information Framework for Intersectionality and Multiclass Fairness

Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and Why

Can Tabular Foundation Models Guide Exploration in Robot Policy Learning?

ZAYAN: Disentangled Contrastive Transformer for Tabular Remote Sensing Data

CoAX: Cognitive-Oriented Attribution eXplanation User Model of Human Understanding of AI Explanations

Evaluating TabPFN for Mild Cognitive Impairment to Alzheimer's Disease Conversion in Data Limited Settings

SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering

Efficient and Interpretable Transformer for Counterfactual Fairness

Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts

Enumerating Multi-Operator Monomials in Commutative and Noncommutative Settings

Knowledge-Data Dually Driven Paradigm for Accurate Landslide Susceptibility Prediction under Data-Scarce Conditions Using Geomorphic Priors and Tabular Foundation Model

Accurate and Robust Generative Approach for Overcoming Data Sparsity and Imbalance in Landslide Modeling with A Tabular Foundation Model

Prior-Aligned Data Cleaning for Tabular Foundation Models

Zero Shot Coordination for Sparse Reward Tasks with Diverse Reward Shapings

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

Emergent Features in $U(N) \times U(\tilde{N})$ Bi-adjoint Cubic Theory

SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation

Continued fractions related to Narayana polynomials

Browse by Category

Research Type

Publish Your Research