Pascal Ochem in AI & Data Science — Research Repository

AI & Data Science Preprint PDF DOI

Learning to Reason: Targeted Knowledge Discovery and Fuzzy Logic Update for Robust Image Recognition

Gurucharan Srinivas, Joshua Niemeijer, Frank Koster · 2026

Integrating domain knowledge into deep neural networks is a promising way to improve generalization. Existing methods either encode prior knowledge in the loss function or apply post-processing module…

Read Paper →

AI & Data Science Preprint PDF DOI

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

Joshua Sherwood, Ben Aybar, Benjamin Kaplan · 2026

Forecasting when AI systems will become capable of meaningfully accelerating AI research is a central challenge for AI safety. Existing benchmarks measure broad capability growth, but may not provide …

Read Paper →

AI & Data Science Preprint PDF DOI

Self-Supervised Representation Learning via Hyperspherical Density Shaping

Esteban Rodriguez-Betancourt, Edgar Casasola-Murillo · 2026

Modern self-supervised representation learning methods often relies on empirical heuristics that are not theoretically grounded. In this study we propose HyDeS, a theoretically grounded method based o…

Read Paper →

AI & Data Science Preprint PDF DOI

Hard to See, Hard to Label: Generative and Symbolic Acquisition for Subtle Visual Phenomena

Renjith Prasad, Rishabh Sharma, Andrew E. Shao, Annmary Justine Koomthanam, Shreyas Kulkarni, Suparna Bhattacharya, Martin Foltin, Amit Sheth, David Orozco, Matthew Quinn, Brian Sammuli · 2026

Subtle visual anomalies such as hairline cracks, sub-millimeter voids, and low-contrast inclusions are structurally atypical yet visually ambiguous, making them both difficult to annotate and easy to …

Read Paper →

AI & Data Science Preprint PDF DOI

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Yutian Chen, Shi Guo, Renbiao Jin, Tianshuo Yang, Xin Cai, Yawen Luo, Mingxin Yang, Mulin Yu, Linning Xu, Tianfan Xue · 2026

Sparse-view 3D reconstruction is essential for modeling scenes from casual captures, but remain challenging for non-generative reconstruction. Existing diffusion-based approaches mitigates this issues…

Read Paper →

AI & Data Science Preprint PDF DOI

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

Chaojie Mao, Chen-Wei Xie, Chongyang Zhong, Haoyou Deng, Jiaxing Zhao, Jie Xiao, Jinbo Xing, Jingfeng Zhang, Jingren Zhou, Jingyi Zhang, Jun Dan, Kai Zhu, Kang Zhao, Keyu Yan, Minghui Chen, Pandeng Li, Shuangle Chen, Tong Shen, Yu Liu, Yue Jiang, Yulin Pan, Yuxiang Tuo, Zeyinzi Jiang, Zhen Han, Ang Wang, Bang Zhang, Baole Ai, Bin Wen, Boang Feng, Feiwu Yu, Gang Wang, Haiming Zhao, He Kang, Jianjing Xiang, Jianyuan Zeng, Jinkai Wang, Junjie Zhou, Ke Sun, Linqian Wu, Pei Gong, Pingyu Wu, Ruiwen Wu, Tongtong Su, Wenmeng Zhou, Wenting Shen, Wenyuan Yu, Xianjun Xu, Xiaoming Huang, Xiejie Shen, Xin Xu, Yan Kou, Yangyu Lv, Yifan Zhai, Yitong Huang, Yun Zheng, Yuntao Hong, Zhe Zhang, Zhicheng Zhang · 2026

We present Wan-Image, a unified visual generation system explicitly engineered to paradigm-shift image generation models from casual synthesizers into professional-grade productivity tools. While cont…

Read Paper →

AI & Data Science Preprint PDF DOI

MARCO: Navigating the Unseen Space of Semantic Correspondence

Claudia Cuttano, Gabriele Trivigno, Carlo Masone, Stefan Roth · 2026

Recent advances in semantic correspondence rely on dual-encoder architectures, combining DINOv2 with diffusion backbones. While accurate, these billion-parameter models generalize poorly beyond traini…

Read Paper →

AI & Data Science Preprint PDF DOI

Lorentz Framework for Semantic Segmentation

Zahid Hasan, Masud Ahmed, Nirmalya Roy · 2026

Semantic segmentation in hyperbolic space enables compact modeling of hierarchical structure while providing inherent uncertainty quantification. Prior approaches predominantly rely on the Poincar\'e …

Read Paper →

AI & Data Science Preprint PDF DOI

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis

Cheng-You Lu, Yi-Shan Hung, Wei-Ling Chi, Hao-Ping Wang, Charlie Li-Ting Tsai, Yu-Cheng Chang, Yu-Lun Liu, Thomas Do, Chin-Teng Lin · 2026

Advances in radiance fields have enabled photorealistic novel view synthesis. In several domains, large-scale real-world datasets have been developed to support comprehensive benchmarking and to facil…

Read Paper →

AI & Data Science Preprint PDF DOI

Parcae: Scaling Laws For Stable Looped Language Models

Hayden Prairie, Zachary Novack, Taylor Berg-Kirkpatrick, Daniel Y. Fu · 2026

Traditional fixed-depth architectures scale quality by increasing training FLOPs, typically through increased parameterization, at the expense of a higher memory footprint, or data. A potential altern…

Read Paper →

AI & Data Science Preprint PDF DOI

GTPBD-MM: A Global Terraced Parcel and Boundary Dataset with Multi-Modality

Zhiwei Zhang, Xingyuan Zeng, Xinkai Kong, Kunquan Zhang, Haoyuan Liang, Bohan Shi, Juepeng Zheng, Jianxi Huang, Yutong Lu, Haohuan Fu · 2026

Agricultural parcel extraction plays an important role in remote sensing-based agricultural monitoring, supporting parcel surveying, precision management, and ecological assessment. However, existing …

Read Paper →

AI & Data Science Preprint PDF DOI

BareBones: Benchmarking Zero-Shot Geometric Comprehension in VLMs

Aaditya Baranwal, Vishal Yadav, Abhishek Rajora · 2026

While Vision-Language Models (VLMs) demonstrate remarkable zero-shot recognition capabilities across a diverse spectrum of multimodal tasks, it yet remains an open question whether these architectures…

Read Paper →

AI & Data Science Preprint PDF DOI

Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration

Hang Lv, Hongchao Gu, Ruiqing Yang, Liangyue Li, Zulong Chen, Defu Lian, Hao Wang, Enhong Chen · 2026

Generative listwise reranking leverages global context for superior retrieval but is plagued by intrinsic position bias, where models exhibit structural sensitivity to input order independent of relev…

Read Paper →

AI & Data Science Preprint PDF DOI

LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics

Kosmas Pinitas, Ilias Maglogiannis · 2026

Predicting affect in unconstrained environments remains a fundamental challenge in human-centered AI. While deep neural embeddings dominate contemporary approaches, they often lack interpretability an…

Read Paper →

AI & Data Science Preprint PDF DOI

Variational Feature Compression for Model-Specific Representations

Zinan Guo, Zihan Wang, Chuan Yan, Liuhuo Wan, Ethan Ma, Guangdong Bai · 2026

As deep learning inference is increasingly deployed in shared and cloud-based settings, a growing concern is input repurposing, in which data submitted for one task is reused by unauthorized models fo…

Read Paper →

AI & Data Science Preprint PDF DOI

Few-Shot Semantic Segmentation Meets SAM3

Yi-Jen Tsai, Yen-Yu Lin, Chien-Yao Wang · 2026

Few-Shot Semantic Segmentation (FSS) focuses on segmenting novel object categories from only a handful of annotated examples. Most existing approaches rely on extensive episodic training to learn tran…

Read Paper →

AI & Data Science Preprint PDF DOI

MTLSI-Net: A Linear Semantic Interaction Network for Parameter-Efficient Multi-Task Dense Prediction

Chen Liu, Hengyu Man, Xiaopeng Fan, Debin Zhao · 2026

Multi-task dense prediction aims to perform multiple pixel-level tasks simultaneously. However, capturing global cross-task interactions remains non-trivial due to the quadratic complexity of standard…

Read Paper →

AI & Data Science Preprint PDF DOI

Kernel Dynamics under Path Entropy Maximization

Jnaneshwar Das · 2026

We propose a variational framework in which the kernel function k : X x X -> R, interpreted as the foundational object encoding what distinctions an agent can represent, is treated as a dynamical vari…

Read Paper →

AI & Data Science Preprint PDF DOI

ESGLens: An LLM-Based RAG Framework for Interactive ESG Report Analysis and Score Prediction

Tsung-Yu Yang, Meng-Chi Chen · 2026

Environmental, Social, and Governance (ESG) reports are central to investment decision-making, yet their length, heterogeneous content, and lack of standardized structure make manual analysis costly a…

Read Paper →

AI & Data Science Preprint PDF DOI

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Guohuan Xie, Xin He, Dingying Fan, Le Zhang, Ming-Ming Cheng, Yun Liu · 2026

Generalized few-shot semantic segmentation (GFSS) is fundamentally limited by the coverage of novel-class appearances under scarce annotations. While diffusion models can synthesize novel-class images…

Read Paper →

Browse Research Papers

Learning to Reason: Targeted Knowledge Discovery and Fuzzy Logic Update for Robust Image Recognition

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

Self-Supervised Representation Learning via Hyperspherical Density Shaping

Hard to See, Hard to Label: Generative and Symbolic Acquisition for Subtle Visual Phenomena

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

MARCO: Navigating the Unseen Space of Semantic Correspondence

Lorentz Framework for Semantic Segmentation

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis

Parcae: Scaling Laws For Stable Looped Language Models

GTPBD-MM: A Global Terraced Parcel and Boundary Dataset with Multi-Modality

BareBones: Benchmarking Zero-Shot Geometric Comprehension in VLMs

Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration

LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics

Variational Feature Compression for Model-Specific Representations

Few-Shot Semantic Segmentation Meets SAM3

MTLSI-Net: A Linear Semantic Interaction Network for Parameter-Efficient Multi-Task Dense Prediction

Kernel Dynamics under Path Entropy Maximization

ESGLens: An LLM-Based RAG Framework for Interactive ESG Report Analysis and Score Prediction

Make It Up: Fake Images, Real Gains in Generalized Few-shot Semantic Segmentation

Browse by Category

Research Type

Publish Your Research