Galip Aydin — Research Repository

Physics Preprint PDF DOI

Permutation-invariant codes: a numerical study and qudit constructions

Liam J. Bond, Jiri Minar, Maris Ozols, Arghavan Safavi-Naini, Vladyslav Visnevskyi · 2026

We investigate Permutation-Invariant (PI) quantum error-correcting codes encoding a logical qudit of dimension $\mathrm{d}_\mathrm{L}$ in PI states using physical qudits of dimension $\mathrm{d}_\math…

Read Paper →

AI & Data Science Preprint PDF DOI

LAB-Det: Language as a Domain-Invariant Bridge for Training-Free One-Shot Domain Generalization in Object Detection

Xu Zhang, Zhe Chen, Jing Zhang, Dacheng Tao · 2026

Foundation object detectors such as GLIP and Grounding DINO excel on general-domain data but often degrade in specialized and data-scarce settings like underwater imagery or industrial defects. Typica…

Read Paper →

AI & Data Science Preprint PDF DOI

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Shuo Lu, Haohan Wang, Wei Feng, Weizhen Wang, Shen Zhang, Yaoyu Li, Ao Ma, Zheng Zhang, Jingjing Lv, Junjie Shen, Ching Law, Bing Zhan, Yuan Xu, Huizai Yao, Yongcan Yu, Chenyang Si, Jian Liang · 2026

Advertising image generation has increasingly focused on online metrics like Click-Through Rate (CTR), yet existing approaches adopt a ``one-size-fits-all" strategy that optimizes for overall CTR whil…

Read Paper →

AI & Data Science Preprint PDF DOI

Exact Graph Learning via Integer Programming

Lucas Kook, S{o}ren Wengel Mogensen · 2026

Learning the dependence structure among variables in complex systems is a central problem across medical, natural, and social sciences. These structures can be naturally represented by graphs, and the…

Read Paper →

Mathematics Preprint PDF DOI

The divisor function along sums of two biquadrates

Wing Hong Leung, Mayank Pandey · 2026

We establish power saving asymptotics for the sum of the divisor function along a binary quartic form, improving on work of Daniel. The proof involves an application of a recent two dimensional delta …

Read Paper →

AI & Data Science Preprint PDF DOI

Language-Grounded Multi-Domain Image Translation via Semantic Difference Guidance

Jongwon Ryu, Joonhyung Park, Jaeho Han, Yeong-Seok Kim, Hye-rin Kim, Sunjae Yoon, Junyeong Kim · 2026

Multi-domain image-to-image translation re quires grounding semantic differences ex pressed in natural language prompts into corresponding visual transformations, while preserving unrelated structural…

Read Paper →

Mathematics Preprint PDF DOI

Signed Mahonian Polynomials on Colored Derangements

Hasan Arslan, Moussa Ahmia, Nazmiye Alemdar · 2025

The polynomial $\sum_{\pi \in W}q^{maj(\pi)}$ of major index over a classical Weyl group $W$ with a generating set $S$ is called the Mahonian polynomial over $W$, and also the polynomial $\sum_{\pi \i…

Read Paper →

AI & Data Science Preprint PDF DOI

Semantic-Guided Natural Language and Visual Fusion for Cross-Modal Interaction Based on Tiny Object Detection

Xian-Hong Huang, Hui-Kai Su, Chi-Chia Sun, Jun-Wei Hsieh · 2025

This paper introduces a cutting-edge approach to cross-modal interaction for tiny object detection by combining semantic-guided natural language processing with advanced visual recognition backbones. …

Read Paper →

Physics Preprint PDF DOI

Integrated Luminosity with 100 ppm Precision, Methods for $\sqrt{s}$ Precision of 1 ppm, and Beyond Standard Model Sensitivity using Photonic Events, at $\mathrm{e^{+}e^{-}}$ Higgs Factories

Brendon Madison · 2025

Future electron-positron ($\ee$) colliders, operating as Higgs factories or Z factories, promise unprecedented precision electroweak measurements that are vital to testing the Standard Model (SM) and …

Read Paper →

Mathematics Preprint PDF DOI

Circular sorting, strong complete mappings and wreath product constructions

Paul Bastide, Anurag Bishnoi, Carla Groenland, Dion Gijswijt, Rohinee Joshi · 2025

We continue the study of Adin, Alon and Roichman [arXiv:2502.14398, 2025] on the number of steps required to sort $n$ labelled points on a circle by transpositions. Imagine that the vertices of a cycl…

Read Paper →

AI & Data Science Preprint PDF DOI

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Tianyue Wang, Shuang Yang, Shiguang Shan, Xilin Chen · 2025

Visual speech recognition (VSR), also known as lip reading, is the task of recognizing speech from silent video. Despite significant advancements in VSR over recent decades, most existing methods pay …

Read Paper →

Engineering Preprint PDF DOI

Hierarchical Reduced-Order Model Predictive Control for Robust Locomotion on Humanoid Robots

Adrian B. Ghansah, Sergio A. Esteban, Aaron D. Ames · 2025

As humanoid robots enter real-world environments, ensuring robust locomotion across diverse environments is crucial. This paper presents a computationally efficient hierarchical control framework for …

Read Paper →

AI & Data Science Preprint PDF DOI

Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset

Ziye Deng, Ruihan He, Jiaxiang Liu, Yuan Wang, Zijie Meng, Songtao Jiang, Yong Xie, Zuozhu Liu · 2025

Medical image grounding aims to align natural language phrases with specific regions in medical images, serving as a foundational task for intelligent diagnosis, visual question answering (VQA), and a…

Read Paper →

Computer Science Preprint PDF DOI

A Compact Post-quantum Strong Designated Verifier Signature Scheme from Isogenies

Farzin Renan · 2025

Digital signatures are fundamental cryptographic tools that provide authentication and integrity in digital communications. However, privacy-sensitive applications, such as e-voting and digital cash, …

Read Paper →

AI & Data Science Preprint PDF DOI

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Hayeon Kim, Ji Ha Jang, Se Young Chun · 2025

Recent advances in 3D neural representations and instance-level editing models have enabled the efficient creation of high-quality 3D content. However, achieving precise local 3D edits remains challen…

Read Paper →

Physics Preprint PDF DOI

Chaoticus: a parallel approach to the computation of chaos indicators

Javier Jimenez-Lopez, Jose Saez-Landete, Victor J. Garcia-Garrido · 2025

In this paper we present Chaoticus, a Python-based package for the GPU-accelerated integration of ODE systems and the computation of chaos indicators, including SALI, GALI, Lagrangian Descriptors base…

Read Paper →

Computer Science Preprint PDF DOI

Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models

Andela Ilic, Jiaxi Jiang, Paul Streli, Xintong Liu, Christian Holz · 2025

Motion capture using sparse inertial sensors has shown great promise due to its portability and lack of occlusion issues compared to camera-based tracking. Existing approaches typically assume that IM…

Read Paper →

AI & Data Science Preprint PDF DOI

Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Yibo Cui, Liang Xie, Yu Zhao, Jiawei Sun, Erwei Yin · 2025

Vision-Language Navigation (VLN) enables intelligent agents to navigate environments by integrating visual perception and natural language instructions, yet faces significant challenges due to the sca…

Read Paper →

AI & Data Science Preprint PDF DOI

Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures

Shun Inadumi, Nobuhiro Ueda, Koichiro Yoshino · 2025

Multimodal reference resolution, including phrase grounding, aims to understand the semantic relations between mentions and real-world objects. Phrase grounding between images and their captions is a …

Read Paper →

AI & Data Science Preprint PDF DOI

GLIP-OOD: Zero-Shot Graph OOD Detection with Graph Foundation Model

Haoyan Xu, Zhengtao Yao, Xuzhi Zhang, Ziyi Wang, Langzhou He, Yushun Dong, Philip S. Yu, Mengyuan Li, Yue Zhao · 2025

Out-of-distribution (OOD) detection is critical for ensuring the safety and reliability of machine learning systems, particularly in dynamic and open-world environments. In the vision and text domains…

Read Paper →

Browse Research Papers

Permutation-invariant codes: a numerical study and qudit constructions

LAB-Det: Language as a Domain-Invariant Bridge for Training-Free One-Shot Domain Generalization in Object Detection

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Exact Graph Learning via Integer Programming

The divisor function along sums of two biquadrates

Language-Grounded Multi-Domain Image Translation via Semantic Difference Guidance

Signed Mahonian Polynomials on Colored Derangements

Semantic-Guided Natural Language and Visual Fusion for Cross-Modal Interaction Based on Tiny Object Detection

Integrated Luminosity with 100 ppm Precision, Methods for $\sqrt{s}$ Precision of 1 ppm, and Beyond Standard Model Sensitivity using Photonic Events, at $\mathrm{e^{+}e^{-}}$ Higgs Factories

Circular sorting, strong complete mappings and wreath product constructions

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Hierarchical Reduced-Order Model Predictive Control for Robust Locomotion on Humanoid Robots

Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset

A Compact Post-quantum Strong Designated Verifier Signature Scheme from Isogenies

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Chaoticus: a parallel approach to the computation of chaos indicators

Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models

Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures

GLIP-OOD: Zero-Shot Graph OOD Detection with Graph Foundation Model

Browse by Category

Research Type

Publish Your Research