Iddo Drori — Preprint — Research Repository

AI & Data Science · Preprint

CA-IDD: Cross-Attention Guided Identity-Conditional Diffusion for Identity-Consistent Face Swapping

Md Shohel Rana, Tanoy Debnath · 2026

Face swapping aims to optimize realistic facial image generation by leveraging the identity of a source face onto a target face while preserving pose, expression, and context. However, existing method…

AI & Data Science · Preprint

Interpolating Discrete Diffusion Models with Controllable Resampling

Marcel Kollovieh, Sirine Ayadi, Stephan Günnemann · 2026

Discrete diffusion models form a powerful class of generative models across diverse domains, including text and graphs. However, existing approaches face fundamental limitations. Masked diffusion mode…

AI & Data Science · Preprint

GeM-EA: A Generative and Meta-learning Enhanced Evolutionary Algorithm for Streaming Data-Driven Optimization

Yue Wu, Yuan-Ting Zhong, Ze-Yuan Ma, Yue-Jiao Gong · 2026

Streaming Data-Driven Optimization (SDDO) problems arise in many applications where data arrive continuously and the optimization environment evolves over time. Concept drift produces non-stationary l…

AI & Data Science · Preprint

Phonological distances for linguistic typology and the origin of Indo-European languages

Marius Mavridis, Juan De Gregorio, Raul Toral, David Sanchez · 2026

We show that short-range phoneme dependencies encode large-scale patterns of linguistic relatedness, with direct implications for quantitative typology and evolutionary linguistics. Specifically, usin…

AI & Data Science · Preprint

DDO-RM: Distribution-Level Policy Improvement after Reward Learning

Tiantian Zhang, Jierui Zuo, Michael Chen, Wenping Wang · 2026

Recent theory suggests that reward-model-first methods can be more sample-efficient than direct policy fitting when the reward function is statistically simpler than the induced policy. We propose DDO…

AI & Data Science · Preprint

IDDM: Identity-Decoupled Personalized Diffusion Models with a Tunable Privacy-Utility Trade-off

Linyan Dai, Xinwei Zhang, Haoyang Li, Qingqing Ye, Haibo Hu · 2026

Personalized text-to-image diffusion models (e.g., DreamBooth, LoRA) enable users to synthesize high-fidelity avatars from a few reference photos for social expression. However, once these generations…

Computer Science · Preprint

Scalable AI-assisted Workflow Management for Detector Design Optimization Using Distributed Computing

Derek Anderson, Amit Bashyal, Markus Diefenthaler, Cristiano Fanelli, Wen Guan, Tanja Horn, Alex Jentsch, Meifeng Lin, Tadashi Maeno, Kei Nagai, Hemalata Nayak, Connor Pecar, Karthik Suresh, Fang-Ying Tsai, Anselm Vossen, Tianle Wang, Torre Wenaus · 2026

The Production and Distributed Analysis (PanDA) system, originally developed for the ATLAS experiment at the CERN Large Hadron Collider (LHC), has evolved into a robust platform for orchestrating larg…

AI & Data Science · Preprint

CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context

Giovana Kerche Bonas, Roseval Malaquias Junior, Marcos Piau, Thiago Laitz, Thales Sales Almeida, Hugo Abonizio, Celio Larcher, Ramon Pires, Rodrigo Nogueira · 2026

We introduce CAPITU, a benchmark for evaluating instruction-following capabilities of Large Language Models (LLMs) in Brazilian Portuguese. Unlike existing benchmarks that focus on English or use gene…

AI & Data Science · Preprint

Do Multilingual VLMs Reason Equally? A Cross-Lingual Visual Reasoning Audit for Indian Languages

Swastik R · 2026

Vision-language models score well on mathematical, scientific, and spatial reasoning benchmarks, yet these evaluations are overwhelmingly English. I present the first cross-lingual visual reasoning au…

AI & Data Science · Preprint

Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs

Yuxin Liu, Fei Wang, Kun Li, Yiqi Nie, Junjie Chen, Zhangling Duan, Zhaohong Jia · 2026

Image Deepfake Detection (IDD) separates manipulated images from authentic ones by spotting artifacts of synthesis or tampering. Although large vision-language models (LVLMs) offer strong image unders…

AI & Data Science · Preprint

TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation

Prajwal Panth, Agniva Maiti · 2026

The rapid proliferation of Large Language Models (LLMs) has created a profound digital divide, effectively excluding indigenous languages of the Global South from the AI revolution. The Tharu language…

AI & Data Science · Preprint

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Tianwei Xiong, Jun Hao Liew, Zilong Huang, Zhijie Lin, Jiashi Feng, Xihui Liu · 2026

Autoregressive (AR) video generative models rely on video tokenizers that compress pixels into discrete token sequences. The length of these token sequences is crucial for balancing reconstruction qua…

AI & Data Science · Preprint

Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs

Nazia Tasnim, Keanu Nichols, Yuting Yang, Nicholas Ikechukwu, Elva Zou, Deepti Ghadiyaram, Bryan A. Plummer · 2026

Humans learn object orientation progressively, from recognizing which way an object faces, to mentally rotating it, to reasoning about orientations between objects. Current vision-language benchmarks …

Physics · Preprint

Obscured Star Formation in the Dwarf Galaxy DDO 43? A Comparative UV-IR Analysis

Aron Juhasz, Eniko Pichler · 2026

We present a study of recent star formation in the dwarf irregular galaxy DDO 43 using GALEX FUV and WISE NIR imaging. We identify regions of elevated FUV flux, indicating unobscured star-forming acti…

AI & Data Science · Preprint

Oral to Web: Digitizing 'Zero Resource' Languages of Bangladesh

Mohammad Mamun Or Rashid · 2026

We present the Multilingual Cloud Corpus, the first national-scale, parallel, multimodal linguistic dataset of Bangladesh's ethnic and indigenous languages. Despite being home to approximately 40 mino…

Physics · Preprint

Quantum Simulations for Extreme Ultraviolet Photolithography

Tyler D. Kharazi, Stepan Fomichev, Shu Kanno, Takao Kobayashi, Juan Miguel Arrazola, Qi Gao, Torin F. Stetina · 2026

Extreme Ultraviolet (EUV) lithography is the state-of-the-art process in semiconductor fabrication, yet its spatial resolution is fundamentally limited by the "blur" originating from absorption of p…

AI & Data Science · Preprint

Multilingual Large Language Models do not comprehend all natural languages to equal degrees

Natalia Moskvina, Raquel Montero, Masaya Yoshida, Ferdy Hubers, Paolo Morosi, Walid Irhaymi, Jin Yan, Tamara Serrano, Elena Pagliarini, Fritz Gunther, Evelina Leivada · 2026

Large Language Models (LLMs) play a critical role in how humans access information. While their core use relies on comprehending written requests, our understanding of this ability is currently limite…

AI & Data Science · Preprint

T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

Tunyu Zhang, Xinxi Zhang, Ligong Han, Haizhou Shi, Xiaoxiao He, Zhuowei Li, Hao Wang, Kai Xu, Akash Srivastava, Hao Wang, Vladimir Pavlovic, Dimitris N. Metaxas · 2026

Diffusion large language models (DLLMs) have the potential to enable fast text generation by decoding multiple tokens in parallel. However, in practice, their inference efficiency is constrained by th…

AI & Data Science · Preprint

LUVE: Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts

Chen Zhao, Jiawei Chen, Hongyu Li, Zhuoliang Kang, Shilin Lu, Xiaoming Wei, Kai Zhang, Jian Yang, Ying Tai · 2026

Recent advances in video diffusion models have significantly improved visual quality, yet ultra-high-resolution (UHR) video generation remains a formidable challenge due to the compounded difficulties…

AI & Data Science · Preprint

MultiLexNorm++: A Unified Benchmark and a Generative Model for Lexical Normalization for Asian Languages

Weerayut Buaphet, Thanh-Nhi Nguyen, Risa Kondo, Tomoyuki Kajiwara, Yumin Kim, Jimin Lee, Hwanhee Lee, Holy Lovenia, Peerat Limkonchotiwat, Sarana Nutanong, Rob Van der Goot · 2026

Social media data has been of interest to Natural Language Processing (NLP) practitioners for over a decade, because of its richness in information, but also challenges for automatic processing. Since…
