Ashish Tiwari — Research Repository

AI & Data Science Preprint PDF DOI

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Xin Zhou, Dingkang Liang, Xiwu Chen, Feiyang Tan, Dingyuan Zhang, Hengshuang Zhao, Xiang Bai · 2026

Driving world models serve as a pivotal technology for autonomous driving by simulating environmental dynamics. However, existing approaches predominantly focus on future scene generation, often overl…

Read Paper →

Physics Preprint PDF DOI

Towards Systematics of Calabi-Yau Landscape for String Cosmology

George K. Leontaris, Pramod Shukla · 2026

In this review, we discuss the relevance and impact of studying Calabi-Yau threefolds in the context of global model building in string phenomenology. First, taking a phenomenologist-friendly approach…

Read Paper →

AI & Data Science Preprint PDF DOI

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Keming Wu, Zuhao Yang, Kaichen Zhang, Shizun Wang, Haowei Zhu, Sicong Leng, Zhongyu Yang, Qijie Wang, Sudong Wang, Ziting Wang, Zili Wang, Hui Zhang, Haonan Wang, Hang Zhou, Yifan Pu, Xingxuan Li, Fangneng Zhan, Bo Li, Lidong Bing, Yuxin Song, Ziwei Liu, Wenhu Chen, Jingdong Wang, Xinchao Wang, Xiaojuan Qi, Shijian Lu, Bin Wang · 2026

Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, …

Read Paper →

Physics Preprint PDF DOI

Uniaxial strain-driven ferroelastic domain control in LaAlO3

Matthias Roeper, Robin Buschbeck, Jakob Wetzel, Tobias Ritschel, Anna-Lena Hofmann, Vladyslav Kovtunovych, Mike N. Pionteck, Javier Taboada-Gutierrez, Alexey B. Kuzmenko, Martina Basini, Vivek Unikandanunni, Iuliia Kiseleva, Jochen Geck, Susanne C. Kehr, Maximilian Lederer, Simone Sanna, Lukas M. Eng, Samuel D. Seddon · 2026

Multiferroic domain walls in functional oxides exhibit properties distinct from the bulk and are increasingly exploited as active elements in nanoelectronic and photonic devices. Deterministic control…

Read Paper →

AI & Data Science Preprint PDF DOI

PhyCo: Learning Controllable Physical Priors for Generative Motion

Sriram Narayanan, Ziyu Jiang, Srinivasa Narasimhan, Manmohan Chandraker · 2026

Modern video diffusion models excel at appearance synthesis but still struggle with physical consistency: objects drift, collisions lack realistic rebound, and material responses seldom match their un…

Read Paper →

AI & Data Science Preprint PDF DOI

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin · 2026

The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…

Read Paper →

AI & Data Science Preprint PDF DOI

Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

Junqi Gao, Dazhi Zhang, Zhichang Guo, Biqing Qi, Yi Ran, Wangmeng Zuo · 2026

Model merging has attracted attention as an effective path toward multi-task adaptation by integrating knowledge from multiple task-specific models. Among existing approaches, dynamic merging mitigate…

Read Paper →

AI & Data Science Preprint PDF DOI

3D Reconstruction Techniques in the Manufacturing Domain: Applications, Research Opportunities and Use Cases

Chialoon Cheng, Kaijun liu, Zhiyang Liu, Marcelo H Ang Jr · 2026

This comprehensive review examines the evolution and the current state of the art in three-dimensional (3D) reconstruction techniques in manufacturing applications. The analysis covers both traditiona…

Read Paper →

AI & Data Science Preprint PDF DOI

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Jialu Shen, Han Lyu, Suyang Zhong, Hanzheng Li, Haoyi Tao, Nan Wang, Changhong Chen, Xi Fang · 2026

Spectra are a prevalent yet highly information-dense form of scientific imagery, presenting substantial challenges to multimodal large language models (MLLMs) due to their unstructured and domain-spec…

Read Paper →

AI & Data Science Preprint PDF DOI

Echo-{\alpha}: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

Jing Zhang, Wentao Jiang, Tao Huang, Zhiwei Wang, Jianxin Liu, Jian Chen, Ping Ye, Gang Wang, Zengmao Wang, Bo Du, Dacheng Tao · 2026

Ultrasound interpretation requires both precise lesion localization and holistic clinical reasoning, yet existing methods typically excel at only one of these capabilities: specialized detectors offer…

Read Paper →

Computer Science Preprint PDF DOI

When and How AI Should Assist Brainstorming for AI Impact Assessment

Jarod Govers, Sanja Scepanovic, Daniele Quercia · 2026

A key task in AI practice is to assess potential impacts to prevent harm. Current AI tools assisting AI impact assessment have not been designed or evaluated for collaborative team brainstorming, and …

Read Paper →

Physics Preprint PDF DOI

International Optical Clock Comparison Using the European Optical Fiber Network

Marco Pizzocaro, Clara Zyskind, Anne Amy-Klein, Erik Benkler, Sebastien Bize, Davide Calonico, Etienne Cantin, Christian Chardonnet, Cecilia Clivati, Stefano Condio, E. Anne Curtis, Simone Donadello, Soren Dorscher, Chen-Hao Feng, Melina Filzinger, Jacques-Olivier Gaudron, Rachel M. Godun, Irene Goti, Ian R. Hill, Wei Huang, Nils Huntemann, Matthew Johnson, Joshua Klose, Jochen Kronjager, Alexander Kuhl, Rodolphe Le Targat, Filippo Levi, Burghard Lipphardt, Christian Lisdat, Jerome Lodewyck, Olivier Lopez, Helen S. Margolis, Maxime Mazouth-Laurol, Alberto Mura, Benjamin Pointard, Paul-Eric Pottie, Matias Risaro, Billy I. Robertson, Marco Schioppo, Kilian Stahl, Martin Steinel, Alexandra Tofful, Mads T{o}nnes, Jacob Tunes · 2026

Optical clocks have achieved remarkable estimated fractional frequency uncertainties reaching the $10^{-18}$ level and below, enabling applications in fundamental physics, general relativity, and geod…

Read Paper →

AI & Data Science Preprint PDF DOI

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Junan Hu, Jian Liu, Jingxiang Lai, Jiarui Hu, Yiwei Sheng, Shuang Chen, Jian Li, Dazhao Du, Song Guo · 2026

Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone canno…

Read Paper →

AI & Data Science Preprint PDF DOI

Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity Reconstruction

Shipeng Liu, Liang Zhao, Dengfeng Chen, Zhanping Song · 2026

Tunnel inspection requires outputs that can support defect localization, measurement, severity grading, and engineering documentation. Existing training-free foundation-model pipelines usually stop at…

Read Paper →

AI & Data Science Preprint PDF DOI

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

Sihong Wu, Owen Jiang, Yilun Zhao, Tiansheng Hu, Yiling Ma, Kaiyan Zhang, Manasi Patwardhan, Arman Cohan · 2026

Peer review is a multi-stage process involving reviews, rebuttals, meta-reviews, final decisions, and subsequent manuscript revisions. Recent advances in large language models (LLMs) have motivated me…

Read Paper →

Computer Science Preprint PDF DOI

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

Jin Xin Ng, Ori Livneh, Richard O'Grady, Josh Don, Peng Ding, Samuel Grossman, Luis Otero, Chris Kennelly, David Lo, Carlos Villavieja · 2026

Modern large multicore systems often run multiple workloads that share CPUs under schedulers such as Linux CFS. To keep CPUs busy, these schedulers load-balance runnable work, causing each workload to…

Read Paper →

Physics Preprint PDF DOI

The Complex Structure of the Abell 548 - Abell 3367 Region

Mark J. Henriksen, Layla Ahmed · 2026

Archival XMM and ROSAT X-ray data are used to investigate the structure of the Abell 548 - Abell 3367 region. Based on previous optical studies, this is a region likely to be rich in structure though …

Read Paper →

Physics Preprint PDF DOI

Acoustic modulation of shear thickening transition in dense adhesive suspensions

Aoxuan Wang, Fabrice Toussaint, Thomas Gibaud · 2026

Discontinuous shear thickening (DST) in dense suspensions leads to flow instabilities that limit processing in many systems. While high-power ultrasound has been reported to reduce the apparent viscos…

Read Paper →

AI & Data Science Preprint PDF DOI

Frequency-Aware Semantic Fusion with Gated Injection for AI-generated Image Detection

Shuchang Zhou, Shangkun Wu, Jiwei Wei, Ke Liu, Ran Ran, Caiyan Qin, Yang Yang · 2026

AI-generated images are becoming increasingly realistic and diverse, posing significant challenges for generalizable detection. While Vision Foundation Models (VFMs) provide rich semantic representati…

Read Paper →

AI & Data Science Preprint PDF DOI

Reasoning over Object Descriptions Improves Coreference Resolution in Task-Based Dialogue Systems

Oier Ijurco, Oier Lopez de Lacalle · 2026

Task-based dialogue systems assist users in achieving specific goals, such as executing actions or retrieving information, through natural language interactions. Accurate coreference resolution is ess…

Read Paper →

Browse Research Papers

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Towards Systematics of Calabi-Yau Landscape for String Cosmology

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Uniaxial strain-driven ferroelastic domain control in LaAlO3

PhyCo: Learning Controllable Physical Priors for Generative Motion

PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning

Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression

3D Reconstruction Techniques in the Manufacturing Domain: Applications, Research Opportunities and Use Cases

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Echo-{\alpha}: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

When and How AI Should Assist Brainstorming for AI Impact Assessment

International Optical Clock Comparison Using the European Optical Fiber Network

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity Reconstruction

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

Affinity Tailor: Dynamic Locality-Aware Scheduling at Scale

The Complex Structure of the Abell 548 - Abell 3367 Region

Acoustic modulation of shear thickening transition in dense adhesive suspensions

Frequency-Aware Semantic Fusion with Gated Injection for AI-generated Image Detection

Reasoning over Object Descriptions Improves Coreference Resolution in Task-Based Dialogue Systems

Browse by Category

Research Type

Publish Your Research