Samira Shaikh — Research Repository

Engineering Preprint PDF DOI

OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction

Junyoung Lee, Sookwan Han, Jeonghwan Kim, Inhee Lee, Mingi Choi, Jisoo Kim, Wonjung Woo, Hanbyul Joo · 2026

Human-robot collaboration has been studied primarily in dyadic or sequential settings. However, real homes require multiadic collaboration, where multiple humans and robots share a workspace, acting c…

Read Paper →

AI & Data Science Preprint PDF DOI

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images

Vinayak Gupta, Chih-Hao Lin, Shenlong Wang, Anand Bhattad, Jia-Bin Huang · 2026

Reconstructing 3D scenes from sparse, unposed images remains challenging under real-world conditions with varying illumination and transient occlusions. Existing methods rely on scene-specific optimiz…

Read Paper →

AI & Data Science Preprint PDF DOI

Beyond Gaussian Bottlenecks: Topologically Aligned Encoding of Vision-Transformer Feature Spaces

Andrew Bond, Ilkin Umut Melanlioglu, Erkut Erdem, Aykut Erdem · 2026

Modern visual world modeling systems increasingly rely on high-capacity architectures and large-scale data to produce plausible motion, yet they often fail to preserve underlying 3D geometry or physic…

Read Paper →

Engineering Preprint PDF DOI

FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction

Zeyu Jiang, Changqing Zhou, Xingxing Zuo, Changhao Chen · 2026

Existing learning-based occupancy prediction methods rely on large-scale 3D annotations and generalize poorly across environments. We present FreeOcc, a training-free framework for open-vocabulary occ…

Read Paper →

Physics Preprint PDF DOI

Low-cost passive single-shot ultrafast imaging at 685 Gfps

Dilem Eslik, Bahad{i}r Utku Kesgin, Ugur Tegin · 2026

Capturing ultrafast transient phenomena conventionally requires streak cameras or computational imaging based on compressed sensing, which lead to complex and costly systems. In this Letter, we demons…

Read Paper →

AI & Data Science Preprint PDF DOI

The TEA Nets framework combines AI and cognitive network science to model targets, events and actors in text

Sebastiano Franchini, Alexis Carrillo, Edoardo Sebastiano De Duro, Riccardo Improta, Ali Aghazadeh Ardebili, Massimo Stella · 2026

We introduce Target-Event-Agent Networks (TEA Nets) as a computational framework to extract subjects (``Agents"), verbs (``Events"), and objects (``Targets") from texts. Grounded in cognitive network …

Read Paper →

Computer Science Preprint PDF DOI

Unified 5G-IoT Framework with CAMARA Gateways and SDN Federation

Zihan Jia, Ze Wang, Chen Chen, Ziren Xiao, Fung Po Tso · 2026

The convergence of 5G and IoT enables fully connected, intelligent environments, but it faces challenges from the fragmentation of public/private 5G networks and the heterogeneity of IoT networks. We …

Read Paper →

AI & Data Science Preprint PDF DOI

Learning from a single labeled face and a stream of unlabeled data

Branislav Kveton, Michal Valko · 2026

Face recognition from a single image per person is a challenging problem because the training sample is extremely small. We consider a variation of this problem. In our problem, we recognize only one …

Read Paper →

Physics Preprint PDF DOI

X-Ray Spectral Variability of the TeV HBL Blazar PG 1553+113 with XMM-Newton

P. U. Devanand, Alok C. Gupta, Paul J. Wiita, V. Jithesh, Archana Gupta · 2026

We present an extensive X-ray spectral variability study of the TeV photon-emitting high-energy-peaked BL Lacertae object PG 1553+113, using the data from EPIC-PN camera of XMM-Newton, which observed …

Read Paper →

AI & Data Science Preprint PDF DOI

LA-Pose: Latent Action Pretraining Meets Pose Estimation

Zhengqing Wang, Saurabh Nair, Prajwal Chidananda, Pujith Kachana, Samuel Li, Matthew Brown, Yasutaka Furukawa · 2026

This paper revisits camera pose estimation through the lens of self-supervised pretraining, focusing on inverse-dynamics pretraining as a scalable alternative to the current trend of fully supervised …

Read Paper →

Physics Preprint PDF DOI

Dual role of core electrons in electronic friction

Runfeng Zhou, Emilio Artacho · 2026

Non-equilibrium energy dissipation in multi-shell swift-ion/matter systems remains a fundamental yet incompletely understood problem, with electronic stopping power $\mathcal{S}_\text{e}$ as a relevan…

Read Paper →

Physics Preprint PDF DOI

Six New Variable Stars Discovered from Ground-Based Photometry and Characterized with TESS Data

Maksym Yu. Pyatnytskyy · 2026

We report the discovery of six new variable stars identified through an exploratory analysis of several sky fields observed by the author using a small telescope and a CMOS camera. The search employed…

Read Paper →

AI & Data Science Preprint PDF DOI

Automated Detection of Mutual Gaze and Joint Attention in Dual-Camera Settings via Dual-Stream Transformers

Jakub Kosmydel, Pawe{l} Gajewski, Arkadiusz Bia{l}ek · 2026

Analyzing mutual gaze (MG) and joint attention (JA) is critical in developmental psychology but traditionally relies on labor-intensive manual coding. Automating this process in multi-camera laborator…

Read Paper →

AI & Data Science Preprint PDF DOI

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Wanyue Zhang, Wenxiang Wu, Wang Xu, Jiaxin Luo, Helu Zhi, Yibin Huang, Shuo Ren, Zitao Liu, Jiajun Zhang · 2026

Vision-language models (VLMs) have shown strong performance on static visual understanding, yet they still struggle with dynamic spatial reasoning that requires imagining how scenes evolve under egoce…

Read Paper →

AI & Data Science Preprint PDF DOI

Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction

David Novikov, Eilon Vaknin, Narek Tumanyan, Mark Sheinin · 2026

The task of capturing and rendering 3D dynamic scenes from 2D images has become increasingly popular in recent years. However, most conventional cameras are bandwidth-limited to 30-60 FPS, restricting…

Read Paper →

AI & Data Science Preprint PDF DOI

SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset

Changhyun Roh, Yonghyun Jeong, Jonghyun Lee, Chanho Eom, Jihyong Oh · 2026

Synthesizing a target concept from a single reference image is challenging in diffusion-based personalized text-to-image generation, particularly for sticker personalization where prompts often requir…

Read Paper →

AI & Data Science Preprint PDF DOI

Bridge: Basis-Driven Causal Inference Marries VFMs for Domain Generalization

Mingbo Hong, Feng Liu, Caroline Gevaert, George Vosselman, Hao Cheng · 2026

Detectors often suffer from degraded performance, primarily due to the distributional gap between the source and target domains. This issue is especially evident in single-source domains with limited …

Read Paper →

Computer Science Preprint PDF DOI

Distributed Multi-View Vision-Only RSSI Estimation

Jung-Beom Kim, Woongsup Lee · 2026

Received Signal Strength Indicator (RSSI) estimation is essential for wireless link management, yet conventional feedback-based approaches incur uplink overhead, suffer from measurement instability, a…

Read Paper →

AI & Data Science Preprint PDF DOI

CO-EVO: Co-evolving Semantic Anchoring and Style Diversification for Federated DG-ReID

Fengchun Zhang, Qiang Ma, Liuyu Xiang, Jinshan Lai, Tingxuan Huang, Jianwei Hu · 2026

Federated domain generalization for person re-identification (FedDG-ReID) aims to collaboratively train a pedestrian retrieval model across multiple decentralized source domains such that it can gener…

Read Paper →

AI & Data Science Preprint PDF DOI

Multiple Consistent 2D-3D Mappings for Robust Zero-Shot 3D Visual Grounding

Yufei Yin, Jie Zheng, Qianke Meng, Zhou Yu, Minghao Chen, Jiajun Ding, Min Tan, Yuling Xi, Zhiwen Chen, Chengfei Lv · 2026

Zero-shot 3D Visual Grounding (3DVG) is a critical capability for open-world embodied AI. However, existing methods are fundamentally bottlenecked by the poor quality of open-vocabulary 3D proposals, …

Read Paper →

Browse Research Papers

OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images

Beyond Gaussian Bottlenecks: Topologically Aligned Encoding of Vision-Transformer Feature Spaces

FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction

Low-cost passive single-shot ultrafast imaging at 685 Gfps

The TEA Nets framework combines AI and cognitive network science to model targets, events and actors in text

Unified 5G-IoT Framework with CAMARA Gateways and SDN Federation

Learning from a single labeled face and a stream of unlabeled data

X-Ray Spectral Variability of the TeV HBL Blazar PG 1553+113 with XMM-Newton

LA-Pose: Latent Action Pretraining Meets Pose Estimation

Dual role of core electrons in electronic friction

Six New Variable Stars Discovered from Ground-Based Photometry and Characterized with TESS Data

Automated Detection of Mutual Gaze and Joint Attention in Dual-Camera Settings via Dual-Stream Transformers

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction

SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset

Bridge: Basis-Driven Causal Inference Marries VFMs for Domain Generalization

Distributed Multi-View Vision-Only RSSI Estimation

CO-EVO: Co-evolving Semantic Anchoring and Style Diversification for Federated DG-ReID

Multiple Consistent 2D-3D Mappings for Robust Zero-Shot 3D Visual Grounding

Browse by Category

Research Type

Publish Your Research