Memory in Engineering — Research Repository

Engineering Preprint PDF DOI

LatentAM: Real-Time, Large-Scale Latent Gaussian Attention Mapping via Online Dictionary Learning

Junwoon Lee, Yulun Tian · 2026

We present LatentAM, an online 3D Gaussian Splatting (3DGS) mapping framework that builds scalable latent feature maps from streaming RGB-D observations for open-vocabulary robotic perception. Instead…

Read Paper →

Engineering Preprint PDF DOI

3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting

Wancai Zheng, Hao Chen, Xianlong Lu, Linlin Ou, Xinyi Yu · 2026

Object navigation is a core capability of embodied intelligence, enabling an agent to locate target objects in unknown environments. Recent advances in vision-language models (VLMs) have facilitated z…

Read Paper →

Engineering Preprint PDF DOI

Multi Graph Search for High-Dimensional Robot Motion Planning

Itamar Mishani, Maxim Likhachev · 2026

Efficient motion planning for high-dimensional robotic systems, such as manipulators and mobile manipulators, is critical for real-time operation and reliable deployment. Although advances in planning…

Read Paper →

Engineering Preprint PDF DOI

LAMP: Implicit Language Map for Robot Navigation

Sibaek Lee, Hyeonwoo Yu, Giseop Kim, Sunwook Choi · 2026

Recent advances in vision-language models have made zero-shot navigation feasible, enabling robots to follow natural language instructions without requiring labeling. However, existing methods that ex…

Read Paper →

Engineering Preprint PDF DOI

ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation

Zedong Chu, Shichao Xie, Xiaolong Wu, Yanfen Shen, Minghua Luo, Zhengbo Wang, Fei Liu, Xiaoxu Leng, Junjun Hu, Mingyang Yin, Jia Lu, Yingnan Guo, Kai Yang, Jiawei Han, Xu Chen, Yanqing Zhu, Yuxiang Zhao, Xin Liu, Yirong Yang, Ye He, Jiahang Wang, Yang Cai, Tianlin Zhang, Li Gao, Liu Liu, Mingchao Sun, Fan Jiang, Chiyu Wang, Zhicheng Liu, Hongyu Pan, Honglin Han, Zhining Gu, Kuan Yang, Jianfang Zhang, Di Jing, Zihao Guan, Wei Guo, Guoqing Liu, Di Yang, Xiangpo Yang, Menglin Yang, Hongguang Xing, Weiguo Li, Mu Xu · 2026

Embodied navigation has long been fragmented by task-specific architectures. We introduce ABot-N0, a unified Vision-Language-Action (VLA) foundation model that achieves a ``Grand Unification'' across …

Read Paper →

Engineering Preprint PDF DOI

TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR

Qingshun She, Jing Peng, Yangui Fang, Yu Xi, Kai Yu · 2026

This work investigates bidirectional Mamba (BiMamba) for unified streaming and non-streaming automatic speech recognition (ASR). Dynamic chunk size training enables a single model for offline decoding…

Read Paper →

Engineering Preprint PDF DOI

SceneVGGT: VGGT-based online 3D semantic SLAM for indoor scene understanding and navigation

Anna Gelencser-Horvath, Gergely Dinya, Dorka Boglarka Eros, Peter Halasz, Islam Muhammad Muqsit, Kristof Karacs · 2026

We present SceneVGGT, a spatio-temporal 3D scene understanding framework that combines SLAM with semantic mapping for autonomous and assistive navigation. Built on VGGT, our method scales to long vide…

Read Paper →

Engineering Preprint PDF DOI

Developing Neural Network-Based Gaze Control Systems for Social Robots

Ramtin Tabatabaei, Alireza Taheri · 2026

During multi-party interactions, gaze direction is a key indicator of interest and intent, making it essential for social robots to direct their attention appropriately. Understanding the social conte…

Read Paper →

Engineering Preprint PDF DOI

Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning

Zixuan Wang, Huang Fang, Shaoan Wang, Yuanfei Luo, Heng Dong, Wei Li, Yiming Gan · 2026

While large vision-language models (VLMs) show promise for object goal navigation, current methods still struggle with low success rates and inefficient localization of unseen objects--failures primar…

Read Paper →

Engineering Preprint PDF DOI

TVTSyn: Content-Synchronous Time-Varying Timbre for Streaming Voice Conversion and Anonymization

Waris Quamer, Mu-Ruei Tseng, Ghady Nasrallah, Ricardo Gutierrez-Osuna · 2026

Real-time voice conversion and speaker anonymization require causal, low-latency synthesis without sacrificing intelligibility or naturalness. Current systems have a core representational mismatch: co…

Read Paper →

Engineering Preprint PDF DOI

STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory

Mingfeng Yuan, Hao Zhang, Mahan Mohammadi, Runhao Li, Jinjun Shan, Steven L. Waslander · 2026

Mobile robots are often deployed over long durations in diverse open, dynamic scenes, including indoor setting such as warehouses and manufacturing facilities, and outdoor settings such as agricultura…

Read Paper →

Engineering Preprint PDF DOI

Improving Reliability of Hybrid Bit-Semantic Communications for Cellular Networks

Nikos G. Evgenidis, Sotiris A. Tegos, Panagiotis D. Diamantoulakis, Ioannis Krikidis, George K. Karagiannidis · 2026

Semantic communications (SemComs) have been considered as a promising solution to reduce the amount of transmitted information, thus paving the way for more energy-and spectrum-efficient wireless netw…

Read Paper →

Engineering Preprint PDF DOI

Trajectory Stitching for Solving Inverse Problems with Flow-Based Models

Alexander Denker, Moshe Eliasof, Zeljko Kereta, Carola-Bibiane Schonlieb · 2026

Flow-based generative models have emerged as powerful priors for solving inverse problems. One option is to directly optimize the initial latent code (noise), such that the flow output solves the inve…

Read Paper →

Engineering Preprint PDF DOI

Reconfigurable Low-Complexity Architecture for High Resolution Doppler Velocity Estimation in Integrated Sensing and Communication System

Aakanksha Tewari, Samarth Sharma Bhardwaj, Sumit J Darak, Shobha Sundar Ram · 2026

In millimeter wave integrated sensing and communication (ISAC) systems for intelligent transportation, radar and communication share spectrum and hardware in a time division manner. Radar rapidly dete…

Read Paper →

Engineering Preprint PDF DOI

Vec-QMDP: Vectorized POMDP Planning on CPUs for Real-Time Autonomous Driving

Xuanjin Jin, Yanxin Dong, Bin Sun, Huan Xu, Zhihui Hao, XianPeng Lang, Panpan Cai · 2026

Planning under uncertainty for real-world robotics tasks, such as autonomous driving, requires reasoning in enormous high-dimensional belief spaces, rendering the problem computationally intensive. Wh…

Read Paper →

Engineering Preprint PDF DOI

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Yalcin Tur, Jalal Naghiyev, Haoquan Fang, Wei-Chuan Tsai, Jiafei Duan, Dieter Fox, Ranjay Krishna · 2026

Current Vision-Language-Action (VLA) models rely on fixed computational depth, expending the same amount of compute on simple adjustments and complex multi-step manipulation. While Chain-of-Thought (C…

Read Paper →

Engineering Preprint PDF DOI

Deep Neural Network Architectures for Electrocardiogram Classification: A Comprehensive Evaluation

Yun Song, Wenjia Zheng, Tiedan Chen, Ziyu Wang, Jiazhao Shi, Yisong Chen · 2026

With the rising prevalence of cardiovascular diseases, electrocardiograms (ECG) remain essential for the non-invasive detection of cardiac abnormalities. This study presents a comprehensive evaluation…

Read Paper →

Engineering Preprint PDF DOI

Wavelet-Domain Masked Image Modeling for Color-Consistent HDR Video Reconstruction

Yang Zhang, Zhangkai Ni, Wenhan Yang, Hanli Wang · 2026

High Dynamic Range (HDR) video reconstruction aims to recover fine brightness, color, and details from Low Dynamic Range (LDR) videos. However, existing methods often suffer from color inaccuracies an…

Read Paper →

Engineering Preprint PDF DOI

Wildfire Simulation with Differentiable Randers-Finsler Eikonal Solvers

Barak Gahtan, Jacob Shpund, Alex M. Bronstein · 2026

Fast and differentiable solvers for anisotropic and asymmetric distance fields are a key primitive in geometry processing, enabling gradient-based optimization over metrics, drift fields, and downstre…

Read Paper →

Engineering Preprint PDF DOI

Wireless Context Engineering for Efficient Mobile Agentic AI and Edge General Intelligence

Changyuan Zhao, Jiacheng Wang, Yunting Xu, Geng Sun, Dusit Niyato, Zan Li, Abbas Jamalipour, Dong In Kim · 2026

Future wireless networks demand increasingly powerful intelligence to support sensing, communication, and autonomous decision-making. While scaling laws suggest improving performance by enlarging mode…

Read Paper →

Browse Research Papers

LatentAM: Real-Time, Large-Scale Latent Gaussian Attention Mapping via Online Dictionary Learning

3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting

Multi Graph Search for High-Dimensional Robot Motion Planning

LAMP: Implicit Language Map for Robot Navigation

ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation

TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR

SceneVGGT: VGGT-based online 3D semantic SLAM for indoor scene understanding and navigation

Developing Neural Network-Based Gaze Control Systems for Social Robots

Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning

TVTSyn: Content-Synchronous Time-Varying Timbre for Streaming Voice Conversion and Anonymization

STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory

Improving Reliability of Hybrid Bit-Semantic Communications for Cellular Networks

Trajectory Stitching for Solving Inverse Problems with Flow-Based Models

Reconfigurable Low-Complexity Architecture for High Resolution Doppler Velocity Estimation in Integrated Sensing and Communication System

Vec-QMDP: Vectorized POMDP Planning on CPUs for Real-Time Autonomous Driving

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Deep Neural Network Architectures for Electrocardiogram Classification: A Comprehensive Evaluation

Wavelet-Domain Masked Image Modeling for Color-Consistent HDR Video Reconstruction

Wildfire Simulation with Differentiable Randers-Finsler Eikonal Solvers

Wireless Context Engineering for Efficient Mobile Agentic AI and Edge General Intelligence

Browse by Category

Research Type

Publish Your Research