Memory in Engineering — Research Repository

Engineering Preprint PDF DOI

Two-Phase Cell Switching in 6G vHetNets: Sleeping-Cell Load Estimation and Renewable-Aware Switching Toward NES

Maryam Salamatmoghadasi, Metin Ozturk, Halim Yanikomeroglu · 2026

This paper proposes a two phase framework to improve the sustainability in vertical heterogeneous networks that integrate various types of base stations~(BSs), including terrestrial macro BSs~(MBSs), …

Read Paper →

Engineering Preprint PDF DOI

Topological Motion Planning Diffusion: Generative Tangle-Free Path Planning for Tethered Robots in Obstacle-Rich Environments

Yifu Tian, Xinhang Xu, Thien-Minh Nguyen, Muqing Cao · 2026

In extreme environments such as underwater exploration and post-disaster rescue, tethered robots require continuous navigation while avoiding cable entanglement. Traditional planners struggle in these…

Read Paper →

Engineering Preprint PDF DOI

How Attention Shapes Emotion: A Comparative Study of Attention Mechanisms for Speech Emotion Recognition

Marc Casals-Salvador, Federico Costa, Rodolfo Zevallos, Javier Hernando · 2026

Speech Emotion Recognition (SER) plays a key role in advancing human-computer interaction. Attention mechanisms have become the dominant approach for modeling emotional speech due to their ability to …

Read Paper →

Engineering Preprint PDF DOI

Dual-Domain Sparse Adaptive Filtering: Exploiting Error Memory for Improved Performance

Mohammad Salman, Hadi Zayyani, Felipe A. P. de Figueiredo, Hasan Abu Hilal, Mostafa Rashdan · 2026

Many signal processing applications such as acoustic echo cancellation and wireless channel estimation require identifying systems where only a small fraction of coefficients are actually active, i.e.…

Read Paper →

Engineering Preprint PDF DOI

On the Degrees of Freedom of Gridded Control Points in Learning-Based Medical Image Registration

Wen Yan, Qianye Yang, Yipei Wang, Shonit Punwani, Mark Emberton, Vasilis Stavrinides, Yipeng Hu, Dean Barratt · 2026

Many registration problems are ill-posed in homogeneous or noisy regions, and dense voxel-wise decoders can be unnecessarily high-dimensional. A sparse control-point parameterisation provides a compac…

Read Paper →

Engineering Preprint PDF DOI

OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism

Xiangyu Li, Huaizhi Tang, Xin Ding, Weijun Wang, Ting Cao, Yunxin Liu · 2026

Embodied AI agents increasingly require parallel execution of multiple tasks, such as manipulation, conversation, and memory construction, from shared observations under distinct time constraints. Rec…

Read Paper →

Engineering Preprint PDF DOI

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression

Junqi Shi, Ming Lu, Xingchen Li, Anle Ke, Ruiqi Zhang, Zhan Ma · 2026

Diffusion-based image compression has recently shown outstanding perceptual fidelity, yet its practicality is hindered by prohibitive sampling overhead and high memory usage. Most existing diffusion c…

Read Paper →

Engineering Preprint PDF DOI

DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation

Zihao Xin, Wentong Li, Yixuan Jiang, Bin Wang, Runmin Cong, Jie Qin, Shengjun Huang · 2026

Vision-and-Language Navigation (VLN) requires agents to follow long-horizon instructions and navigate complex 3D environments. However, existing approaches face two major challenges: constructing an e…

Read Paper →

Engineering Preprint PDF DOI

ReMem-VLA: Empowering Vision-Language-Action Model with Memory via Dual-Level Recurrent Queries

Hang Li, Fengyi Shen, Dong Chen, Liudi Yang, Xudong Wang, Jinkui Shi, Zhenshan Bing, Ziyuan Liu, Alois Knoll · 2026

Vision-language-action (VLA) models for closed-loop robot control are typically cast under the Markov assumption, making them prone to errors on tasks requiring historical context. To incorporate memo…

Read Paper →

Engineering Preprint PDF DOI

RoboStream: Weaving Spatio-Temporal Reasoning with Memory in Vision-Language Models for Robotics

Yuzhi Huang, Jie Wu, Weijue Bu, Ziyi Xiong, Gaoyang Jiang, Ye Li, Kangye Ji, Shuzhao Xie, Yue Huang, Chenglei Wu, Jingyan Jiang, Zhi Wang · 2026

Enabling reliable long-horizon robotic manipulation is a crucial step toward open-world embodied intelligence. However, VLM-based planners treat each step as an isolated observation-to-action mapping,…

Read Paper →

Engineering Preprint PDF DOI

Consistent and Efficient MSCKF-based LiDAR-Inertial Odometry with Inferred Cluster-to-Plane Constraints for UAVs

Jinwen Zhu, Xudong Zhao, Fangcheng Zhu, Jun Hu, Shi Jin, Yinian Mao, Guoquan Huang · 2026

Robust and accurate navigation is critical for Unmanned Aerial Vehicles (UAVs) especially for those with stringent Size, Weight, and Power (SWaP) constraints. However, most state-of-the-art (SOTA) LiD…

Read Paper →

Engineering Preprint PDF DOI

AnchorVLA4D: an Anchor-Based Spatial-Temporal Vision-Language-Action Model for Robotic Manipulation

Juan Zhu, Zhanying Shao, Xiaoqi Li, Ethan Morgan, Jiadong Xu, Hongwei Fan, Hao Dong · 2026

Since current Vision-Language-Action (VLA) systems suffer from limited spatial perception and the absence of memory throughout manipulation, we investigate visual anchors as a means to enhance spatial…

Read Paper →

Engineering Preprint PDF DOI

BrainMem: Brain-Inspired Evolving Memory for Embodied Agent Task Planning

Xiaoyu Ma, Lianyu Hu, Wenbing Tang, Zixuan Hu, Zeqin Liao, Zhizhen Wu, Yang Liu · 2026

Embodied task planning requires agents to execute long-horizon, goal-directed actions in complex 3D environments, where success depends on both immediate perception and accumulated experience across t…

Read Paper →

Engineering Preprint PDF DOI

Online Learning of Strategic Defense against Ecological Adversaries under Partial Observability with Semi-Bandit Feedback

Anjali Purathekandy, Deepak N. Subramani · 2026

We introduce an online learning algorithm for computing adaptive resource allocation policies against strategic ecological adversaries with unknown behavioral models and partial observability. Our set…

Read Paper →

Engineering Preprint PDF DOI

NFPO: Stabilized Policy Optimization of Normalizing Flow for Robotic Policy Learning

Diyuan Shi, Yiqi Tang, Zifeng Zhuang, Donglin Wang · 2026

Deep Reinforcement Learning (DRL) has experienced significant advancements in recent years and has been widely used in many fields. In DRL-based robotic policy learning, however, current de facto poli…

Read Paper →

Engineering Preprint PDF DOI

Cough activity detection for automatic tuberculosis screening

Joshua Jansen van Vuren, Devendra Singh Parihar, Daphne Naidoo, Kimsey Zajac, Willy Ssengooba, Grant Theron, Thomas Niesler · 2026

The automatic identification of cough segments in audio through the determination of start and end points is pivotal to building scalable screening tools in health technologies for pulmonary related d…

Read Paper →

Engineering Preprint PDF DOI

A Control-Theoretic Foundation for Agentic Systems

Ali Eslami, Jiangbo Yu · 2026

This paper develops a control-theoretic framework for analyzing agentic systems embedded within feedback control loops, where an AI agent may adapt controller parameters, select among control strategi…

Read Paper →

Engineering Preprint PDF DOI

Parallel-in-Time Nonlinear Optimal Control via GPU-native Sequential Convex Programming

Yilin Zou, Zhong Zhang, Maxime Robic, Fanghua Jiang · 2026

Real-time trajectory optimization for nonlinear constrained autonomous systems is critical and typically performed by CPU-based sequential solvers. Specifically, reliance on global sparse linear algeb…

Read Paper →

Engineering Preprint PDF DOI

OnFly: Onboard Zero-Shot Aerial Vision-Language Navigation toward Safety and Efficiency

Guiyong Zheng, Yueting Ban, Mingjie Zhang, Juepeng Zheng, Boyu Zhou · 2026

Aerial vision-language navigation (AVLN) enables UAVs to follow natural-language instructions in complex 3D environments. However, existing zero-shot AVLN methods often suffer from unstable single-str…

Read Paper →

Engineering Preprint PDF DOI

AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory

Lianjie Ma, Yuquan Li, Bingzheng Jiang, Ziming Zhong, Han Ding, Lijun Zhu · 2026

Foundation-model-based monocular depth estimation offers a viable alternative to active sensors for robot perception, yet its computational cost often prohibits deployment on edge platforms. Existing …

Read Paper →

Browse Research Papers

Two-Phase Cell Switching in 6G vHetNets: Sleeping-Cell Load Estimation and Renewable-Aware Switching Toward NES

Topological Motion Planning Diffusion: Generative Tangle-Free Path Planning for Tethered Robots in Obstacle-Rich Environments

How Attention Shapes Emotion: A Comparative Study of Attention Mechanisms for Speech Emotion Recognition

Dual-Domain Sparse Adaptive Filtering: Exploiting Error Memory for Improved Performance

On the Degrees of Freedom of Gridded Control Points in Learning-Based Medical Image Registration

OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression

DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation

ReMem-VLA: Empowering Vision-Language-Action Model with Memory via Dual-Level Recurrent Queries

RoboStream: Weaving Spatio-Temporal Reasoning with Memory in Vision-Language Models for Robotics

Consistent and Efficient MSCKF-based LiDAR-Inertial Odometry with Inferred Cluster-to-Plane Constraints for UAVs

AnchorVLA4D: an Anchor-Based Spatial-Temporal Vision-Language-Action Model for Robotic Manipulation

BrainMem: Brain-Inspired Evolving Memory for Embodied Agent Task Planning

Online Learning of Strategic Defense against Ecological Adversaries under Partial Observability with Semi-Bandit Feedback

NFPO: Stabilized Policy Optimization of Normalizing Flow for Robotic Policy Learning

Cough activity detection for automatic tuberculosis screening

A Control-Theoretic Foundation for Agentic Systems

Parallel-in-Time Nonlinear Optimal Control via GPU-native Sequential Convex Programming

OnFly: Onboard Zero-Shot Aerial Vision-Language Navigation toward Safety and Efficiency

AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory

Browse by Category

Research Type

Publish Your Research