Expertini Research Research

Browse Research Papers

30+ open-access research outputs.

โœ• Clear
๐Ÿ” xiaomo jiang ๐Ÿ“‚ Engineering
Showing 30 results for "xiaomo jiang" in Engineering
Engineering Preprint PDF DOI

CAR-EnKF: A Covariance-Adaptive and Recalibrated Ensemble Kalman Filter Framework

Shida Jiang, Shengyu Tao, Zihe Liu, Scott Moura ยท 2026

The ensemble Kalman filter (EnKF) is widely used for nonlinear and high-dimensional state estimation because it replaces complex covariance propagation with simple ensemble statistics. However, convenโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

ACAVCaps: Enabling large-scale training for fine-grained and diverse audio understanding

Yadong Niu, Tianzi Wang, Heinrich Dinkel, Xingwei Sun, Jiahao Zhou, Gang Li, Jizhong Liu, Junbo Zhang, Jian Luan ยท 2026

General audio understanding is a fundamental goal for large audio-language models, with audio captioning serving as a cornerstone task for their development. However, progress in this domain is hinderโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Design Guidelines for Nonlinear Kalman Filters via Covariance Compensation

Shida Jiang, Jaewoong Lee, Shengyu Tao, Scott Moura ยท 2026

Nonlinear extensions of the Kalman filter (KF), such as the extended Kalman filter (EKF) and the unscented Kalman filter (UKF), are indispensable for state estimation in complex dynamical systems, yetโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Physics-Informed Anomaly Detection of Terrain Material Change in Radar Imagery

Abdel Hakiem Mohamed Abbas Mohamed Ahmed, Beth Jelfs, Airlie Chapman, Eric Schoof, Christopher Gilliam ยท 2026

In this paper we consider physics-informed detection of terrain material change in radar imagery (e.g., shifts in permittivity, roughness or moisture). We propose a lightweight electromagnetic (EM) foโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Rui Cai, Jun Guo, Xinze He, Piaopiao Jin, Jie Li, Bingxuan Lin, Futeng Liu, Wei Liu, Fei Ma, Kun Ma, Feng Qiu, Heng Qu, Yifei Su, Qiao Sun, Dong Wang, Donghao Wang, Yunhong Wang, Rujie Wu, Diyun Xiang, Yu Yang, Hangjun Ye, Yuan Zhang, Quanyun Zhou ยท 2026

In this report, we introduce Xiaomi-Robotics-0, an advanced vision-language-action (VLA) model optimized for high performance and fast and smooth real-time execution. The key to our method lies in a cโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Rapid stabilization of the heat equation with localized disturbance

Patricio Guzman, Hugo Parada (SPHINX, IECL), Christian Calle-Cardenas ยท 2025

This paper studies the rapid stabilization of a multidimensional heat equation in the presence of an unknown spatially localized disturbance. A novel multivalued feedback control strategy is proposed,โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

MS-PPO: Morphological-Symmetry-Equivariant Policy for Legged Robot Locomotion

Sizhe Wei, Xulin Chen, Fengze Xie, Garrett Ethan Katz, Zhenyu Gan, Lu Gan ยท 2025

Reinforcement learning has recently enabled impressive locomotion capabilities on legged robots; however, most policy architectures remain morphology- and symmetry-agnostic, leading to inefficient traโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Team Xiaomi EV-AD VLA: Caption-Guided Retrieval System for Cross-Modal Drone Navigation -- Technical Report for IROS 2025 RoboSense Challenge Track 4

Lingfeng Zhang, Erjia Xiao, Yuchen Zhang, Haoxiang Fu, Ruibin Hu, Yanbiao Ma, Wenbo Ding, Long Chen, Hangjun Ye, Xiaoshuai Hao ยท 2025

Cross-modal drone navigation remains a challenging task in robotics, requiring efficient retrieval of relevant images from large-scale databases based on natural language descriptions. The RoboSense 2โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

FARM: Frame-Accelerated Augmentation and Residual Mixture-of-Experts for Physics-Based High-Dynamic Humanoid Control

Tan Jing, Shiting Chen, Yangfan Li, Weisheng Xu, Renjing Xu ยท 2025

Unified physics-based humanoid controllers are pivotal for robotics and character animation, yet models that excel on gentle, everyday motions still stumble on explosive actions, hampering real-world โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

InnerGS: Internal Scenes Reconstruction and Segmentation via Factorized 3D Gaussian Splatting

Shuxin Liang, Yihan Xiao, Wenlu Tang ยท 2025

3D Gaussian Splatting (3DGS) has recently gained popularity for efficient scene rendering by representing scenes as explicit sets of anisotropic 3D Gaussians. However, most existing work focuses primaโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks

Yadong Niu, Tianzi Wang, Heinrich Dinkel, Xingwei Sun, Jiahao Zhou, Gang Li, Jizhong Liu, Xunying Liu, Junbo Zhang, Jian Luan ยท 2025

While large audio-language models have advanced open-ended audio understanding, they still fall short of nuanced human-level comprehension. This gap persists largely because current benchmarks, limiteโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement

Liwen Xiao, Zhiyu Pan, Zhicheng Wang, Zhiguo Cao, Wei Li ยท 2025

Accurate prediction of multi-agent future trajectories is crucial for autonomous driving systems to make safe and efficient decisions. Trajectory refinement has emerged as a key strategy to enhance prโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Decoding Covert Speech from EEG Using a Functional Areas Spatio-Temporal Transformer

Muyun Jiang, Yi Ding, Wei Zhang, Kok Ann Colin Teo, LaiGuan Fong, Shuailei Zhang, Zhiwei Guo, Chenyu Liu, Raghavan Bhuvanakantham, Wei Khang Jeremy Sim, Chuan Huat Vince Foo, Rong Hui Jonathan Chua, Parasuraman Padmanabhan, Victoria Leong, Jia Lu, Balazs Gulyas, Cuntai Guan ยท 2025

Covert speech involves imagining speaking without audible sound or any movements. Decoding covert speech from electroencephalogram (EEG) is challenging due to a limited understanding of neural pronuncโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

M2P2: A Multi-Modal Passive Perception Dataset for Off-Road Mobility in Extreme Low-Light Conditions

Aniket Datar, Anuj Pokhrel, Mohammad Nazeri, Madhan B. Rao, Chenhui Pan, Yufan Zhang, Andre Harrison, Maggie Wigness, Philip R. Osteen, Jinwei Ye, Xuesu Xiao ยท 2024

Long-duration, off-road, autonomous missions require robots to continuously perceive their surroundings regardless of the ambient lighting conditions. Most existing autonomy systems heavily rely on acโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Do We Need iPhone Moment or Xiaomi Moment for Robots? Design of Affordable Home Robots for Health Monitoring

Bo Wei, Yaya Bian, Mingcen Gao ยท 2024

In this paper, we study cost-effective home robot solutions which are designed for home health monitoring. The recent advancements in Artificial Intelligence (AI) have significantly advanced the capabโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Mitigating Overconfidence in Nonlinear Kalman Filters via Covariance Recalibration

Shida Jiang, Junzhe Shi, Scott Moura ยท 2024

The Kalman filter (KF) is an optimal linear state estimator for linear systems, and numerous extensions, including the extended Kalman filter (EKF), unscented Kalman filter (UKF), and cubature Kalman โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Customizable Perturbation Synthesis for Robust SLAM Benchmarking

Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang ยท 2024

Robustness is a crucial factor for the successful deployment of robots in unstructured environments, particularly in the domain of Simultaneous Localization and Mapping (SLAM). Simulation-based benchmโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

InCoRo: In-Context Learning for Robotics Control with Feedback Loops

Jiaqiang Ye Zhu, Carla Gomez Cano, David Vazquez Bermudez, Michal Drozdzal ยท 2024

One of the challenges in robotics is to enable robotic units with the reasoning capability that would be robust enough to execute complex tasks in dynamic environments. Recent advances in LLMs have poโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Onboard View Planning of a Flying Camera for High Fidelity 3D Reconstruction of a Moving Actor

Qingyuan Jiang, Volkan Isler ยท 2023

Capturing and reconstructing a human actor's motion is important for filmmaking and gaming. Currently, motion capture systems with static cameras are used for pixel-level high-fidelity reconstructionsโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

The BARN Challenge 2023 -- Autonomous Navigation in Highly Constrained Spaces -- Inventec Team

Hanjaya Mandala, Guilherme Christmann ยท 2023

Navigation in the real-world is hard and filled with complex scenarios. The Benchmark Autonomous Robot Navigation (BARN) Challenge is a competition that focuses on highly constrained spaces. Teams comโ€ฆ

Read Paper โ†’
Page 1 of 2 Next โ†’