David Harvey in Engineering — Research Repository

Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung · 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annota…

Read Paper →

Engineering Preprint PDF DOI

Robot Learning from Human Videos: A Survey

Junyi Ma, Erhang Zhang, Haoran Yang, Ditao Li, Chenyang Xu, Guangming Wang, Hesheng Wang · 2026

A critical bottleneck hindering further advancement in embodied AI and robotics is the challenge of scaling robot data. To address this, the field of learning robot manipulation skills from human vide…

Read Paper →

Engineering Preprint PDF DOI

3D Generation for Embodied AI and Robotic Simulation: A Survey

Tianwei Ye, Yifan Mao, Minwen Liao, Jian Liu, Chunchao Guo, Dazhao Du, Quanxin Shou, Fangqi Zhu, Song Guo · 2026

Embodied AI and robotic systems increasingly depend on scalable, diverse, and physically grounded 3D content for simulation-based training and real-world deployment. While 3D generative modeling has a…

Read Paper →

Engineering Preprint PDF DOI

VEHRON: A Configuration-Driven BEV Simulation Framework for Subsystem-Level Studies

Subramanyam Natarajan · 2026

In practical early-stage battery-electric vehicle studies, analysis workflows may become fragmented across spreadsheets, notebooks, and project-specific scripts, making reuse, audit, and extension har…

Read Paper →

Engineering Preprint PDF DOI

Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines

Ziyao Wang, Bingying Wang, Hanrong Zhang, Tingting Du, Tianyang Chen, Guoheng Sun, Yexiao He, Zheyu Shen, Wanghao Ye, Ang Li · 2026

Despite remarkable progress in Vision--Language--Action (VLA) models, a central bottleneck remains underexamined: the data infrastructure that underlies embodied learning. In this survey, we argue tha…

Read Paper →

Engineering Preprint PDF DOI

When AI Meets Terahertz: A Survey on the Symbiosis of Artificial Intelligence and Terahertz Networks

Chong Han, Jingting Jiang, Zhengdong Hu, Meixia Tao, Wenjun Zhang · 2026

The Terahertz (THz) band (0.1-10 THz) has emerged as a critical frontier for future communication systems, offering ultra-wide bandwidths that enable Terabits-per-second (Tbps) wireless links and high…

Read Paper →

Engineering Preprint PDF DOI

A Survey of Legged Robotics in Non-Inertial Environments: Past, Present, and Future

I-Chia Chang, Xinyan Huang, Tzu-Yuan Lin, Sangli Teng, Wenjing Li, Maani Ghaffari, Jingang Yi, Yan Gu · 2026

Legged robots have demonstrated remarkable agility on rigid, stationary ground, but their locomotion reliability remains limited in non-inertial environments, where the supporting ground moves, tilts,…

Read Paper →

Engineering Preprint PDF DOI

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Huakang Chen, Jingbin Hu, Liumeng Xue, Qirui Zhan, Wenhao Li, Guobin Ma, Hanke Xie, Dake Guo, Linhan Ma, Yuepeng Jiang, Bengu Wu, Pengyuan Xie, Chuan Xie, Qiang Zhang, Lei Xie · 2026

Instruction-following text-to-speech (TTS) has emerged as an important capability for controllable and expressive speech generation, yet its evaluation remains underdeveloped due to limited benchmark …

Read Paper →

Engineering Preprint PDF DOI

Abstract Sim2Real through Approximate Information States

Yunfu Deng, Yuhao Li, Josiah P. Hanna · 2026

In recent years, reinforcement learning (RL) has shown remarkable success in robotics when a fast and accurate simulator is available for a given task. When using RL and simulation, more simulator rea…

Read Paper →

Engineering Preprint PDF DOI

Symmetry Is Almost All You Need: Robust Stability with Uncertainty Induced by Symmetric SRG Regions

Ding Zhang, Di Zhao, Philipp Braun, Jianqi Chen · 2026

This paper investigates the robust stability problem of a feedback system in the presence of uncertainties induced by graphical regions in the plane where the scaled relative graphs (SRGs) reside. Our…

Read Paper →

Engineering Preprint PDF DOI

The Sustainability Gap in Robotics: A Large-Scale Survey of Sustainability Awareness in 50,000 Research Articles

Antun Skuric, Leandro Von Werra, Thomas Wolf · 2026

We present a large-scale survey of sustainability communication and motivation in robotics research. Our analysis covers nearly 50,000 open-access papers from arXiv's cs.RO category published between …

Read Paper →

Engineering Preprint PDF DOI

Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

Xiaojing Chen, Haiqi Yu, Wei Ni, Dusit Niyato, Ruichen Zhang, Xin Wang, Shunqing Zhang, Shugong Xu · 2026

The rapid emergence of Large Language Models (LLMs) has catalyzed Agentic artificial intelligence (AI), autonomous systems integrating perception, reasoning, and action into closed-loop pipelines for …

Read Paper →

Engineering Preprint PDF DOI

Biologically Inspired Event-Based Perception and Sample-Efficient Learning for High-Speed Table Tennis Robots

Ziqi Wang, Jingyue Zhao, Xun Xiao, Jichao Yang, Yaohua Wang, Shi Xu, Lei Wang, Huadong Dai · 2026

Perception and decision-making in high-speed dynamic scenarios remain challenging for current robots. In contrast, humans and animals can rapidly perceive and make decisions in such environments. Taki…

Read Paper →

Engineering Preprint PDF DOI

A Survey on Robust Deep Joint Source-Channel Coding for Semantic Communications

Eunhye Hong, Taewoo Park, Yongjune Kim · 2026

Semantic communications (SCs) aim to transmit only the essential information required to perform given tasks, thereby improving communication efficiency. Deep learning-based joint source-channel codin…

Read Paper →

Engineering Preprint PDF DOI

A Survey on Sensor-based Planning and Control for Unmanned Underwater Vehicles

Shivam Vishwakarma, Tejal Bedmutha, Dharmendra Kumar Patel, Vijay Bhaskar Semwal, Leena Vachhani · 2026

This survey examines recent sensor-based planning and control methods for Unmanned Underwater Vehicles (UUVs). In complex, uncertain underwater environments, UUVs require advanced planning and control…

Read Paper →

Engineering Preprint PDF DOI

From Video to Control: A Survey of Learning Manipulation Interfaces from Temporal Visual Data

Linfang Zheng, Zikai Ouyang, Chen Wang, Jia Pan, Wei Zhang · 2026

Video is a scalable observation of physical dynamics: it captures how objects move, how contact unfolds, and how scenes evolve under interaction -- all without requiring robot action labels. Yet trans…

Read Paper →

Engineering Preprint PDF DOI

Foundation Models Defining A New Era In Sensor-based Human Activity Recognition: A Survey And Outlook

Sizhen Bian, Mengxi Liu, Lala Shakti Swarup Ray, Bo Zhou, Bin Guo, Zhiwen Yu, Thomas Ploetz, Paul Lukowicz, Siyu Yuan, Vitor Fortes Rey · 2026

Sensor-based Human Activity Recognition (HAR) underpins many ubiquitous and wearable computing applications, yet current models remain limited by scarce labels, sensor heterogeneity, and weak generali…

Read Paper →

Engineering Preprint PDF DOI

Spherical Antenna Arrays for Future Communications: Principles, Applications, and Research Directions

Cunhua Pan, Xianzhe Chen, Hong Ren, Jiangzhou Wang · 2026

With the development of 6G technologies, traditional uniform linear arrays (ULAs) and uniform planar arrays (UPAs) can hardly meet the demands of three-dimensional (3D) full-space coverage and high an…

Read Paper →

Engineering Preprint PDF DOI

MIMO OFDM-Enabled ISAC for Low-Altitude Non-Cooperative UAV Surveillance: A Survey

Shiyu Bai, Sijia Li, Cunyi Yin, Wenqiu Qu, Li-Ta Hsu, Yuanwei Liu, Wen-Hua Chen · 2026

The widespread use of unmanned aerial vehicles (UAVs) in low-altitude airspace has raised significant safety and security concerns, motivating the development of reliable non-cooperative UAV surveilla…

Read Paper →

Engineering Preprint PDF DOI

Tune to Learn: How Controller Gains Shape Robot Policy Learning

Antonia Bronars, Younghyo Park, Pulkit Agrawal · 2026

Position controllers have become the dominant interface for executing learned manipulation policies. Yet a critical design decision remains understudied: how should we choose controller gains for poli…

Read Paper →

Browse Research Papers

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Robot Learning from Human Videos: A Survey

3D Generation for Embodied AI and Robotic Simulation: A Survey

VEHRON: A Configuration-Driven BEV Simulation Framework for Subsystem-Level Studies

Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines

When AI Meets Terahertz: A Survey on the Symbiosis of Artificial Intelligence and Terahertz Networks

A Survey of Legged Robotics in Non-Inertial Environments: Past, Present, and Future

MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech

Abstract Sim2Real through Approximate Information States

Symmetry Is Almost All You Need: Robust Stability with Uncertainty Induced by Symmetric SRG Regions

The Sustainability Gap in Robotics: A Large-Scale Survey of Sustainability Awareness in 50,000 Research Articles

Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

Biologically Inspired Event-Based Perception and Sample-Efficient Learning for High-Speed Table Tennis Robots

A Survey on Robust Deep Joint Source-Channel Coding for Semantic Communications

A Survey on Sensor-based Planning and Control for Unmanned Underwater Vehicles

From Video to Control: A Survey of Learning Manipulation Interfaces from Temporal Visual Data

Foundation Models Defining A New Era In Sensor-based Human Activity Recognition: A Survey And Outlook

Spherical Antenna Arrays for Future Communications: Principles, Applications, and Research Directions

MIMO OFDM-Enabled ISAC for Low-Altitude Non-Cooperative UAV Surveillance: A Survey

Tune to Learn: How Controller Gains Shape Robot Policy Learning

Browse by Category

Research Type

Publish Your Research