Audrey Huang in Engineering — Research Repository

Engineering Preprint PDF DOI

Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion

Nimesh Khandelwal, Shakti S. Gupta · 2026

This paper documents a case study in agent-driven autonomous reinforcement learning research for quadruped locomotion. The setting was not a fully self-starting research system. A human provided high-…

Read Paper →

Engineering Preprint PDF DOI

Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding

Mingyue Huo, Wei-Cheng Tseng, Yiwen Shao, Hao Zhang, Dong Yu · 2025

Human voice encodes both identity and paralinguistic cues, yet encoders in large audio-language models (LALMs) rarely balance both aspects. In this work, we present a study toward building a general-p…

Read Paper →

Engineering Preprint PDF DOI

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation

Wei Liu, Jiahong Li, Yiwen Shao, Dong Yu · 2025

Speech-LLM models have demonstrated great performance in multi-modal and multi-task speech understanding. A typical speech-LLM paradigm is integrating speech modality with a large language model (LLM)…

Read Paper →

Engineering Preprint PDF DOI

One-shot Adaptation of Humanoid Whole-body Motion with Walking Priors

Hao Huang, Geeta Chandra Raju Bethala, Shuaihang Yuan, Congcong Wen, Mengyu Wang, Anthony Tzes, Yi Fang · 2025

Whole-body humanoid motion represents a fundamental challenge in robotics, requiring balance, coordination, and adaptability to enable human-like behaviors. However, existing methods typically require…

Read Paper →

Engineering Preprint PDF DOI

Single-Rod Brachiation Robot: Mechatronic Control Design and Validation of Prejump Phases

Juraj Lieskovsky, Hijiri Akahane, Aoto Osawa, Jaroslav Busek, Ikuo Mizuuchi, Tomas Vyhlidal · 2025

A complete mechatronic design of a minimal configuration brachiation robot is presented. The robot consists of a single rigid rod with gripper mechanisms attached to both ends. The grippers are used t…

Read Paper →

Engineering Preprint PDF DOI

Audio Deepfake Verification

Li Wang, Junyi Ao, Linyong Gan, Yuancheng Wang, Xueyao Zhang, Zhizheng Wu · 2025

With the rapid development of deepfake technology, simply making a binary judgment of true or false on audio is no longer sufficient to meet practical needs. Accurately determining the specific deepfa…

Read Paper →

Engineering Preprint PDF DOI

Constructed Realities? Technical and Contextual Anomalies in a High-Profile Image

Matthias Wjst · 2025

This study offers a forensic assessment of a widely circulated photograph featuring Andrew Mountbatten-Windsor, Virginia Giuffre, and Ghislaine Maxwell, an image that has played a pivotal role in publ…

Read Paper →

Engineering Preprint PDF DOI

DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model

Yuliang Huang, Imraj Singh, Thomas Joyce, Kris Thielemans, Jamie R. McClelland · 2025

3D Cone-Beam CT (CBCT) is widely used in radiotherapy but suffers from motion artifacts due to breathing. A common clinical approach mitigates this by sorting projections into respiratory phases and r…

Read Paper →

Engineering Preprint PDF DOI

CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity

Guang Yin, Yitong Li, Yixuan Wang, Dale McConachie, Paarth Shah, Kunimatsu Hashimoto, Huan Zhang, Katherine Liu, Yunzhu Li · 2025

Natural language instructions for robotic manipulation tasks often exhibit ambiguity and vagueness. For instance, the instruction "Hang a mug on the mug tree" may involve multiple valid actions if the…

Read Paper →

Engineering Preprint PDF DOI

Conversations with Andrea: Visitors' Opinions on Android Robots in a Museum

Marcel Heisler, Christian Becker-Asano · 2025

The android robot Andrea was set up at a public museum in Germany for six consecutive days to have conversations with visitors, fully autonomously. No specific context was given, so visitors could sta…

Read Paper →

Engineering Preprint PDF DOI

AuDeRe: Automated Strategy Decision and Realization in Robot Planning and Control via LLMs

Yue Meng, Fei Chen, Yongchao Chen, Chuchu Fan · 2025

Recent advancements in large language models (LLMs) have shown significant promise in various domains, especially robotics. However, most prior LLM-based work in robotic applications either directly p…

Read Paper →

Engineering Preprint PDF DOI

Computational Imaging Through Atmospheric Turbulence

Nicholas Chimitt, Stanley H. Chan · 2024

Since the seminal work of Andrey Kolmogorov in the early 1940's, imaging through atmospheric turbulence has grown from a pure scientific pursuit to an important subject across a multitude of civilian,…

Read Paper →

Engineering Preprint PDF DOI

ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real

Jiangran Lyu, Yuxing Chen, Tao Du, Feng Zhu, Huiquan Liu, Yizhou Wang, He Wang · 2024

This paper tackles the challenging robotic task of generalizable paper cutting using scissors. In this task, scissors attached to a robot arm are driven to accurately cut curves drawn on the paper, wh…

Read Paper →

Engineering Preprint PDF DOI

Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection

Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Kaifeng Pang, Demetri Terzopoulos, Kyunghyun Sung · 2024

Current deep learning-based models typically analyze medical images in either 2D or 3D albeit disregarding volumetric information or suffering sub-optimal performance due to the anisotropic resolution…

Read Paper →

Engineering Preprint PDF DOI

Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition

Alaa Nfissi, Wassim Bouachir, Nizar Bouguila, Brian Mishara · 2024

Speech emotion recognition (SER) has gained significant attention due to its several application fields, such as mental health, education, and human-computer interaction. However, the accuracy of SER …

Read Paper →

Engineering Preprint PDF DOI

Autonomous Robot for Disaster Mapping and Victim Localization

Michael Potter, Rahil Bhowal, Richard Zhao, Anuj Patel, Jingming Cheng · 2024

In response to the critical need for effective reconnaissance in disaster scenarios, this research article presents the design and implementation of a complete autonomous robot system using the Turtle…

Read Paper →

Engineering Preprint PDF DOI

Comparison of different methods for identification of dominant oscillation mode

Maja Muftic Dedovic, Samir Avdakovic, Adnan Mujezinovic, Nedis Dautbasic · 2024

This paper introduces and compares the various techniques for identification and analysis of low frequency oscillations in a power system. Inter-area electromechanical oscillations are the focus of th…

Read Paper →

Engineering Preprint PDF DOI

Augmented Reality User Interface for Command, Control, and Supervision of Large Multi-Agent Teams

Frank Regal, Chris Suarez, Fabian Parra, Mitch Pryor · 2024

Multi-agent human-robot teaming allows for the potential to gather information about various environments more efficiently by exploiting and combining the strengths of humans and robots. In industries…

Read Paper →

Engineering Preprint PDF DOI

SKT-Hang: Hanging Everyday Objects via Object-Agnostic Semantic Keypoint Trajectory Generation

Chia-Liang Kuo, Yu-Wei Chao, Yi-Ting Chen · 2023

We study the problem of hanging a wide range of grasped objects on diverse supporting items. Hanging objects is a ubiquitous task that is encountered in numerous aspects of our everyday lives. However…

Read Paper →

Engineering Preprint PDF DOI

Simultaneous Robot-World and Hand-Eye Calibration

Fadi Dornaika, Radu Horaud · 2023

Recently, Zhuang, Roth, \& Sudhakar [1] proposed a method that allows simultaneous computation of the rigid transformations from world frame to robot base frame and from hand frame to camera frame. Th…

Read Paper →

Browse Research Papers

Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion

Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding

TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation

One-shot Adaptation of Humanoid Whole-body Motion with Walking Priors

Single-Rod Brachiation Robot: Mechatronic Control Design and Validation of Prejump Phases

Audio Deepfake Verification

Constructed Realities? Technical and Contextual Anomalies in a High-Profile Image

DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model

CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity

Conversations with Andrea: Visitors' Opinions on Android Robots in a Museum

AuDeRe: Automated Strategy Decision and Realization in Robot Planning and Control via LLMs

Computational Imaging Through Atmospheric Turbulence

ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real

Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection

Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition

Autonomous Robot for Disaster Mapping and Victim Localization

Comparison of different methods for identification of dominant oscillation mode

Augmented Reality User Interface for Command, Control, and Supervision of Large Multi-Agent Teams

SKT-Hang: Hanging Everyday Objects via Object-Agnostic Semantic Keypoint Trajectory Generation

Simultaneous Robot-World and Hand-Eye Calibration

Browse by Category

Research Type

Publish Your Research