70+ open-access research outputs.
This paper develops a comprehensive analytical framework for the outage probability of fluid antenna system (FAS)-aided communications by modeling the antenna as a continuous aperture and approximatin…
With the increasing integration of robots into daily life, human-robot interaction has become more complex and multifaceted. A critical component of this interaction is Interactive Visual Grounding (I…
Spatial information is a critical clue for multi-channel multi-speaker target speech recognition. Most state-of-the-art multi-channel Automatic Speech Recognition (ASR) systems extract spatial feature…
Zero-shot text-to-speech models can clone a speaker's timbre from a short reference audio, but they also strongly inherit the speaking style present in the reference. As a result, synthesizing speech …
Sound separation (SS) and target sound extraction (TSE) are fundamental techniques for addressing complex acoustic scenarios. While existing SS methods struggle with determining the unknown number of …
Recent advances in text-to-speech (TTS) have enabled models to clone arbitrary unseen speakers and synthesize high-quality, natural-sounding speech. However, evaluation methods lag behind: typical mea…
Terahertz inter-satellite links enable unprecedented sensing precision for Low Earth Orbit (LEO) constellations, yet face fundamental bounds from hardware impairments, pointing errors, and network int…
Reinforcement learning (RL) is a promising avenue for post-training vision-language-action (VLA) models, but practical deployment is hindered by sparse rewards and unstable training. This work mitigat…
Recent advances in target sound extraction (TSE) utilize directional clues derived from direction of arrival (DoA), which represent an inherent spatial property of sound available in any acoustic scen…
This paper establishes a theoretical framework for analyzing the fundamental performance limits of terahertz (THz) Low Earth Orbit (LEO) inter-satellite link (ISL) Integrated Sensing and Communication…
The spatial semantic segmentation task focuses on separating and classifying sound objects from multichannel signals. To achieve two different goals, conventional methods fine-tune a large classificat…
Reliability assessment of engineering systems often requires repeated evaluations of limit-state functions that may rely on computationally expensive high-fidelity models, rendering direct sampling-ba…
The success of deep learning in medical imaging applications has led several companies to deploy proprietary models in diagnostic workflows, offering monetized services. Even though model weights are …
'The cardinal sin in control is to believe that the plant is given' Karl Astrom. Astrom, a towering figure of control theory and practice and awardee of the 1993 IEEE Medal of Honor for his work on ad…
Humanoid teleoperation plays a vital role in demonstrating and collecting data for complex humanoid-scene interactions. However, current teleoperation systems face critical limitations: they decouple …
Manipulation has long been a challenging task for robots, while humans can effortlessly perform complex interactions with objects, such as hanging a cup on the mug rack. A key reason is the lack of a …
This paper aims to achieve single-channel target speech extraction (TSE) in enclosures utilizing distance clues and room information. Recent works have verified the feasibility of distance clues for t…
This dissertation proposes an electrocardiogram (ECG) tracking device that diagnoses cardiopulmonary problems using the Internet of Things (IoT) desired results. The initiative is built on the interne…
In this work, an Aluminum Scandium Nitride (AlScN) on Diamond Sezawa-mode surface acoustic wave (SAW) platform for RF filtering at Ku-band (12-18 GHz) is demonstrated. Thanks to the high acoustic velo…
Near Field Communication (NFC) is widely used in security applications such as door access systems and ID cards. However, clone attacks can replicate digital information, enabling unauthorized access.…
Free open-access publishing with Google Scholar indexing.
Submission Guide →