Expertini Research Research

Browse Research Papers

5,254+ open-access research outputs.

โœ• Clear
๐Ÿ” recognition ๐Ÿ“‚ Engineering
Showing 5254 results for "recognition" in Engineering
Engineering Preprint PDF DOI

RadarSplat-RIO: Indoor Radar-Inertial Odometry with Gaussian Splatting-Based Radar Bundle Adjustment

Pou-Chun Kung, Yuan Tian, Zhengqin Li, Yue Liu, Eric Whitmire, Wolf Kienzle, Hrvoje Benko ยท 2026

Radar is more resilient to adverse weather and lighting conditions than visual and Lidar simultaneous localization and mapping (SLAM). However, most radar SLAM pipelines still rely heavily on frame-toโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

In-Sync: Adaptation of Speech Aware Large Language Models for ASR with Word Level Timestamp Predictions

Xulin Fan, Vishal Sunder, Samuel Thomas, Mark Hasegawa-Johnson, Brian Kingsbury, George Saon ยท 2026

Recent advances in speech-aware language models have coupled strong acoustic encoders with large language models, enabling systems that move beyond transcription to produce richer outputs. Among theseโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

RIS-Aided Sensing: Experimental Validation of Radar 3D Imaging in the mmWave Band

Sergio Mico-Rosa, Alvaro Villaescusa-Tebar, Saul Fenollosa, Carlos Villena-Jimenez, Monika Drozdowska, Narcis Cardona ยท 2026

The transition toward 6G networks demands energy-efficient hardware capable of active interaction with the environment. Reconfigurable Intelligent Surfaces (RIS) have emerged as a key technology for Iโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction

Sashi Novitasari, Takashi Fukuda, Kurata Gakuto, George Saon ยท 2026

Speech-aware LLMs (SLLMs) have recently achieved state-of-the-art ASR performance; however, they still fail to accurately transcribe bias words that appear rarely or never in the training data. Contexโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

VLMaterial: Vision-Language Model-Based Camera-Radar Fusion for Physics-Grounded Material Identification

Jiangyou Zhu, He Chen ยท 2026

Accurate material recognition is a fundamental capability for intelligent perception systems to interact safely and effectively with the physical world. For instance, distinguishing visually similar oโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Dyadic Partnership(DP): A Missing Link Towards Full Autonomy in Medical Robotics

Nassir Navab, Zhongliang Jiang ยท 2026

For the past decades medical robotic solutions were mostly based on the concept of tele-manipulation. While their design was extremely intelligent, allowing for better access, improved dexterity, reduโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Minimal Embodiment Enables Efficient Learning of Number Concepts in Robot

Zhegong Shangguan, Alessandro Di Nuovo, Angelo Cangelosi ยท 2026

Robots are increasingly entering human-interactive scenarios that require understanding of quantity. How intelligent systems acquire abstract numerical concepts from sensorimotor experience remains a โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Speaker Attributed Automatic Speech Recognition Using Speech Aware LLMS

Hagai Aronowitz, Zvi Kons, Avihu Dekel, George Saon, Ron Hoory ยท 2026

Speaker-Attributed Automatic Speech Recognition (SAA) enhances traditional ASR systems by incorporating relative speaker identity tags directly into the transcript (e.g., [Speaker 1]:, [Speaker 2]:). โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Teaching the Teachers: Boosting unsupervised domain adaptation in speech recognition by ensemble update

Rehan Ahmad, Muhammad Umar Farooq, Qihang Feng, Thomas Hain ยท 2026

Speech recognition systems often struggle with data domains that have not been included in the training. To address this, unsupervised domain adaptation has been explored with ensemble and multi-stageโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Unsupervised Equivalent Contrastive Learning for Radio Signal Recognition

Shilian Zheng, Jie Chen, Luxin Zhang, Xiaoniu Yang ยท 2026

Robust radio signal recognition is fundamental to spectrum management, electromagnetic space security, and intelligent wireless applications, yet existing deep-learning methods rely heavily on large lโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Coordinate-Invariant Local Representation of Motion and Force Trajectories for Identification and Generalization Across Coordinate Systems

Arno Verduyn, Erwin Aertbelien, Maxim Vochten, Joris De Schutter ยท 2026

Identifying the trajectories of rigid bodies and of interaction forces is essential for a wide range of tasks in robotics, biomechanics, and related domains. These tasks include trajectory segmentatioโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Utterance-Level Methods for Identifying Reliable ASR-Output for Child Speech

Gus Lathouwers, Lingyun Gao, Catia Cucchiarini, Helmer Strik ยท 2026

Automatic Speech Recognition (ASR) is increasingly used in applications involving child speech, such as language learning and literacy acquisition. However, the effectiveness of such applications is lโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Data Selection Effects on Self-Supervised Learning of Audio Representations for French Audiovisual Broadcasts

Valentin Pelloin, Lina Bekkali, Reda Dehak, David Doukhan ยท 2026

Audio and speech self-supervised encoder models are now widely used for a lot of different tasks. Many of these models are often trained on clean segmented speech content such as LibriSpeech. In this โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Adaptor: Advancing Assistive Teleoperation with Few-Shot Learning and Cross-Operator Generalization

Yu Liu, Yihang Yin, Tianlv Huang, Fei Yan, Yuan Xu, Weinan Hong, Wei Han, Yue Cao, Xiangyu Chen, Zipei Fan, Xuan Song ยท 2026

Assistive teleoperation enhances efficiency via shared control, yet inter-operator variability, stemming from diverse habits and expertise, induces highly heterogeneous trajectory distributions that uโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Enhancing ASR Performance in the Medical Domain for Dravidian Languages

Sri Charan Devarakonda, Ravi Sastry Kolluru, Manjula Sri Rayudu, Rashmi Kapoor, Madhu G, Anil Kumar Vuppala ยท 2026

Automatic Speech Recognition (ASR) for low-resource Dravidian languages like Telugu and Kannada faces significant challenges in specialized medical domains due to limited annotated data and morphologiโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Towards Lifelong Aerial Autonomy: Geometric Memory Management for Continual Visual Place Recognition in Dynamic Environments

Xingyu Shao, Zhiqiang Yan, Liangzheng Sun, Mengfan He, Chao Chen, Jinhui Zhang, Chunyu Li, Ziyang Meng ยท 2026

Robust geo-localization in changing environmental conditions is critical for long-term aerial autonomy. While visual place recognition (VPR) models perform well when airborne views match the training โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs

Jing Peng, Chenghao Wang, Yi Yang, Lirong Qian, Junjie Li, Yu Xi, Shuai Wang, Kai Yu ยท 2026

Speech LLM post-training increasingly relies on efficient cross-modal alignment and robust low-resource adaptation, yet collecting large-scale audio-text pairs remains costly. Text-only alignment methโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs

Yuan Xie, Jiaqi Song, Guang Qiu, Xianliang Wang, Ming Lei, Jie Gao, Jie Wu ยท 2026

Integrating large language models (LLMs) into automatic speech recognition (ASR) has become a dominant paradigm. Although recent LLM-based ASR models have shown promising performance on public benchmaโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Harnessing Embodied Agents: Runtime Governance for Policy-Constrained Execution

Xue Qin, Simin Luan, John See, Cong Yang, Zhijun Li ยท 2026

Embodied agents are evolving from passive reasoning systems into active executors that interact with tools, robots, and physical environments. Once granted execution authority, the central challenge bโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

ELC: Evidential Lifelong Classifier for Uncertainty Aware Radar Pulse Classification

Mohamed Rabie, Chinthana Panagamuwa, Konstantinos G. Kyriakopoulos ยท 2026

Reliable radar pulse classification is essential in Electromagnetic Warfare for situational awareness and decision support. Deep Neural Networks have shown strong performance in radar pulse and RF emiโ€ฆ

Read Paper โ†’
โ† Prev Page 2 of 263 Next โ†’