Abhinav Joshi in Engineering — Research Repository

Showing 6 results for "abhinav joshi" in Engineering

Engineering Preprint PDF DOI

RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild

Wenjing Margaret Mao, Jefferson Ng, Luyang Hu, Daniel Gehrig, Antonio Loquercio · 2026

Scaling up robot learning will likely require human data containing rich and long-horizon interactions in the wild. Existing approaches for collecting such data trade off portability, robustness to oc…

Read Paper →

Engineering Preprint PDF DOI

Privacy-Preserving End-to-End Full-Duplex Speech Dialogue Models

Nikita Kuzmin, Tao Zhong, Jiajun Deng, Yingke Zhu, Tristan Tsoi, Tianxiang Cao, Simon Lui, Kong Aik Lee, Eng Siong Chng · 2026

End-to-end full-duplex speech models feed user audio through an always-on LLM backbone, yet the speaker privacy implications of their hidden representations remain unexamined. Following the VoicePriva…

Read Paper →

Engineering Preprint PDF DOI

Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks

Viraj Joshi, Zifan Xu, Bo Liu, Peter Stone, Amy Zhang · 2025

Multi-task Reinforcement Learning (MTRL) has emerged as a critical training paradigm for applying reinforcement learning (RL) to a set of complex real-world robotic tasks, which demands a generalizabl…

Read Paper →

Engineering Preprint PDF DOI

FD-Bench: A Full-Duplex Benchmarking Pipeline Designed for Full Duplex Spoken Dialogue Systems

Yizhou Peng, Yi-Wen Chao, Dianwen Ng, Yukun Ma, Chongjia Ni, Bin Ma, Eng Siong Chng · 2025

Full-duplex spoken dialogue systems (FDSDS) enable more natural human-machine interactions by allowing real-time user interruptions and backchanneling, compared to traditional SDS that rely on turn-ta…

Read Paper →

Engineering Preprint PDF DOI

AINav: Large Language Model-Based Adaptive Interactive Navigation

Kangjie Zhou, Yao Mu, Haoyang Song, Yi Zeng, Pengying Wu, Han Gao, Chang Liu · 2025

Robotic navigation in complex environments remains a critical research challenge. Traditional navigation methods focus on optimal trajectory generation within fixed free workspace, therefore strugglin…

Read Paper →

Engineering Preprint PDF DOI

Moshi: a speech-text foundation model for real-time dialogue

Alexandre Defossez, Laurent Mazare, Manu Orsini, Amelie Royer, Patrick Perez, Herve Jegou, Edouard Grave, Neil Zeghidour · 2024

We introduce Moshi, a speech-text foundation model and full-duplex spoken dialogue framework. Current systems for spoken dialogue rely on pipelines of independent components, namely voice activity det…

Read Paper →

📝

Publish Your Research

Free open-access publishing with Google Scholar indexing.

Submission Guide →

Browse Research Papers

RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild

Privacy-Preserving End-to-End Full-Duplex Speech Dialogue Models

Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks

FD-Bench: A Full-Duplex Benchmarking Pipeline Designed for Full Duplex Spoken Dialogue Systems

AINav: Large Language Model-Based Adaptive Interactive Navigation

Moshi: a speech-text foundation model for real-time dialogue

Browse by Category

Research Type

Publish Your Research