Deepak Garg · Engineering · Preprint — Research Repository

Engineering Preprint PDF DOI

A Matrix-Free Galerkin Multigrid Solver and Failure-Mode Screen for Single-GPU 3D SIMP Linear Systems

Shaoliang Yang, Jun Wang, Yunsheng Wang · 2026

Large 3D SIMP studies require repeated elasticity solves for density-dependent operators whose finest matrices are expensive to assemble and whose conditioning degrades under high contrast. We study t…

Read Paper →

Engineering Preprint PDF DOI

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

Jaskirat Sudan, Hashim Ali, Surya Subramani, Hafiz Malik · 2026

Supervised contrastive learning (SupCon) is widely used to shape representations, but has seen limited targeted study for audio deepfake detection. Existing work typically combines contrastive terms w…

Read Paper →

Engineering Preprint PDF DOI

Passage-Aware Structural Mapping for RGB-D Visual SLAM

Ali Tourani, Miguel Fernandez-Cortizas, Saad Ejaz, David Perez Saura, Asier Bikandi-Noya, Jose Luis Sanchez-Lopez, Holger Voos · 2026

Doorways and passages are critical structural elements for indoor robot navigation, yet they remain underexplored in modern Visual SLAM (VSLAM) frameworks. This paper presents a passage-aware structur…

Read Paper →

Engineering Preprint PDF DOI

Benefits of Low-Cost Bio-Inspiration in the Age of Overparametrization

Kevin Godin-Dubois, Anil Yaman, Anna V. Kononova · 2026

While Central Pattern Generators (CPGs) and Multi-Layer Perceptrons (MLP) are widely used paradigms in robot control, few systematic studies have been performed on the relative merits of large paramet…

Read Paper →

Engineering Preprint PDF DOI

HCFD: A Benchmark for Audio Deepfake Detection in Healthcare

Mohd Mujtaba Akhtar, Girish, Muskaan Singh · 2026

In this study, we present Healthcare Codec-Fake Detection (HCFD), a new task for detecting codec-fakes under pathological speech conditions. We intentionally focus on codec based synthetic speech in t…

Read Paper →

Engineering Preprint PDF DOI

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset

Faheem Ahmad, Ajan Ahmed, Masudul Imtiaz · 2026

Deep learning has enabled highly realistic synthetic speech, raising concerns about fraud, impersonation, and disinformation. Despite rapid progress in neural detectors, transparent baselines are need…

Read Paper →

Engineering Preprint PDF DOI

ProSDD: Learning Prosodic Representations for Speech Deepfake Detection against Expressive and Emotional Attacks

Aurosweta Mahapatra, Ismail Rasim Ulgen, Kong Aik Lee, Nicholas Andrews, Berrak Sisman · 2026

Speech deepfake detection (SDD) systems perform well on standard benchmarks datasets but often fail to generalize to expressive and emotional spoofing attacks. Many methods rely on spoof-heavy trainin…

Read Paper →

Engineering Preprint PDF DOI

StreamMark: A Deep Learning-Based Semi-Fragile Audio Watermarking for Proactive Deepfake Detection

Zhentao Liu, Milos Cernak · 2026

The rapid advancement of generative AI has made it increasingly challenging to distinguish between deepfake audio and authentic human speech. To overcome the limitations of passive detection methods, …

Read Paper →

Engineering Preprint PDF DOI

HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation

Chengjie Fan, Cong Pan, Zijian Liu, Ningzhong Liu, Jie Qin · 2026

Inspired by the general Vision-and-Language Navigation (VLN) task, aerial VLN has attracted widespread attention, owing to its significant practical value in applications such as logistics delivery an…

Read Paper →

Engineering Preprint PDF DOI

Coalitional Zero-Sum Games for ${H_{\infty}}$ Leader-Following Consensus Control

Yunxiao Ren, Dingguo Liang, Yuezu Lv, Zhisheng Duan · 2026

This paper investigates the leader-following consensus problem for a class of multi-agent systems subject to adversarial attack-like external inputs. To address this, we formulate the robust leader-fo…

Read Paper →

Engineering Preprint PDF DOI

Disentangling Speaker Traits for Deepfake Source Verification via Chebyshev Polynomial and Riemannian Metric Learning

Xi Xuan, Wenxin Zhang, Zhiyu Li, Jennifer Williams, Ville Hautamaki, Tomi H. Kinnunen · 2026

Speech deepfake source verification systems aims to determine whether two synthetic speech utterances originate from the same source generator, often assuming that the resulting source embeddings are …

Read Paper →

Engineering Preprint PDF DOI

GAPG: Geometry Aware Push-Grasping Synergy for Goal-Oriented Manipulation in Clutter

Lijingze Xiao, Jinhong Du, Yang Cong, Supeng Diao, Yu Ren · 2026

Grasping target objects is a fundamental skill for robotic manipulation, but in cluttered environments with stacked or occluded objects, a single-step grasp is often insufficient. To address this, pre…

Read Paper →

Engineering Preprint PDF DOI

Industrial cuVSLAM Benchmark & Integration

Charbel Abi Hana, Kameel Amareen, Mohamad Mostafa, Dmitry Slepichev, Hesam Rabeti, Zheng Wang, Mihir Acharya, Anthony Rizk · 2026

This work presents a comprehensive benchmark evaluation of visual odometry (VO) and visual SLAM (VSLAM) systems for mobile robot navigation in real-world logistical environments. We compare multiple v…

Read Paper →

Engineering Preprint PDF DOI

Understanding the strengths and weaknesses of SSL models for audio deepfake model attribution

Gabriel Pirlogeanu, Adriana Stan, Horia Cucu · 2026

Audio deepfake model attribution aims to mitigate the misuse of synthetic speech by identifying the source model responsible for generating a given audio sample, enabling accountability and informing …

Read Paper →

Engineering Preprint PDF DOI

Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection

Cemal Hanilci, Md Sahidullah, Tomi Kinnunen · 2026

Speech deepfake detection (SDD) is essential for maintaining trust in voice-driven technologies and digital media. Although recent SDD systems increasingly rely on self-supervised learning (SSL) repre…

Read Paper →

Engineering Preprint PDF DOI

Point Cloud Feature Coding for Object Detection over an Error-Prone Cloud-Edge Collaborative System

Chongzhen Tian, Hui Yuan, Pan Zhao, Chang Sun, Raouf Hamzaoui, Sam Kwong · 2026

Cloud-edge collaboration enhances machine perception by combining the strengths of edge and cloud computing. Edge devices capture raw data (e.g., 3D point clouds) and extract salient features, which a…

Read Paper →

Engineering Preprint PDF DOI

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?

Xin Wang, Ge Wanying, Junichi Yamagishi · 2026

Building speech deepfake detection models that are generalizable to unseen attacks remains a challenging problem. Although the field has shifted toward a pre-training and fine-tuning paradigm using sp…

Read Paper →

Engineering Preprint PDF DOI

A SUPERB-Style Benchmark of Self-Supervised Speech Models for Audio Deepfake Detection

Hashim Ali, Nithin Sai Adupa, Surya Subramani, Hafiz Malik · 2026

Self-supervised learning (SSL) has transformed speech processing, with benchmarks such as SUPERB establishing fair comparisons across diverse downstream tasks. Despite it's security-critical importanc…

Read Paper →

Engineering Preprint PDF DOI

Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper

Hoan My Tran, Xin Wang, Wanying Ge, Xuechen Liu, Junichi Yamagishi · 2026

Deepfake speech utterances can be forged by replacing one or more words in a bona fide utterance with semantically different words synthesized with speech-generative models. While a dedicated syntheti…

Read Paper →

Engineering Preprint PDF DOI

VIGOR: Visual Goal-In-Context Inference for Unified Humanoid Fall Safety

Osher Azulay, Zhengjie Xu, Andrew Scheffer, Stella X. Yu · 2026

Reliable fall recovery is critical for humanoids operating in cluttered environments. Unlike quadrupeds or wheeled robots, humanoids experience high-energy impacts, complex whole-body contact, and lar…

Read Paper →

Browse Research Papers

A Matrix-Free Galerkin Multigrid Solver and Failure-Mode Screen for Single-GPU 3D SIMP Linear Systems

Similarity Choice and Negative Scaling in Supervised Contrastive Learning for Deepfake Audio Detection

Passage-Aware Structural Mapping for RGB-D Visual SLAM

Benefits of Low-Cost Bio-Inspiration in the Age of Overparametrization

HCFD: A Benchmark for Audio Deepfake Detection in Healthcare

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset

ProSDD: Learning Prosodic Representations for Speech Deepfake Detection against Expressive and Emotional Attacks

StreamMark: A Deep Learning-Based Semi-Fragile Audio Watermarking for Proactive Deepfake Detection

HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation

Coalitional Zero-Sum Games for ${H_{\infty}}$ Leader-Following Consensus Control

Disentangling Speaker Traits for Deepfake Source Verification via Chebyshev Polynomial and Riemannian Metric Learning

GAPG: Geometry Aware Push-Grasping Synergy for Goal-Oriented Manipulation in Clutter

Industrial cuVSLAM Benchmark & Integration

Understanding the strengths and weaknesses of SSL models for audio deepfake model attribution

Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection

Point Cloud Feature Coding for Object Detection over an Error-Prone Cloud-Edge Collaborative System

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?

A SUPERB-Style Benchmark of Self-Supervised Speech Models for Audio Deepfake Detection

Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper

VIGOR: Visual Goal-In-Context Inference for Unified Humanoid Fall Safety

Browse by Category

Research Type

Publish Your Research