D.K. Bhattacharya in Engineering — Research Repository

Engineering Preprint PDF DOI

Indic-CodecFake meets SATYAM: Towards Detecting Neural Audio Codec Synthesized Speech Deepfakes in Indic Languages

Girish, Mohd Mujtaba Akhtar, Orchid Chetia Phukan, Arun Balaji Buduru · 2026

The rapid advancement of Audio Large Language Models (ALMs), driven by Neural Audio Codecs (NACs), has led to the emergence of highly realistic speech deepfakes, commonly referred to as CodecFakes (CF…

Deep Hierarchical Knowledge Loss for Fault Intensity Diagnosis

Yu Sha, Shuiping Gou, Bo Liu, Haofan Lu, Ningtao Liu, Jiahui Fu, Horst Stoecker, Domagoj Vnucec, Nadine Wetzstein, Andreas Widl, Kai Zhou · 2026

Fault intensity diagnosis (FID) plays a pivotal role in intelligent manufacturing, but neglecting dependencies among target classes hinders its practical deployment. This paper introduces a novel and…

Doppler Shift Keying Modulation for Uplink Multiple Access over Doubly-Dispersive Channels

Xuehan Wang, Jintao Wang, Hai Lin, Jinhong Yuan, Xu Shi, Hengyu Zhang, Jian Song · 2026

The delay-Doppler (DD) domain modulation has been regarded as one of the most competitive candidates to support wireless communications for emerging high-mobility applications in the sixth-generation …

Airspace-aware Contingency Landing Planning

H. Emre Tekaslan, Ella M. Atkins · 2026

This paper develops a real-time, search-based aircraft contingency landing planner that minimizes traffic disruptions while accounting for ground risk. The airspace model captures dense air traffic de…

AmbER$^2$: Dual Ambiguity-Aware Emotion Recognition Applied to Speech and Text

Jingyao Wu, Grace Lin, Yinuo Song, Rosalind Picard · 2026

Emotion recognition is inherently ambiguous, with uncertainty arising both from rater disagreement and from discrepancies across modalities such as speech and text. There is growing interest in modeli…

A Novel Preprocessing Unit for Effective Deep Learning based Classification and Grading of Diabetic Retinopathy

Pranoti Nage, Sanjay Shitole · 2025

Early detection of diabetic retinopathy (DR) is crucial as it allows for timely intervention, preventing vision loss and enabling effective management of diabetic complications. This research performs…

Design of Input-Output Observers for a Population of Systems with Bounded Frequency-Domain Variation using $DK$-iteration

Timothy Everett Adams, James Richard Forbes · 2025

This paper proposes a linear input-output observer design methodology for a population of systems in which each observer uses knowledge of the linear time-invariant dynamics of the particular device. …

DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments

Qi Chen, Rui Liu, Kangtong Mo, Boli Zhang, Dezhi Yu · 2025

Trajectory planning for robotic manipulators operating in dynamic orbital debris environments poses significant challenges due to complex obstacle movements and uncertainties. This paper presents Deep…

Theoretical Analysis for the CommSense Measurement System

Sandip Jana, Amit Kumar Mishra, Mohammed Zafar Ali Khan · 2025

Future 6G networks are envisioned to blur the line between communication and sensing, leveraging ubiquitous OFDM waveforms for both high-throughput data and environmental awareness. In this work, we do a t…

Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction

Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Swarup Ranjan Behera, Pailla Balakrishna Reddy, Arun Balaji Buduru, Rajesh Sharma · 2025

In this study, we focus on Singing Voice Mean Opinion Score (SingMOS) prediction. Previous research has shown the performance benefit of using state-of-the-art (SOTA) pre-trained models (PTMs)…

Evaluating the Effectiveness of Pre-Trained Audio Embeddings for Classification of Parkinson's Disease Speech Data

Emmy Postma, Cristian Tejedor-Garcia · 2025

Speech impairments are prevalent biomarkers for Parkinson's Disease (PD), motivating the development of diagnostic techniques using speech data for clinical applications. Although deep acoustic featur…

Comparative Analysis of Unsupervised and Supervised Autoencoders for Nuclei Classification in Clear Cell Renal Cell Carcinoma Images

Fatemeh Javadian, Zahra Aminparast, Johannes Stegmaier, Abin Jose · 2025

This study explores the application of supervised and unsupervised autoencoders (AEs) to automate nuclei classification in clear cell renal cell carcinoma (ccRCC) images, a diagnostic task traditional…

Semantic Communication and Control Co-Design for Multi-Objective Distinct Dynamics

Abanoub M. Girgis, Hyowoon Seo, Mehdi Bennis · 2024

This letter introduces a machine-learning approach to learning the semantic dynamics of correlated systems with different control rules and dynamics. By leveraging the Koopman operator in an autoencod…

A novel brain registration model combining structural and functional MRI information

Baolong Li, Yuhu Shi, Lei Wang, Weiming Zeng, Changming Zhu · 2024

Although developed functional magnetic resonance imaging (fMRI) registration algorithms based on deep learning have achieved a certain degree of alignment of functional area, they underutilized fine s…

D-Net: Dynamic Large Kernel with Dynamic Feature Fusion for Volumetric Medical Image Segmentation

Jin Yang, Peijie Qiu, Yichi Zhang, Daniel S. Marcus, Aristeidis Sotiras · 2024

Hierarchical transformers have achieved significant success in medical image segmentation due to their large receptive field and capabilities of effectively leveraging global long-range contextual inf…

Catch Me If You Can: Combatting Fraud in Artificial Currency Based Government Benefits Programs

Devansh Jalota, Matthew Tsao, Marco Pavone · 2024

Artificial currencies have grown in popularity in many real-world resource allocation settings, gaining traction in government benefits programs like food assistance and transit benefits programs. How…

DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking and Loop-Closing

Hao Qu, Lilian Zhang, Jun Mao, Junbo Tie, Xiaofeng He, Xiaoping Hu, Yifei Shi, Changhao Chen · 2024

The performance of visual SLAM in complex, real-world scenarios is often compromised by unreliable feature extraction and matching when using handcrafted features. Although deep learning-based local f…

Anti-Delay Kalman Filter Fusion Algorithm for Vehicle-borne Sensor Network with Finite-Time Convergence

Hang Yu, Keren Dai, Haojie Li, Yao Zou, Xiang Ma, Shaojie Ma, He Zhang · 2022

In autonomous driving and obstacle avoidance, intelligent vehicles place higher demands on the precision of the relative vehicle state. For a vehicle-borne sensor network with time-varying transmission…

DDKtor: Automatic Diadochokinetic Speech Analysis

Yael Segal, Kasia Hitczenko, Matthew Goldrick, Adam Buchwald, Angela Roberts, Joseph Keshet · 2022

Diadochokinetic speech tasks (DDK), in which participants repeatedly produce syllables, are commonly used as part of the assessment of speech motor impairments. These studies rely on manual analyses t…

Estimating probabilistic dynamic origin-destination demands using multi-day traffic data on computational graphs

Wei Ma, Sean Qian · 2022

System-level decision making in transportation needs to understand day-to-day variation of network flows, which calls for accurate modeling and estimation of probabilistic dynamic travel demand on net…
