Chandan Saha in Engineering — Research Repository

Engineering Preprint PDF DOI

$AutoDrive\text{-}P^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning

Yuqi Ye, Zijian Zhang, Junhong Lin, Shangkun Sun, Changhao Peng, Wei Gao · 2026

Vision-language models (VLMs) are increasingly being adopted for end-to-end autonomous driving systems due to their exceptional performance in handling long-tail scenarios. However, current VLM-based …

Read Paper →

Engineering Preprint PDF DOI

Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments

Haoyuan Li, Rui Liu, Hehe Fan, Yi Yang · 2026

Vision-Language Navigation in Continuous Environments (VLN-CE) requires agents to learn complex reasoning from long-horizon human interactions. While Multi-modal Large Language Models (MLLMs) have dri…

Read Paper →

Engineering Preprint PDF DOI

Molecular Dynamics Simulations Reveal PolyQ-Length-Dependent Conformational Changes in Huntingtin Exon-1: Implications for Environmental Co-Solvent Modulation of Aggregation-Prone States

Jai Geddes-Nelson, Xiaochen Liu, Ken-Tye Yong · 2026

Huntington's disease (HD) is caused by CAG-repeat expansion in HTT, which lengthens the polyglutamine (polyQ) tract in huntingtin (HTT) and promotes misfolding and aggregation. While polyQ-length-depe…

Read Paper →

Engineering Preprint PDF DOI

ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification

Amro Asali, Yehuda Ben-Shimol, Itshak Lapidot · 2026

Spoofing-robust automatic speaker verification (SASV) seeks to build automatic speaker verification systems that are robust against both zero-effort impostor attacks and sophisticated spoofing techniq…

Read Paper →

Engineering Preprint PDF DOI

A Renderer-Enabled Framework for Computing Parameter Estimation Lower Bounds in Plenoptic Imaging Systems

Abhinav V. Sambasivan, Liam J. Coulter, Richard G. Paxman, Jarvis D. Haupt · 2026

This work focuses on assessing the information-theoretic limits of scene parameter estimation in plenoptic imaging systems. A general framework to compute lower bounds on the parameter estimation erro…

Read Paper →

Engineering Preprint PDF DOI

SAHA: Supervised Autonomous HArvester for selective forest thinning

Fang Nan, Meher Malladi, Qingqing Li, Fan Yang, Joonas Juola, Tiziano Guadagnino, Jens Behley, Cesar Cadena, Cyrill Stachniss, Marco Hutter · 2026

Forestry plays a vital role in our society, creating significant ecological, economic, and recreational value. Efficient forest management involves labor-intensive and complex operations. One essentia…

Read Paper →

Engineering Preprint PDF DOI

Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling

Yufan He, Pengfei Guo, Mengya Xu, Zhaoshuo Li, Andriy Myronenko, Dillan Imans, Bingjie Liu, Dongren Yang, Mingxue Gu, Yongnan Ji, Yueming Jin, Ren Zhao, Baiyong Shen, Daguang Xu · 2025

Data scarcity remains a fundamental barrier to achieving fully autonomous surgical robots. While large scale vision language action (VLA) models have shown impressive generalization in household and i…

Read Paper →

Engineering Preprint PDF DOI

Planetary Terrain Datasets and Benchmarks for Rover Path Planning

Marvin Chancan, Avijit Banerjee, George Nikolakopoulos · 2025

Planetary rover exploration is attracting renewed interest with several upcoming space missions to the Moon and Mars. However, a substantial amount of data from prior missions remain underutilized for…

Read Paper →

Engineering Preprint PDF DOI

SAGA: Open-World Mobile Manipulation via Structured Affordance Grounding

Kuan Fang, Yuxin Chen, Xinghao Zhu, Farzad Niroui, Lingfeng Sun, Jiuguang Wang · 2025

We present SAGA, a versatile and adaptive framework for visuomotor control that can generalize across various environments, task objectives, and user specifications. To efficiently learn such capabili…

Read Paper →

Engineering Preprint PDF DOI

CT-CFAR A Robust CFAR Detector Based on CLEAN and Truncated Statistics in Sidelobe-Contaminated Environments

Jiachen Zhu, Fangjiong Chen, Jie Wu, Ming Xia · 2025

This paper proposes a constant false alarm rate (CFAR) target detection algorithm based on the CLEAN concept and truncated statistics to mitigate the non-homogeneity of reference samples caused by sid…

Read Paper →

Engineering Preprint PDF DOI

Spatially anchored Tactile Awareness for Robust Dexterous Manipulation

Jialei Huang, Yang Ye, Yuanqing Gong, Xuezhou Zhu, Yang Gao, Kaifeng Zhang · 2025

Dexterous manipulation requires precise geometric reasoning, yet existing visuo-tactile learning methods struggle with sub-millimeter precision tasks that are routine for traditional model-based appro…

Read Paper →

Engineering Preprint PDF DOI

SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution

Jaekwon Im, Juhan Nam · 2025

Versatile audio super-resolution (SR) aims to predict high-frequency components from low-resolution audio across diverse domains such as speech, music, and sound effects. Existing diffusion-based SR m…

Read Paper →

Engineering Preprint PDF DOI

EEND-SAA: Enrollment-Less Main Speaker Voice Activity Detection Using Self-Attention Attractors

Wen-Yung Wu, Pei-Chin Hsieh, Tai-Shih Chi · 2025

Voice activity detection (VAD) is essential in speech-based systems, but traditional methods detect only speech presence without identifying speakers. Target-speaker VAD (TS-VAD) extends this by detec…

Read Paper →

Engineering Preprint PDF DOI

Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models

Branislav Gerazov, Marcello Politi, Sebastien Bratieres · 2025

We explore the performance of several state-of-the-art automatic speech recognition (ASR) models on a large-scale Arabic speech dataset, the SADA (Saudi Audio Dataset for Arabic), which contains 668 h…

Read Paper →

Engineering Preprint PDF DOI

SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization

Junho Choi, Kihwan Ryoo, Jeewon Kim, Taeyun Kim, Eungchang Lee, Myeongwoo Jeong, Kevin Christiansen Marsim, Hyungtae Lim, Hyun Myung · 2025

Multi-robot localization is a crucial task for implementing multi-robot systems. Numerous researchers have proposed optimization-based multi-robot localization methods that use camera, IMU, and UWB se…

Read Paper →

Engineering Preprint PDF DOI

Sequential Attention-based Sampling for Histopathological Analysis

Tarun G, Naman Malpani, Gugan Thoppe, Sridharan Devarajan · 2025

Deep neural networks are increasingly applied in automated histopathology. Yet, whole-slide images (WSIs) are often acquired at gigapixel sizes, rendering them computationally infeasible to analyze en…

Read Paper →

Engineering Preprint PDF DOI

SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation

Yuqi Fan, Zhiyong Cui, Zhenning Li, Yilong Ren, Haiyang Yu · 2025

Reliable planning is crucial for achieving autonomous driving. Rule-based planners are efficient but lack generalization, while learning-based planners excel in generalization yet have limitations in …

Read Paper →

Engineering Preprint PDF DOI

ATMM-SAGA: Alternating Training for Multi-Module with Score-Aware Gated Attention SASV system

Amro Asali, Yehuda Ben-Shimol, Itshak Lapidot · 2025

The objective of automatic speaker verification (ASV) systems is to determine whether a given test speech utterance corresponds to a claimed enrolled speaker. These systems have a wide range of applic…

Read Paper →

Engineering Preprint PDF DOI

UNet with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning for Medical Image Segmentation

Saqib Qamar, Mohd Fazil, Parvez Ahmad, Shakir Khan, Abu Taha Zamani · 2025

Medical image segmentation plays an important role in various clinical applications; however, existing deep learning models face trade-offs between efficiency and accuracy. Convolutional Neural Networ…

Read Paper →

Engineering Preprint PDF DOI

SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning

Shiyue Zhao, Junzhi Zhang, Neda Masoud, Heye Huang, Xiaohui Hou, Chengkun He · 2025

Reliable collision avoidance under extreme situations remains a critical challenge for autonomous vehicles. While large language models (LLMs) offer promising reasoning capabilities, their application…

Read Paper →

Browse Research Papers

$AutoDrive\text{-}P^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning

Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments

Molecular Dynamics Simulations Reveal PolyQ-Length-Dependent Conformational Changes in Huntingtin Exon-1: Implications for Environmental Co-Solvent Modulation of Aggregation-Prone States

ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification

A Renderer-Enabled Framework for Computing Parameter Estimation Lower Bounds in Plenoptic Imaging Systems

SAHA: Supervised Autonomous HArvester for selective forest thinning

Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling

Planetary Terrain Datasets and Benchmarks for Rover Path Planning

SAGA: Open-World Mobile Manipulation via Structured Affordance Grounding

CT-CFAR A Robust CFAR Detector Based on CLEAN and Truncated Statistics in Sidelobe-Contaminated Environments

Spatially anchored Tactile Awareness for Robust Dexterous Manipulation

SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution

EEND-SAA: Enrollment-Less Main Speaker Voice Activity Detection Using Self-Attention Attractors

Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models

SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization

Sequential Attention-based Sampling for Histopathological Analysis

SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation

ATMM-SAGA: Alternating Training for Multi-Module with Score-Aware Gated Attention SASV system

UNet with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning for Medical Image Segmentation

SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning

Browse by Category

Research Type

Publish Your Research