Expertini Research Research

Browse Research Papers

64+ open-access research outputs.

โœ• Clear
๐Ÿ” chandan saha ๐Ÿ“‚ Engineering
Showing 64 results for "chandan saha" in Engineering
Engineering Preprint PDF DOI

$AutoDrive\text{-}P^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning

Yuqi Ye, Zijian Zhang, Junhong Lin, Shangkun Sun, Changhao Peng, Wei Gao ยท 2026

Vision-language models (VLMs) are increasingly being adopted for end-to-end autonomous driving systems due to their exceptional performance in handling long-tail scenarios. However, current VLM-based โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments

Haoyuan Li, Rui Liu, Hehe Fan, Yi Yang ยท 2026

Vision-Language Navigation in Continuous Environments (VLN-CE) requires agents to learn complex reasoning from long-horizon human interactions. While Multi-modal Large Language Models (MLLMs) have driโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Molecular Dynamics Simulations Reveal PolyQ-Length-Dependent Conformational Changes in Huntingtin Exon-1: Implications for Environmental Co-Solvent Modulation of Aggregation-Prone States

Jai Geddes-Nelson, Xiaochen Liu, Ken-Tye Yong ยท 2026

Huntington's disease (HD) is caused by CAG-repeat expansion in HTT, which lengthens the polyglutamine (polyQ) tract in huntingtin (HTT) and promotes misfolding and aggregation. While polyQ-length-depeโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification

Amro Asali, Yehuda Ben-Shimol, Itshak Lapidot ยท 2026

Spoofing-robust automatic speaker verification (SASV) seeks to build automatic speaker verification systems that are robust against both zero-effort impostor attacks and sophisticated spoofing techniqโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Renderer-Enabled Framework for Computing Parameter Estimation Lower Bounds in Plenoptic Imaging Systems

Abhinav V. Sambasivan, Liam J. Coulter, Richard G. Paxman, Jarvis D. Haupt ยท 2026

This work focuses on assessing the information-theoretic limits of scene parameter estimation in plenoptic imaging systems. A general framework to compute lower bounds on the parameter estimation erroโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SAHA: Supervised Autonomous HArvester for selective forest thinning

Fang Nan, Meher Malladi, Qingqing Li, Fan Yang, Joonas Juola, Tiziano Guadagnino, Jens Behley, Cesar Cadena, Cyrill Stachniss, Marco Hutter ยท 2026

Forestry plays a vital role in our society, creating significant ecological, economic, and recreational value. Efficient forest management involves labor-intensive and complex operations. One essentiaโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling

Yufan He, Pengfei Guo, Mengya Xu, Zhaoshuo Li, Andriy Myronenko, Dillan Imans, Bingjie Liu, Dongren Yang, Mingxue Gu, Yongnan Ji, Yueming Jin, Ren Zhao, Baiyong Shen, Daguang Xu ยท 2025

Data scarcity remains a fundamental barrier to achieving fully autonomous surgical robots. While large scale vision language action (VLA) models have shown impressive generalization in household and iโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Planetary Terrain Datasets and Benchmarks for Rover Path Planning

Marvin Chancan, Avijit Banerjee, George Nikolakopoulos ยท 2025

Planetary rover exploration is attracting renewed interest with several upcoming space missions to the Moon and Mars. However, a substantial amount of data from prior missions remain underutilized forโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SAGA: Open-World Mobile Manipulation via Structured Affordance Grounding

Kuan Fang, Yuxin Chen, Xinghao Zhu, Farzad Niroui, Lingfeng Sun, Jiuguang Wang ยท 2025

We present SAGA, a versatile and adaptive framework for visuomotor control that can generalize across various environments, task objectives, and user specifications. To efficiently learn such capabiliโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

CT-CFAR A Robust CFAR Detector Based on CLEAN and Truncated Statistics in Sidelobe-Contaminated Environments

Jiachen Zhu, Fangjiong Chen, Jie Wu, Ming Xia ยท 2025

This paper proposes a constant false alarm rate (CFAR) target detection algorithm based on the CLEAN concept and truncated statistics to mitigate the non-homogeneity of reference samples caused by sidโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Spatially anchored Tactile Awareness for Robust Dexterous Manipulation

Jialei Huang, Yang Ye, Yuanqing Gong, Xuezhou Zhu, Yang Gao, Kaifeng Zhang ยท 2025

Dexterous manipulation requires precise geometric reasoning, yet existing visuo-tactile learning methods struggle with sub-millimeter precision tasks that are routine for traditional model-based approโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution

Jaekwon Im, Juhan Nam ยท 2025

Versatile audio super-resolution (SR) aims to predict high-frequency components from low-resolution audio across diverse domains such as speech, music, and sound effects. Existing diffusion-based SR mโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

EEND-SAA: Enrollment-Less Main Speaker Voice Activity Detection Using Self-Attention Attractors

Wen-Yung Wu, Pei-Chin Hsieh, Tai-Shih Chi ยท 2025

Voice activity detection (VAD) is essential in speech-based systems, but traditional methods detect only speech presence without identifying speakers. Target-speaker VAD (TS-VAD) extends this by detecโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models

Branislav Gerazov, Marcello Politi, Sebastien Bratieres ยท 2025

We explore the performance of several state-of-the-art automatic speech recognition (ASR) models on a large-scale Arabic speech dataset, the SADA (Saudi Audio Dataset for Arabic), which contains 668 hโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SaWa-ML: Structure-Aware Pose Correction and Weight Adaptation-Based Robust Multi-Robot Localization

Junho Choi, Kihwan Ryoo, Jeewon Kim, Taeyun Kim, Eungchang Lee, Myeongwoo Jeong, Kevin Christiansen Marsim, Hyungtae Lim, Hyun Myung ยท 2025

Multi-robot localization is a crucial task for implementing multi-robot systems. Numerous researchers have proposed optimization-based multi-robot localization methods that use camera, IMU, and UWB seโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Sequential Attention-based Sampling for Histopathological Analysis

Tarun G, Naman Malpani, Gugan Thoppe, Sridharan Devarajan ยท 2025

Deep neural networks are increasingly applied in automated histopathology. Yet, whole-slide images (WSIs) are often acquired at gigapixel sizes, rendering them computationally infeasible to analyze enโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation

Yuqi Fan, Zhiyong Cui, Zhenning Li, Yilong Ren, Haiyang Yu ยท 2025

Reliable planning is crucial for achieving autonomous driving. Rule-based planners are efficient but lack generalization, while learning-based planners excel in generalization yet have limitations in โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

ATMM-SAGA: Alternating Training for Multi-Module with Score-Aware Gated Attention SASV system

Amro Asali, Yehuda Ben-Shimol, Itshak Lapidot ยท 2025

The objective of automatic speaker verification (ASV) systems is to determine whether a given test speech utterance corresponds to a claimed enrolled speaker. These systems have a wide range of applicโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

UNet with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning for Medical Image Segmentation

Saqib Qamar, Mohd Fazil, Parvez Ahmad, Shakir Khan, Abu Taha Zamani ยท 2025

Medical image segmentation plays an important role in various clinical applications; however, existing deep learning models face trade-offs between efficiency and accuracy. Convolutional Neural Networโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning

Shiyue Zhao, Junzhi Zhang, Neda Masoud, Heye Huang, Xiaohui Hou, Chengkun He ยท 2025

Reliable collision avoidance under extreme situations remains a critical challenge for autonomous vehicles. While large language models (LLMs) offer promising reasoning capabilities, their applicationโ€ฆ

Read Paper โ†’
Page 1 of 4 Next โ†’