Expertini Research Research

Browse Research Papers

571+ open-access research outputs.

โœ• Clear
๐Ÿ” ferhat erata ๐Ÿ“‚ Engineering
Showing 571 results for "ferhat erata" in Engineering
Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung ยท 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annotaโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

ATLAS: An Annotation Tool for Long-horizon Robotic Action Segmentation

Sergej Stanovcic, Daniel Sliwowski, Dongheui Lee ยท 2026

Annotating long-horizon robotic demonstrations with precise temporal action boundaries is crucial for training and evaluating action segmentation and manipulation policy learning methods. Existing annโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

LLM-Flax : Generalizable Robotic Task Planning via Neuro-Symbolic Approaches with Large Language Models

Seongmin Kim, Daegyu Lee ยท 2026

Deploying a neuro-symbolic task planner on a new domain today requires significant manual effort: a domain expert must author relaxation and complementary rules, and hundreds of training problems mustโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Quasi-Constant Modulus Design for Nonlinearity-Tolerant Geometric Shaped Four Dimensional Modulation Format

Junzhe Xiao, Zekun Niu, Lyu Li, Minghui Shi, Weisheng Hu, Lilin Yi ยท 2026

In this paper, the quasi-constant modulus (QCM) property is analyzed and leveraged in the design of nonlinearity-tolerant four-dimensional (4D) modulation formats. Accordingly, we propose a family of โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

VIDS: A Verified Imaging Dataset Standard for Medical AI

Joan S. Muthu, John Shalen ยท 2026

Medical imaging AI development is fundamentally dependent on annotated datasets, yet no existing standard provides machine-enforceable validation across dataset structure, annotation provenance, qualiโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

The Memory-Enhanced Gaussian Noise (MEGN) Model for Fiber-Optic Channels

Kaiquan Wu, Gabriele Liga, Marco Secondini, Stella Civelli, Hussam Batshon, Greg Raybon, Xi Chen, Alex Alvarado ยท 2026

The enhanced Gaussian noise (EGN) model is widely used for estimating the nonlinear interference (NLI) power accumulated in coherent fiber-optic transmission systems. Given a fixed fiber link, under tโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Precise Robot Command Understanding Using Grammar-Constrained Large Language Models

Xinyun Huo, Raghav Gnanasambandam, Xinyao Zhang ยท 2026

Human-robot collaboration in industrial settings requires precise and reliable communication to enhance operational efficiency. While Large Language Models (LLMs) understand general language, they oftโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model

Peiyan Li, Yixiang Chen, Yuan Xu, Jiabing Yang, Xiangnan Wu, Jun Guo, Nan Sun, Long Qian, Xinghang Li, Xin Xiao, Jing Liu, Nianfeng Liu, Tao Kong, Yan Huang, Liang Wang, Tieniu Tan ยท 2026

Robotic manipulation requires understanding both the 3D spatial structure of the environment and its temporal evolution, yet most existing policies overlook one or both. They typically rely on 2D visuโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models

Xiaosong Jia, Yuqian Shao, Zhenjie Yang, Qifeng Li, Zhiyuan Zhang, Junchi Yan ยท 2026

With the rise of vision-language models (VLM), their application for autonomous driving (VLM4AD) has gained significant attention. Meanwhile, in autonomous driving, closed-loop evaluation has become wโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

RFSS: A Multi-Standard RF Signal Source Separation Dataset with 3GPP-Standardized Channel and Hardware Impairments

Hao Chen, Rui Jin, Dayuan Tan ยท 2026

The coexistence of heterogeneous cellular standards (2G-5G) in shared spectrum demands sophisticated RF source separation techniques, yet no public dataset exists for data-driven research on this probโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale

Chengkun Li, Cheryl Wang, Bianca Ziliotto, Merkourios Simos, Jozsef Kovecses, Guillaume Durandau, Alexander Mathis ยท 2026

Learning motor control for muscle-driven musculoskeletal models is hindered by the computational cost of biomechanically accurate simulation and the scarcity of validated, open full-body models. Here โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Underwater imaging without color distortions requires RAW capture

Derya Akkaynak, Michael S. Brown ยท 2026

Consumer cameras are ubiquitous in aquatic sciences because they are affordable and easy to use, generating vast collections of underwater imagery for ecosystem surveys, monitoring, mapping, and animaโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

GenMFSR: Generative Multi-Frame Image Restoration and Super-Resolution

Harshana Weligampola, Joshua Peter Ebenezer, Weidi Liu, Abhinau K. Venkataramanan, Sreenithy Chandran, Seok-Jun Lee, Hamid Rahim Sheikh ยท 2026

Camera pipelines receive raw Bayer-format frames that need to be denoised, demosaiced, and often super-resolved. Multiple frames are captured to utilize natural hand tremors and enhance resolution. Muโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Faulty Coffees: Barriers to Adoption of an In-the-wild Robo-Barista

Bruce W. Wilson, David A. Robb, Mei Yii Lim, Helen Hastie, Matthew Peter Aylett, Theodoros Georgiou ยท 2026

We set out to study whether task-based narratives could influence long-term engagement with a service robot. To do so, we deployed a Robo-Barista for five weeks in an over-50's housing complex in Stocโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Geometry-Aligned LLM Fine-Tuning for Sequential Narrow-Opening Planning

Al Jaber Mahmud, Xuan Wang ยท 2026

We study rigid-body motion planning through multiple sequential narrow openings, which requires long-horizon geometric reasoning because the configuration used to traverse an early opening constrains โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Creating manufacturable blueprints for coarse-grained virtual robots

Zihan Guo, Muhan Li, Shuzhe Zhang, Sam Kriegman ยท 2026

Over the past three decades, countless embodied yet virtual agents have freely evolved inside computer simulations, but vanishingly few were realized as physical robots. This is because evolution was โ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

SPARK: Skeleton-Parameter Aligned Retargeting on Humanoid Robots with Kinodynamic Trajectory Optimization

Hanwen Wang, Qiayuan Liao, Bike Zhang, Kunzhao Ren, Koushil Sreenath, Xiaobin Xiong ยท 2026

Human motion provides rich priors for training general-purpose humanoid control policies, but raw demonstrations are often incompatible with a robot's kinematics and dynamics, limiting their direct usโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Safe or Slow? The Illusion of Thermal Stability Under Reduced-Velocity Nail Intrusion

Eymen Ipek, Oliver Korak, Georg Gsellmann, Andrey Golubkov ยท 2026

This study investigates the effects of nail penetration speed on the safety outcomes of large-format automotive lithium-ion pouch cells. Through six controlled tests varying the speed of nail insertioโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

Reasoning Knowledge-Gap in Drone Planning via LLM-based Active Elicitation

Zeyu Fang, Beomyeol Yu, Cheng Liu, Zeyuan Yang, Rongqian Chen, Yuxin Lin, Mahdi Imani, Tian Lan ยท 2026

Human-AI joint planning in Unmanned Aerial Vehicles (UAVs) typically relies on control handover when facing environmental uncertainties, which is often inefficient and cognitively demanding for non-exโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

A Low-Complexity PFA-Based Autofocus Algorithm for Automotive SAR

S. Hamed Javadi, Andre Bourdoux, Adnan Albaba, Hichem Sahli ยท 2026

Radars provide robust perception of vehicle surroundings by effectively functioning in poor light and adverse weather conditions. Synthetic aperture radar (SAR) algorithms are employed to address the โ€ฆ

Read Paper โ†’
Page 1 of 29 Next โ†’