Edoardo Salati in Engineering — Research Repository

Engineering Preprint PDF DOI

Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings

Weixin Liu, Bowen Qu, Amy Stone, Maria E. Powell, Shama Dufresne, Stephane Braun, Izabela Galdyn, Michael Golinko, Bradley Malin, Zhijun Yin, Matthew E. Pontell · 2026

Velopharyngeal dysfunction (VPD) is characterized by inadequate velopharyngeal closure during speech and often causes hypernasality and reduced intelligibility. Although speech-based machine learning …

Read Paper →

Engineering Preprint PDF DOI

SAATT Nav: a Socially Aware Autonomous Transparent Transportation Navigation Framework for Wheelchairs

Yutong Zhang, Shaiv Y. Mehra, Bradley S. Duerstock, Juan P. Wachs · 2026

While powered wheelchairs reduce physical fatigue as opposed to manual wheelchairs for individuals with mobility impairment, they demand high cognitive workload due to information processing, decision…

Read Paper →

Engineering Preprint PDF DOI

Quantifying resilience for distribution system customers with SALEDI

Arslan Ahmad, Ian Dobson · 2026

The impact of routine smaller outages on distribution system customers in terms of customer minutes interrupted can be tracked using conventional reliability indices. However, the customer minutes int…

Read Paper →

Engineering Preprint PDF DOI

Generative Artificial Intelligence creates delicious, sustainable, and nutritious burgers

Vahidullah Tac, Christopher Gardner, Ellen Kuhl · 2026

Food choices shape both human and planetary health; yet, designing foods that are delicious, nutritious, and sustainable remains challenging. Here we show that generative artificial intelligence can l…

Read Paper →

Engineering Preprint PDF DOI

Prosthetic Hand Manipulation System Based on EMG and Eye Tracking Powered by the Neuromorphic Processor AltAi

Roman Akinshin, Elizaveta Lopatina, Kirill Bogatikov, Nikolai Kiz, Anna V. Makarova, Mikhail Lebedev, Miguel Altamirano Cabrera, Dzmitry Tsetserukou, Valerii Kangler · 2026

This paper presents a novel neuromorphic control architecture for upper-limb prostheses that combines surface electromyography (sEMG) with gaze-guided computer vision. The system uses a spiking neural…

Read Paper →

Engineering Preprint PDF DOI

Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats

Simeon Adebola, Chung Min Kim, Justin Kerr, Shuangyu Xie, Prithvi Akella, Jose Luis Susa Rincon, Eugen Solowjow, Ken Goldberg · 2025

Commercial plant phenotyping systems using fixed cameras cannot perceive many plant details due to leaf occlusion. In this paper, we present Botany-Bot, a system for building detailed "annotated digit…

Read Paper →

Engineering Preprint PDF DOI

SALAD-VAE: Semantic Audio Compression with Language-Audio Distillation

Sebastian Braun, Hannes Gamper, Dimitra Emmanouilidou · 2025

Modern generative and multimodal models increasingly rely on compact latent representations that trade and balance semantic richness with high-fidelity reconstruction. We introduce SALAD-VAE, a contin…

Read Paper →

Engineering Preprint PDF DOI

Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification

Susmita Bhattacharjee, Jagabandhu Mishra, H.S. Shekhawat, S. R. Mahadeva Prasanna · 2025

We propose the use of parameter-efficient fine-tuning (PEFT) of foundation models for cleft lip and palate (CLP) detection and severity classification. In CLP, nasalization increases with severity due…

Read Paper →

Engineering Preprint PDF DOI

GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats

Simeon Adebola, Shuangyu Xie, Chung Min Kim, Justin Kerr, Bart M. van Marrewijk, Mieke van Vlaardingen, Tim van Daalen, E.N. van Loo, Jose Luis Susa Rincon, Eugen Solowjow, Rick van de Zedde, Ken Goldberg · 2025

Accurate temporal reconstructions of plant growth are essential for plant phenotyping and breeding, yet remain challenging due to complex geometries, occlusions, and non-rigid deformations of plants. …

Read Paper →

Engineering Preprint PDF DOI

FOCI: Trajectory Optimization on Gaussian Splats

Mario Gomez Andreu, Maximum Wilder-Smith, Victor Klemm, Vaishakh Patil, Jesus Tordesillas, Marco Hutter · 2025

3D Gaussian Splatting (3DGS) has recently gained popularity as a faster alternative to Neural Radiance Fields (NeRFs) in 3D reconstruction and view synthesis methods. Leveraging the spatial informatio…

Read Paper →

Engineering Preprint PDF DOI

SatAOI: Delimitating Area of Interest for Swing-Arm Troweling Robot for Construction

Jia-Rui Lin, Shaojie Zhou, Peng Pan, Ruijia Cai, Gang Chen · 2025

In concrete troweling for building construction, robots can significantly reduce workload and improve automation level. However, as a primary task of coverage path planning (CPP) for troweling, delimi…

Read Paper →

Engineering Preprint PDF DOI

Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech

Susmita Bhattacharjee, Jagabandhu Mishra, H.S. Shekhawat, S. R. Mahadeva Prasanna · 2025

Speech produced by individuals with cleft lip and palate (CLP) is often highly nasalized and breathy due to structural anomalies, causing shifts in formant structure that affect automatic speech recog…

Read Paper →

Engineering Preprint PDF DOI

Perceptual Implications of Automatic Anonymization in Pathological Speech

Soroosh Tayebi Arasteh, Saba Afza, Tri-Thien Nguyen, Lukas Buess, Maryam Parvin, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Hiu Ching Hung, Mahshad Lotfinia, Thomas Gorges, Elmar Noeth, Maria Schuster, Seung Hee Yang, Andreas Maier · 2025

Automatic anonymization techniques are essential for ethical sharing of pathological speech data, yet their perceptual consequences remain understudied. We present a comprehensive human-centered analy…

Read Paper →

Engineering Preprint PDF DOI

GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats

Kai Deng, Yigong Zhang, Jian Yang, Jin Xie · 2025

Tracking and mapping in large-scale, unbounded outdoor environments using only monocular RGB input presents substantial challenges for existing SLAM systems. Traditional Neural Radiance Fields (NeRF) …

Read Paper →

Engineering Preprint PDF DOI

Visualization of Organ Movements Using Automatic Region Segmentation of Swallowing CT

Yukihiro Michiwaki, Takahiro Kikuchi, Takashi Ijiri, Yoko Inamoto, Hiroshi Moriya, Takumi Ogawa, Ryota Nakatani, Yuto Masaki, Yoshito Otake, Yoshinobu Sato · 2025

This study presents the first report on the development of an artificial intelligence (AI) for automatic region segmentation of four-dimensional computer tomography (4D-CT) images during swallowing. T…

Read Paper →

Engineering Preprint PDF DOI

Geometric Deep Learning for Automated Landmarking of Maxillary Arches on 3D Oral Scans from Newborns with Cleft Lip and Palate

Artur Agaronyan, HyeRan Choo, Marius Linguraru, Syed Muhammad Anwar · 2025

Rapid advances in 3D model scanning have enabled the mass digitization of dental clay models. However, most clinicians and researchers continue to use manual morphometric analysis methods on these mod…

Read Paper →

Engineering Preprint PDF DOI

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

Kevin Miao, Harsh Agrawal, Qihang Zhang, Federico Semeraro, Marco Cavallo, Jiatao Gu, Alexander Toshev · 2024

Generating high-quality 3D content requires models capable of learning robust distributions of complex scenes and the real-world objects within them. Recent Gaussian-based 3D reconstruction techniques…

Read Paper →

Engineering Preprint PDF DOI

SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching

Arjun P S, Andrew Melnik, Gora Chand Nandi · 2024

Experience Goal Visual Rearrangement task stands as a foundational challenge within Embodied AI, requiring an agent to construct a robust world model that accurately captures the goal state. The agent…

Read Paper →

Engineering Preprint PDF DOI

Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot

Justin Yu, Kush Hari, Kishore Srinivas, Karim El-Refai, Adam Rashid, Chung Min Kim, Justin Kerr, Richard Cheng, Muhammad Zubair Irshad, Ashwin Balakrishna, Thomas Kollar, Ken Goldberg · 2024

Building semantic 3D maps is valuable for searching for objects of interest in offices, warehouses, stores, and homes. We present a mapping system that incrementally builds a Language-Embedded Gaussia…

Read Paper →

Engineering Preprint PDF DOI

Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech

Hong Nguyen, Sean Foley, Kevin Huang, Xuan Shi, Tiantian Feng, Shrikanth Narayanan · 2024

Understanding speech production both visually and kinematically can inform second language learning system designs, as well as the creation of speaking characters in video games and animations. In thi…

Read Paper →

Browse Research Papers

Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings

SAATT Nav: a Socially Aware Autonomous Transparent Transportation Navigation Framework for Wheelchairs

Quantifying resilience for distribution system customers with SALEDI

Generative Artificial Intelligence creates delicious, sustainable, and nutritious burgers

Prosthetic Hand Manipulation System Based on EMG and Eye Tracking Powered by the Neuromorphic Processor AltAi

Botany-Bot: Digital Twin Monitoring of Occluded and Underleaf Plant Structures with Gaussian Splats

SALAD-VAE: Semantic Audio Compression with Language-Audio Distillation

Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification

GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats

FOCI: Trajectory Optimization on Gaussian Splats

SatAOI: Delimitating Area of Interest for Swing-Arm Troweling Robot for Construction

Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech

Perceptual Implications of Automatic Anonymization in Pathological Speech

GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats

Visualization of Organ Movements Using Automatic Region Segmentation of Swallowing CT

Geometric Deep Learning for Automated Landmarking of Maxillary Arches on 3D Oral Scans from Newborns with Cleft Lip and Palate

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

SplatR : Experience Goal Visual Rearrangement with 3D Gaussian Splatting and Dense Feature Matching

Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot

Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech

Browse by Category

Research Type

Publish Your Research