Databases in Engineering — Research Repository

Engineering Preprint PDF DOI

Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models

Ruchao Fan, Natarajan Balaji Shankar, Abeer Alwan · 2024

Speech foundation models (SFMs) have achieved state-of-the-art results for various speech tasks in supervised (e.g. Whisper) or self-supervised systems (e.g. WavLM). However, the performance of SFMs f…

Global Crop-Specific Fertilization Dataset from 1961-2019

Fernando Coello, Thomas Decorte, Iris Janssens, Steven Mortier, Jordi Sardans, Josep Penuelas, Tim Verdonck · 2024

As global fertilizer application rates increase, high-quality datasets are paramount for comprehensive analyses to support informed decision-making and policy formulation in crucial areas such as food…

Knowledge Graphs in the Digital Twin: A Systematic Literature Review About the Combination of Semantic Technologies and Simulation in Industrial Automation

Franz Georg Listl, Daniel Dittler, Gary Hildebrandt, Valentin Stegmaier, Nasser Jazdi, Michael Weyrich · 2024

The ongoing digitization of the industrial sector has reached a pivotal juncture with the emergence of Digital Twins, offering a digital representation of physical assets and processes. One key aspect…

A Hybrid Task-Constrained Motion Planning for Collaborative Robots in Intelligent Remanufacturing

Wansong Liu, Chang Liu, Xiao Liang, Minghui Zheng · 2024

Industrial manipulators have extensively collaborated with human operators to execute tasks, e.g., disassembly of end-of-use products, in intelligent remanufacturing. A safety task execution requires …

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

Ashishkumar Gudmalwar, Nirmesh Shah, Sai Akarsh, Pankaj Wasnik, Rajiv Ratn Shah · 2024

Despite the significant advancements in Text-to-Speech (TTS) systems, their full utilization in automatic dubbing remains limited. This task necessitates the extraction of voice identity and emotional…

Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling · 2024

We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer's Disease (AD) detec…

A preprocessing-based planning framework for utilizing contacts in high-precision insertion tasks

Muhammad Suhail Saleem, Rishi Veerapaneni, Maxim Likhachev · 2024

In manipulation tasks like plug insertion or assembly that have low tolerance to errors in pose estimation (errors of the order of 2mm can cause task failure), the utilization of touch/contact modalit…

The Database and Benchmark for the Source Speaker Tracing Challenge 2024

Ze Li, Yuke Lin, Tian Yao, Hongbin Suo, Pengyuan Zhang, Yanzhen Ren, Zexin Cai, Hiromitsu Nishizaki, Ming Li · 2024

Voice conversion (VC) systems can transform audio to mimic another speaker's voice, thereby attacking speaker verification (SV) systems. However, ongoing studies on source speaker verification (SSV) a…

L-SFAN: Lightweight Spatially-focused Attention Network for Pain Behavior Detection

Jorge Ortigoso-Narro, Fernando Diaz-de-Maria, Mohammad Mahdi Dehshibi, Ana Tajadura-Jimenez · 2024

Chronic Low Back Pain (CLBP) afflicts millions globally, significantly impacting individuals' well-being and imposing economic burdens on healthcare systems. While artificial intelligence (AI) and dee…

Image and Video Quality Assessment using Prompt-Guided Latent Diffusion Models for Cross-Dataset Generalization

Shankhanil Mitra, Diptanu De, Shika Rao, Rajiv Soundararajan · 2024

The design of image and video quality assessment (QA) algorithms is extremely important to benchmark and calibrate user experience in modern visual systems. A major drawback of the state-of-the-art QA…

Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition

Alaa Nfissi, Wassim Bouachir, Nizar Bouguila, Brian Mishara · 2024

Speech emotion recognition (SER) has gained significant attention due to its several application fields, such as mental health, education, and human-computer interaction. However, the accuracy of SER …

MVAD: A Multiple Visual Artifact Detector for Video Streaming

Chen Feng, Duolikun Danier, Fan Zhang, Alex Mackin, Andrew Collins, David Bull · 2024

Visual artifacts are often introduced into streamed video content, due to prevailing conditions during content production and delivery. Since these can degrade the quality of the user's experience, it…

An Organic Weed Control Prototype using Directed Energy and Deep Learning

Deng Cao, Hongbo Zhang, Rajveer Dhillon · 2024

Organic weed control is vital to improve crop yield with a sustainable approach. In this work, a directed energy weed control robot prototype specifically designed for organic farms is proposed. The…

STHN: Deep Homography Estimation for UAV Thermal Geo-localization with Satellite Imagery

Jiuhong Xiao, Ning Zhang, Daniel Tortei, Giuseppe Loianno · 2024

Accurate geo-localization of Unmanned Aerial Vehicles (UAVs) is crucial for outdoor applications including search and rescue operations, power line inspections, and environmental monitoring. The vulne…

Feasibility of Privacy-Preserving Entity Resolution on Confidential Healthcare Datasets Using Homomorphic Encryption

Yixiang Yao, Joseph Cecil, Praveen Angyan, Neil Bahroos, Srivatsan Ravi · 2024

Patient datasets contain confidential information which is protected by laws and regulations such as HIPAA and GDPR. Ensuring comprehensive patient information necessitates privacy-preserving entity r…

CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths

Reihaneh Teimouri, Marta Kersten-Oertel, Yiming Xiao · 2024

Efficient and accurate brain ventricle segmentation from clinical CT scans is critical for emergency surgeries like ventriculostomy. With the challenges in poor soft tissue contrast and a scarcity of …

Using a Convolutional Neural Network and Explainable AI to Diagnose Dementia Based on MRI Scans

Tyler Morris, Ziming Liu, Longjian Liu, Xiaopeng Zhao · 2024

As the number of dementia patients rises, the need for accurate diagnostic procedures rises as well. Current methods, like using an MRI scan, rely on human input, which can be inaccurate. However, the…

A better approach to diagnose retinal diseases: Combining our Segmentation-based Vascular Enhancement with deep learning features

Yuzhuo Chen, Zetong Chen, Yuanyuan Liu · 2024

Abnormalities in retinal fundus images may indicate certain pathologies such as diabetic retinopathy, hypertension, stroke, glaucoma, retinal macular edema, venous occlusion, and atherosclerosis, maki…

Verification technology for finger vein biometric

George Kumi Kyeremeh, M. Abdul-Al, R. Qahwaji, R.A. Abd-Alhameed · 2024

Finger vein biometrics is an approach to identifying individuals based on the unique patterns of blood vessels in their fingers, and the technology is advanced in image capture and processing techniqu…

Guidelines for evaluation of complex multi agent test scenarios

Ana Isabel Garcia Guerra, Teng Sung Shiuan · 2024

To support the testing of AVs, CETRAN has created a guideline for the evaluation of complex multi agent test scenarios presented in this report. This allows for a clear structured manner in evaluating…
