Michel Hebert in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Phase-Separated Complex Hilbert PCA on Markerless 3D Pose Estimation Data: A Global Phase Network and Its Extension to a Continuous Field on the Body Surface

Hiromitsu Goto, Tao Tao, Zheng-Lin Chia · 2026

Quantitative analysis of the kinematic chain in sports motion is essential for performance evaluation and injury prevention. Conventional methods such as the kinematic-sequence (KS) and continuous rel…

Read Paper →

Computer Science Preprint PDF DOI

FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion

Heman Shakeri, Behnaz Moradi-Jamei, Aram Vajdi, Ehsan Ardjmand · 2026

Non-Markovian (renewal) epidemic simulation on multi-million-node contact networks is essential for realistic forecasting under general age-dependent holding-time distributions (log-normal, Weibull, E…

Read Paper →

Computer Science Preprint PDF DOI

HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation

Jian Zhu, Jianwei Cui, Shihao Chen, Yubang Zhang, Cheng Luo · 2026

We present HAFM, a system that generates instrumental music audio to accompany input vocals. Given isolated singing voice, HAFM produces a coherent instrumental accompaniment that can be directly mixe…

Read Paper →

Computer Science Preprint PDF DOI

BRASP: Boolean Range Queries over Encrypted Spatial Data with Access and Search Pattern Privacy

Jing Zhang, Ganxuan Yang, Yifei Yang, Siqi Wen, Zhengyang Qiu · 2026

Searchable Encryption (SE) enables users to query outsourced encrypted data while preserving data confidentiality. However, most efficient schemes still leak the search pattern and access pattern, whi…

Read Paper →

Computer Science Preprint PDF DOI

Engineering Mythology: A Digital-Physical Framework for Culturally-Inspired Public Art

Jnaneshwar Das, Christopher Filkins, Rajesh Moharana, Ekadashi Barik, Bishweshwar Das, David Ayers, Christopher Skiba, Rodney Staggers Jr, Mark Dill, Swig Miller, Daniel Tulberg, Patrick Smith, Seth Brink, Kyle Breen, Harish Anand, Ramon Arrowsmith · 2026

Navagunjara Reborn: The Phoenix of Odisha was built for Burning Man 2025 as both a sculpture and an experiment-a fusion of myth, craft, and computation. This paper describes the digital-physical workf…

Read Paper →

Computer Science Preprint PDF DOI

Investigation on the Robustness of Acoustic Foundation Models on Post Exercise Speech

Xiangyuan Xue, Yuyu Wang, Ruijie Yao, Xiaoyue Ni, Xiaofan Jiang, Jingping Nie · 2026

Automatic speech recognition (ASR) has been extensively studied on neutral and stationary speech, yet its robustness under post-exercise physiological shift remains underexplored. Compared with restin…

Read Paper →

Computer Science Preprint PDF DOI

Visualizing Higher Order Structures, Overlap Regions, and Clustering in the Hilbert Geometry

Hridhaan Banerjee, Soren Brown, June Cagan, Auguste H. Gezalyan, Megan Hunleth, Veena Kailad, Chaewoon Kyoung, Rowan Shigeno, Yasmine Tajeddin, Andrew Wagger, Kelin Zhu, David M. Moun · 2026

Higher-order Voronoi diagrams and Delaunay mosaics in polygonal metrics have only recently been studied, yet no tools exist for visualizing them. We introduce a tool that fills this gap, providing dyn…

Read Paper →

Computer Science Preprint PDF DOI

MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

Zikang Huang, Meng Ge, Tianrui Wang, Xuanchen Li, Xiaobao Wang, Longbiao Wang, Jianwu Dang · 2026

Self-supervised learning (SSL) has advanced speech processing. However, existing speech SSL methods typically assume a single sampling rate and struggle with mixed-rate data due to temporal resolution…

Read Paper →

Computer Science Preprint PDF DOI

Hetero-Net: An Energy-Efficient Resource Allocation and 3D Placement in Heterogeneous LoRa Networks via Multi-Agent Optimization

Abdullahi Isa Ahmed, Ana Maria Dragulinescu, El Mehdi Amhoud · 2026

The evolution of Internet of Things (IoT) into multi-layered environments has positioned Low-Power Wide Area Networks (LPWANs), particularly Long Range (LoRa), as the backbone for connectivity across …

Read Paper →

Computer Science Preprint PDF DOI

Paralinguistic Emotion-Aware Validation Timing Detection in Japanese Empathetic Spoken Dialogue

Zi Haur Pang, Yahui Fu, Yuan Gao, Tatsuya Kawahara · 2026

Emotional Validation is a psychotherapy communication technique that involves recognizing, understanding, and explicitly acknowledging another person's feelings and actions, which strengthens alliance…

Read Paper →

Computer Science Preprint PDF DOI

Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR

Ajinkya Kulkarni, Sandipana Dowerah, Atharva Kulkarni, Tanel Alumae, Mathew Magimai Doss · 2026

Self-supervised learning (SSL) underpins modern audio deepfake detection, yet most prior work centers on a single large wav2vec2-XLSR backbone, leaving compact under studied. We present RAPTOR, Repres…

Read Paper →

Computer Science Preprint PDF DOI

Reproducing and Comparing Distillation Techniques for Cross-Encoders

Victor Morand, Mathias Vast, Basile Van Cooten, Laure Soulier, Josiane Mothe, Benjamin Piwowarski · 2026

Recent advances in Information Retrieval have established transformer-based cross-encoders as a keystone in IR. Recent studies have focused on knowledge distillation and showed that, with the right st…

Read Paper →

Computer Science Preprint PDF DOI

Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit Approach

Fabio Busacca, Andrea Panebianco, Yin Sun · 2026

Underwater Acoustic (UWA) networks are vital for remote sensing and ocean exploration but face inherent challenges such as limited bandwidth, long propagation delays, and highly dynamic channels. Thes…

Read Paper →

Computer Science Preprint PDF DOI

Multi-Channel Speech Enhancement for Cocktail Party Speech Emotion Recognition

Youjun Chen, Guinan Li, Mengzhe Geng, Xurong Xie, Shujie Hu, Huimeng Wang, Haoning Xu, Chengxi Deng, Jiajun Deng, Zhaoqing Li, Mingyu Cui, Xunying Liu · 2026

This paper highlights the critical importance of multi-channel speech enhancement (MCSE) for speech emotion recognition (ER) in cocktail party scenarios. A multi-channel speech dereverberation and sep…

Read Paper →

Computer Science Preprint PDF DOI

Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings

Sreejith Sreekumar, Nir Weinberger · 2026

Recent works have proposed various explanations for the ability of modern large language models (LLMs) to perform in-context prediction. We propose an alternative conceptual viewpoint from an informat…

Read Paper →

Computer Science Preprint PDF DOI

MICE: Minimal Interaction Cross-Encoders for efficient Re-ranking

Mathias Vast, Victor Morand, Basile van Cooten, Laure Soulier, Josiane Mothe, Benjamin Piwowarski · 2026

Cross-encoders deliver state-of-the-art ranking effectiveness in information retrieval, but have a high inference cost. This prevents them from being used as first-stage rankers, but also incurs a cos…

Read Paper →

Computer Science Preprint PDF DOI

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Aju Ani Justus, Ruchit Agrawal, Sudarsana Reddy Kadiri, Shrikanth Narayanan · 2026

We present voice2mode, a method for classification of four singing phonation modes (breathy, neutral (modal), flow, and pressed) using embeddings extracted from large self-supervised speech models. Pr…

Read Paper →

Computer Science Preprint PDF DOI

Labor, Capital, and Machine: Toward a Labor Process Theory for HCI

Yigang Qin, EunJeong Cheon · 2026

The HCI community has called for renewed attention to labor issues and the political economy of computing. Yet much work remains in engaging with labor theory to better understand modern work and work…

Read Paper →

Computer Science Preprint PDF DOI

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Georgii Aparin, Tasnima Sadekova, Alexey Rukhovich, Assel Yermekova, Laida Kushnareva, Vadim Popov, Kristian Kuznetsov, Irina Piontkovskaya · 2026

Sparse Autoencoders (SAEs) are powerful tools for interpreting neural representations, yet their use in audio remains underexplored. We train SAEs across all encoder layers of Whisper and HuBERT, prov…

Read Paper →

Computer Science Preprint PDF DOI

Classifiers in High Dimensional Hilbert Metrics

Aditya Acharya, Auguste H. Gezalyan, David M. Mount · 2026

Classifying points in high dimensional spaces is a fundamental geometric problem in machine learning. In this paper, we address classifying points in the $d$-dimensional Hilbert polygonal metric. The …

Read Paper →

Browse Research Papers

Phase-Separated Complex Hilbert PCA on Markerless 3D Pose Estimation Data: A Global Phase Network and Its Extension to a Continuous Field on the Body Surface

FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion

HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation

BRASP: Boolean Range Queries over Encrypted Spatial Data with Access and Search Pattern Privacy

Engineering Mythology: A Digital-Physical Framework for Culturally-Inspired Public Art

Investigation on the Robustness of Acoustic Foundation Models on Post Exercise Speech

Visualizing Higher Order Structures, Overlap Regions, and Clustering in the Hilbert Geometry

MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

Hetero-Net: An Energy-Efficient Resource Allocation and 3D Placement in Heterogeneous LoRa Networks via Multi-Agent Optimization

Paralinguistic Emotion-Aware Validation Timing Detection in Japanese Empathetic Spoken Dialogue

Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR

Reproducing and Comparing Distillation Techniques for Cross-Encoders

Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit Approach

Multi-Channel Speech Enhancement for Cocktail Party Speech Emotion Recognition

Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings

MICE: Minimal Interaction Cross-Encoders for efficient Re-ranking

voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models

Labor, Capital, and Machine: Toward a Labor Process Theory for HCI

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Classifiers in High Dimensional Hilbert Metrics

Browse by Category

Research Type

Publish Your Research