Goran Muric — Research Repository

Computer Science · Preprint

Real-Time Control of a Virtual Orchestra by Recognition of Conducting Gestures

Mert Mermerci, Emile Pascoe, Fredrik Edstrom, Hedvig Kjellstrom · 2026

We present a museum installation in a 180° dome theater, which gives the museum visitor the experience of conducting a symphony orchestra. We have pre-recorded a short music piece performed by a …

Engineering · Preprint

Sensing-Assisted Channel Estimation for Flexible-Antenna Systems: A Unified Framework

Ruoxiao Cao, Wentao Yu, Zixin Wang, Shenghui Song, Jun Zhang, Yi Gong, Khaled B. Letaief · 2026

Flexible-antenna systems, which use a small number of radio frequency (RF) chains to dynamically access a large set of candidate antenna locations, have emerged as a hardware-efficient architecture fo…

Physics · Preprint

MujicΛ: Reconstructing Initial Conditions from Incomplete Redshift Surveys with Projected Optimization

Chenze Dong, Benjamin Horowitz, Adrian E. Bayer, Khee-Gan Lee · 2026

In this paper, we introduce MujicΛ (Mapping the Universe with Jax-based Initial Condition ReconstrΛction), an optimization-based framework for reconstructing initial conditions from re…

Mathematics · Preprint

On Arithmetic Mirror Symmetry for smooth Fano fourfolds

Mikhail Ovcharenko · 2026

We introduce an explicit class of tempered Laurent polynomials in the sense of Villegas and Doran--Kerr in $n \leqslant 4$ variables including all Landau--Ginzburg models for smooth Fano threefolds wi…

Sociology & Anthropology · Preprint

Two-Dimensional Structural Characterization of Music Genre Communities in Playlist Co-occurrence Networks

Makoto Takeuchi · 2026

Music genre classification shapes how listeners discover music, how platforms design recommendations, and how sociologists study cultural taste. Yet existing genre labels are inconsistent in granulari…

Computer Science · Preprint

SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton

Xuzheng He, Nan Nan, Zhilin Wang, Ziyue Kang, Zhuoru Mo, Ao Li, Yu Pan, Xiaobing Li, Feng Yu, Xiaohong Guan · 2026

Generating symphonic music requires simultaneously managing high-level structural form and dense, multi-track orchestration. Existing symbolic models often struggle with a "complexity-control imbalanc…

Computer Science · Preprint

An event-based sequence modeling approach to recognizing non-triad chords with oversegmentation minimization

Leekyung Kim, Jonghun Park · 2026

Automatic chord recognition (ACR) extracts time-aligned chord labels from music audio recordings. Despite recent advances, ACR still struggles with oversegmentation, data scarcity, and imbalance, espe…

Computer Science · Preprint

From Players to Participants: Citizen Science and Video Games to Understand Cognition

Syrine Salouhou, Edgar Dubourg, Maxwell Scott-Slade, Hugo Spiers, Antoine Coutrot · 2026

Citizen science is transforming how cognitive scientists study the human mind, and video games are at the heart of this shift. By embedding experimental tasks into engaging, game-like experiences, res…

Computer Science · Preprint

MUSIC: Learning Muscle-Driven Dexterous Hand Control

Pei Xu, Yufei Ye, Shuchun Sun, Yu Ding, Elizabeth Schumann, C. Karen Liu · 2026

We present a data-driven approach for physics-based, muscle-driven dexterous control that enables musculoskeletal hands to perform precise piano playing for novel pieces of music outside the reference…

Computer Science · Preprint

Opening the Design Space: Two Years of Performance with Intelligent Musical Instruments

Charles Patrick Martin · 2026

Machine generation of symbolic music and digital audio are hot topics but there have been relatively few digital musical instruments that integrate generative AI. Present musical AI tools are not arti…

Computer Science · Preprint

CineAGI: Character-Consistent Movie Creation through LLM-Orchestrated Multi-Modal Generation and Cross-Scene Integration

Tianyidan Xie, Zhentao Huang, Mingjie Wang, Xin Huang, Jun Zhou, Minglun Gong, Zili Yi · 2026

Automated movie creation requires coordinating multiple characters, modalities, and narrative elements across extended sequences -- a challenge that existing end-to-end approaches struggle to address …

Neuroscience · Preprint

EyeBrain: Left and Right Brain Lateralization Activity Classification Through Pupil Diameter and Fixation Duration

Ko Watanabe, Pooja Pol, Nicolas Großmann, Shoya Ishimaru, Andreas Dengel · 2026

The relationship between brain lateralization and cognitive functions is well-documented. The left hemisphere primarily handles tasks such as language and arithmetic, while the right hemisphere is inv…

Computer Science · Preprint

Adopting State-of-the-Art Pretrained Audio Representations for Music Recommender Systems

Yan-Martin Tamm, Anna Aljanaki · 2026

Over the years, the Music Information Retrieval (MIR) research community has released various models pretrained on large amounts of music data. Transfer learning showcases the proven effectiveness of pret…

AI & Data Science · Preprint

Come Together: Analyzing Popular Songs Through Statistical Embeddings

Matthew Esmaili Mallory, Mark Glickman, Jason Brown · 2026

Statistical modeling of popular music presents a unique challenge due to the complexity of song structures, which cannot be easily analyzed using conventional statistical tools. However, recent advanc…

Computer Science · Preprint

Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations

Maximilian Wachter, Sebastian Murgul, Michael Heizmann · 2026

Rhythm transcription is a key subtask of notation-level Automatic Music Transcription (AMT). While deep learning models have been extensively used for detecting the metrical grid in audio and MIDI per…

Engineering · Preprint

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

Chunyu Qiang, Xiaopeng Wang, Kang Yin, Yuzhe Liang, Yuxin Guo, Teng Ma, Ziyu Zhang, Tianrui Wang, Cheng Gong, Yushen Chen, Ruibo Fu, Chen Zhang, Longbiao Wang, Jianwu Dang · 2026

Generative audio modeling has largely been fragmented into specialized tasks, text-to-speech (TTS), text-to-music (TTM), and text-to-audio (TTA), each operating under heterogeneous control paradigms. …

AI & Data Science · Preprint

Dilated CNNs for Periodic Signal Processing: A Low-Complexity Approach

Eli Gildish, Michael Grebshtein, Igor Makienko · 2026

Denoising of periodic signals and accurate waveform estimation are core tasks across many signal processing domains, including speech, music, medical diagnostics, radio, and sonar. Although deep learn…

AI & Data Science · Preprint

Listen and Chant Before You Read: The Ladder of Beauty in LM Pre-Training

Yoshinori Nomura · 2026

We show that pre-training a Transformer on music before language significantly accelerates language acquisition. Using piano performances (MAESTRO dataset), a developmental pipeline -- music $\to$ poe…

Computer Science · Preprint

ONOTE: Benchmarking Omnimodal Notation Processing for Expert-level Music Intelligence

Menghe Ma, Siqing Wei, Yuecheng Xing, Yaheng Wang, Fanhong Meng, Peijun Han, Luu Anh Tuan, Haoran Luo · 2026

Omnimodal Notation Processing (ONP) represents a unique frontier for omnimodal AI due to the rigorous, multi-dimensional alignment required across auditory, visual, and symbolic domains. Current resea…

Computer Science · Preprint

From Image to Music Language: A Two-Stage Structure Decoding Approach for Complex Polyphonic OMR

Nan Xu, Shiheng Li, Shengchao Hou · 2026

We propose a new approach for a practical two-stage Optical Music Recognition (OMR) pipeline, with a particular focus on its second stage. Given symbol and event candidates from the visual pipeline, w…
