Pedro Aceves — Research Repository

AI & Data Science Preprint PDF DOI

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Xin Zhou, Dingkang Liang, Xiwu Chen, Feiyang Tan, Dingyuan Zhang, Hengshuang Zhao, Xiang Bai · 2026

Driving world models serve as a pivotal technology for autonomous driving by simulating environmental dynamics. However, existing approaches predominantly focus on future scene generation, often overl…

Read Paper →

AI & Data Science Preprint PDF DOI

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images

Vinayak Gupta, Chih-Hao Lin, Shenlong Wang, Anand Bhattad, Jia-Bin Huang · 2026

Reconstructing 3D scenes from sparse, unposed images remains challenging under real-world conditions with varying illumination and transient occlusions. Existing methods rely on scene-specific optimiz…

Read Paper →

Engineering Preprint PDF DOI

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models

Hao Chen, Jiaming Liu, Zhonghao Yan, Nuowei Han, Renrui Zhang, Chenyang Gu, Jialin Gao, Ziyu Guo, Siyuan Qian, Yinxi Wang, Peng Jia, Chi-Wing Fu, Shanghang Zhang, Pheng-Ann Heng · 2026

Vision-Language-Action (VLA) models have increasingly incorporated reasoning mechanisms for complex robotic manipulation. However, existing approaches share a critical limitation: whether employing ex…

Read Paper →

AI & Data Science Preprint PDF DOI

Stop Holding Your Breath: CT-Informed Gaussian Splatting for Dynamic Bronchoscopy

Andrea Dunn Beltran, Daniel Rho, Aarav Mehta, Xinqi Xiong, Raul San Jose Estepar, Ron Alterovitz, Marc Niethammer, Roni Sengupta · 2026

Bronchoscopic navigation relies on registering endoscopic video to a preoperative CT scan, but respiratory motion deforms the airway by 5-20 mm, creating CT-to-body divergence that limits localization…

Read Paper →

AI & Data Science Preprint PDF DOI

Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements

Genki Kinoshita, Shu Nakamura, Ryo Kawahara, Shohei Nobuhara, Yasutomo Kawanishi, Ko Nishino · 2026

Effective human behavior modeling requires a representation of the human body movement that capitalizes on its compositionality. We propose a hierarchical representation consisting of Action Atoms tha…

Read Paper →

Engineering Preprint PDF DOI

RopeDreamer: A Kinematic Recurrent State Space Model for Dynamics of Flexible Deformable Linear Objects

Tim Missal, Lucas Domingues, Berk Guler, Simon Manschitz, Jan Peters, Paula Dornhofer Paro Costa · 2026

The robotic manipulation of Deformable Linear Objects (DLOs) is a fundamental challenge due to the high-dimensional, non-linear dynamics of flexible structures and the complexity of maintaining topolo…

Read Paper →

Physics Preprint PDF DOI

Optimal current-based sensing of phonon temperature using a finite reservoir

Sindre Brattegard, Stephanie Matern, Mark T. Mitchison, Saulo V. Moreira · 2026

In realistic nanoscale transport set-ups, electron-phonon coupling leads to the exchange of heat between phonon baths and electronic reservoirs with finite heat capacities. Such exchange affects the f…

Read Paper →

Computer Science Preprint PDF DOI

Optimal Transmitter Placement in Realistic Urban Environments

Lukas Taus, Richard Tsai, Jeffrey G. Andrews · 2026

In a wireless network, the spatial location of the transmitters has a large impact on the achievable rate at each user location. The optimal placement of -- for example -- cellular base stations is a …

Read Paper →

Mathematics Preprint PDF DOI

Beyond first-order accuracy in continuous-forcing immersed boundary methods, and their well-conditioned projection-based solution

Diederik Beckers, H. Jane Bae, Andres Goza · 2026

We introduce a refined immersed boundary (IB) methodology that is better-than-first-order accurate in practice, while preserving key properties of "continuous-forcing" IB approaches that retain a sing…

Read Paper →

Computer Science Preprint PDF DOI

Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing

Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini · 2026

Multivector retrieval models achieve state-of-the-art effectiveness through fine-grained token-level representations, but their deployment incurs substantial computational and memory costs. Current so…

Read Paper →

AI & Data Science Preprint PDF DOI

Beyond Pixel Fidelity: Minimizing Perceptual Distortion and Color Bias in Night Photography Rendering

Furkan K{i}nl{i} · 2026

Night Photography Rendering (NPR) poses a significant challenge due to the extreme contrast between dark and illuminated areas in scenes, stemming from concurrent capture of severely dark regions alon…

Read Paper →

Computer Science Preprint PDF DOI

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

Prashant Kulkarni · 2026

Multi-turn prompt injection follows a known attack path -- trust-building, pivoting, escalation but text-level defenses miss covert attacks where individual turns appear benign. We show this attack pa…

Read Paper →

Engineering Preprint PDF DOI

FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction

Zeyu Jiang, Changqing Zhou, Xingxing Zuo, Changhao Chen · 2026

Existing learning-based occupancy prediction methods rely on large-scale 3D annotations and generalize poorly across environments. We present FreeOcc, a training-free framework for open-vocabulary occ…

Read Paper →

Computer Science Preprint PDF DOI

Succinct Graph Representations and Algorithmic Applications

Ahammed Ullah, Alex Pothen · 2026

We propose new graph representations that exploit dense local structure to improve time and space simultaneously. Given an undirected graph $G$, we define a dual clique cover (DCC) representation of $…

Read Paper →

Computer Science Preprint PDF DOI

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

Muhammad Ihsan Al Hafiz, Artur Podobas · 2026

Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchron…

Read Paper →

AI & Data Science Preprint PDF DOI

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Jialu Shen, Han Lyu, Suyang Zhong, Hanzheng Li, Haoyi Tao, Nan Wang, Changhong Chen, Xi Fang · 2026

Spectra are a prevalent yet highly information-dense form of scientific imagery, presenting substantial challenges to multimodal large language models (MLLMs) due to their unstructured and domain-spec…

Read Paper →

AI & Data Science Preprint PDF DOI

Shuffling-Aware Optimization for Private Vector Mean Estimation

Shun Takagi, Seng Pei Liew · 2026

We study $d$-dimensional unbiased mean estimation in the single-message shuffle model, where each user sends a single privatized message and the analyzer only observes the shuffled multiset of reports…

Read Paper →

Engineering Preprint PDF DOI

Design Structure Matrix Modularization with Large Language Models

Shuo Jiang, Jianxi Luo · 2026

Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Traditional methods treat …

Read Paper →

AI & Data Science Preprint PDF DOI

Faster 3D Gaussian Splatting Convergence via Structure-Aware Densification

Linjie Lyu, Ayush Tewari, Jianchun Chen, Thomas Leimkuhler, Christian Theobalt · 2026

3D Gaussian Splatting has emerged as a powerful scene representation for real-time novel-view synthesis. However, its standard adaptive density control relies on screen-space positional gradients, whi…

Read Paper →

AI & Data Science Preprint PDF DOI

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

Shijin Gong, Kai Ye, Jin Zhu, Xinyu Zhang, Hongyi Zhou, Chengchun Shi · 2026

Recent advances in large language models (LLMs) have increasingly relied on reinforcement learning (RL) to improve their reasoning capabilities. Three approaches have been widely adopted: (i) Proximal…

Read Paper →

Browse Research Papers

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images

LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models

Stop Holding Your Breath: CT-Informed Gaussian Splatting for Dynamic Bronchoscopy

Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements

RopeDreamer: A Kinematic Recurrent State Space Model for Dynamics of Flexible Deformable Linear Objects

Optimal current-based sensing of phonon temperature using a finite reservoir

Optimal Transmitter Placement in Realistic Urban Environments

Beyond first-order accuracy in continuous-forcing immersed boundary methods, and their well-conditioned projection-based solution

Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing

Beyond Pixel Fidelity: Minimizing Perceptual Distortion and Color Bias in Night Photography Rendering

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection

FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction

Succinct Graph Representations and Algorithmic Applications

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Shuffling-Aware Optimization for Private Vector Mean Estimation

Design Structure Matrix Modularization with Large Language Models

Faster 3D Gaussian Splatting Convergence via Structure-Aware Densification

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

Browse by Category

Research Type

Publish Your Research