Michael Codish in AI & Data Science — Research Repository

AI & Data Science Preprint PDF DOI

What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design

Ivan Bercovich · 2026

Terminal-agent benchmarks have become a primary signal for measuring the coding and system-administration capabilities of large language models. As the market for evaluation environments grows, so doe…

Read Paper →

AI & Data Science Preprint PDF DOI

TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement

Xiumei Li, Alexander Kopte, Andre Kaup · 2026

Scalable compression is essential for bandwidth-adaptive transmission, yet most learned codecs are optimized for a fixed rate-distortion point, making rate adaptation costly due to re-encoding or main…

Read Paper →

AI & Data Science Preprint PDF DOI

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

Jackson Vonderhorst, Kuangshi Ai, Haichao Miao, Shusen Liu, Chaoli Wang · 2026

This paper examines how different types of large language model (LLM) agents perform on scientific visualization (SciVis) tasks, where users generate visualization workflows from natural-language inst…

Read Paper →

AI & Data Science Preprint PDF DOI

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

Qiyao Wang, Haoran Hu, Longze Chen, Hongbo Wang, Hamid Alinejad-Rokny, Yuan Lin, Min Yang · 2026

With the advancement of multimodal large language models (MLLMs) and coding agents, the website development has shifted from manual programming to agent-based project-level code synthesis. Existing be…

Read Paper →

AI & Data Science Preprint PDF DOI

Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agents

Mehmet Iscan · 2026

Large language model (LLM)-based coding agents increasingly rely on external memory to reuse prior debugging experience, repair traces, and repository-local operational knowledge. However, retrieved m…

Read Paper →

AI & Data Science Preprint PDF DOI

Unpacking Vibe Coding: Help-Seeking Processes in Student-AI Interactions While Programming

Daiana Rinja, Eduardo Araujo Oliveira, Sonsoles Lopez-Pernas, Mohammed Saqr, Marcus Specht, Kamila Misiejuk · 2026

Generative AI is reshaping higher education programming through vibe coding, where students collaborate with AI via natural language rather than writing code line-by-line. We conceptualize this practi…

Read Paper →

AI & Data Science Preprint PDF DOI

Automated Detection of Mutual Gaze and Joint Attention in Dual-Camera Settings via Dual-Stream Transformers

Jakub Kosmydel, Pawe{l} Gajewski, Arkadiusz Bia{l}ek · 2026

Analyzing mutual gaze (MG) and joint attention (JA) is critical in developmental psychology but traditionally relies on labor-intensive manual coding. Automating this process in multi-camera laborator…

Read Paper →

AI & Data Science Preprint PDF DOI

ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python

Alexander Raistrick, Karhan Kayan, Jack Nugent, David Yan, Lingjie Mei, Meenal Parakh, Hongyu Wen, Dylan Li, Yiming Zuo, Erich Liang, Jia Deng · 2026

We introduce ProcFunc, a library for Blender-based procedural 3D generation in Python. ProcFunc provides a library of easy-to-use Python functions, which streamline creating, combining, analyzing, and…

Read Paper →

AI & Data Science Preprint PDF DOI

MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching

Shuzhao Xie, Junchen Ge, Weixiang Zhang, Jiahang Liu, Chen Tang, Yunpeng Bai, Shijia Ge, Jingyan Jiang, Yuzhi Huang, Fengnian Yang, Cong Zhang, Xiaoyi Fan, Zhi Wang · 2026

3D Gaussian Splatting (3DGS) achieves high-quality novel view synthesis with real-time rendering, but its storage cost remains prohibitive for practical deployment. Existing post-training compression …

Read Paper →

AI & Data Science Preprint PDF DOI

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

GLM-V Team: Wenyi Hong, Xiaotao Gu, Ziyang Pan, Zhen Yang, Yuting Wang, Yue Wang, Yuanchang Yue, Yu Wang, Yanling Wang, Yan Wang, Xijun Liu, Wenmeng Yu, Weihan Wang, Wei Li, Shuaiqi Duan, Sheng Yang, Ruiliang Lv, Mingdao Liu, Lihang Pan, Ke Ning, Junhui Ji, Jinjiang Wang, Jing Chen, Jiazheng Xu, Jiale Zhu, Jiale Cheng, Ji Qi, Guobing Gan, Guo Wang, Cong Yao, Zijun Dou, Zihao Zhou, Zihan Wang, Zhiqi Ge, Zhijie Li, Zhenyu Hou, Zhao Xue, Zehui Wang, Zehai He, Yusen Liu, Yukuo Cen, Yuchen Li, Yuan Wang, Yijian Lu, Yanzi Wang, Yadong Xue, Xinyu Zhang, Xinyu Liu, Wenkai Li, Tianyu Tong, Tianshu Zhang, Shengdong Yan, Qinkai Zheng, Mingde Xu, Licheng Bao, Jiaxing Xu, Jiaxin Fan, Jiawen Qian, Jiali Chen, Jiahui Lin, Haozhi Zheng, Haoran Wang, Haochen Li, Fan Yang, Dan Zhang, Chuangxin Zhao, Chengcheng Wu, Boyan Shi, Bowei Jia, Baoxu Wang, Peng Zhang, Debing Liu, Bin Xu, Juanzi Li, Minlie Huang, Yuxiao Dong, Jie Tang · 2026

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on langu…

Read Paper →

AI & Data Science Preprint PDF DOI

Pythia: Toward Predictability-Driven Agent-Native LLM Serving

Shan Yu, Junyi Shu, Yuanjiang Ni, Kun Qian, Xue Li, Yang Wang, Jinyuan Zhang, Ziyi Xu, Shuo Yang, Lingjun Zhu, Ennan Zhai, Qingda Lu, Jiarong Xing, Youyou Lu, Xin Jin, Xuanzhe Liu, Harry Xu · 2026

As LLM applications grow more complex, developers are increasingly adopting multi-agent architectures to decompose workflows into specialized, collaborative components, introducing structure that cons…

Read Paper →

AI & Data Science Preprint PDF DOI

Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses

Jiahang Lin, Shichun Liu, Chengjun Pan, Lizhi Lin, Shihan Dou, Xuanjing Huang, Hang Yan, Zhenhua Han, Tao Gui · 2026

Harnesses are now central to coding-agent performance, mediating how models interact with tools and execution environments. Yet harness engineering remains a manual craft, because automating it faces …

Read Paper →

AI & Data Science Preprint PDF DOI

QB-LIF: Learnable-Scale Quantized Burst Neurons for Efficient SNNs

Dewei Bai, Hongxiang Peng, Jiajun Mei, Yang Ren, Hong Qu, Dawen Xia, Zhang Yi · 2026

Binary spike coding enables sparse and event-driven computation in spiking neural networks (SNNs), yet its 1-bit-per-timestep representation fundamentally limits information throughput. This bottlenec…

Read Paper →

AI & Data Science Preprint PDF DOI

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

Joshua Sherwood, Ben Aybar, Benjamin Kaplan · 2026

Forecasting when AI systems will become capable of meaningfully accelerating AI research is a central challenge for AI safety. Existing benchmarks measure broad capability growth, but may not provide …

Read Paper →

AI & Data Science Preprint PDF DOI

FDIM: A Feature-distance-based Generic Video Quality Metric for Versatile Codecs

Jiayi Wang, Lichun Zhang, Xiaoqi Zhuang, Jiaqi Zhang, Lu Yu, Yin Zhao · 2026

Video technology is advancing toward Ultra High Definition (UHD) and High Dynamic Range (HDR), which intensifies the need for higher compression efficiency for these high-specification videos. Beyond …

Read Paper →

AI & Data Science Preprint PDF DOI

Fix Initial Codes and Iteratively Refine Textual Directions Toward Safe Multi-Turn Code Correction

Yuto Tanaka, Issei Sato · 2026

Recent work on large language models (LLMs) has emphasized the importance of scaling inference compute. From this perspective, the state-of-the-art method Scattered Forest Search (SFS) has been propos…

Read Paper →

AI & Data Science Preprint PDF DOI

Vibe Medicine: Redefining Biomedical Research Through Human-AI Co-Work

Zihao Wu, Steven Xu, Bowen Chen, Shaowen Wan, Yiwei Li, Wei Ruan, Yanjun Lyu, Siyuan Li, Dajiang Zhu, Tianming Liu, Lin Zhao · 2026

With the emergence of large language models (LLMs) and AI agent frameworks, the human-AI co-work paradigm known as Vibe Coding is changing how people code, making it more accessible and productive. In…

Read Paper →

AI & Data Science Preprint PDF DOI

BSViT: A Burst Spiking Vision Transformer for Expressive and Efficient Visual Representation Learning

Hongxiang Peng, Dewei Bai, Hong Qu · 2026

Spiking Vision Transformers (S-ViTs) offer a promising framework for energy-efficient visual learning. However, existing designs remain limited by two fundamental issues: the restricted information ca…

Read Paper →

AI & Data Science Preprint PDF DOI

Perceptions and Utilization of GenAI Tools among Data Science Students and Faculty

Abeer M. Hasan, Sayed A. Mostafa · 2026

This study investigates perceptions and use of generative artificial intelligence (GenAI) tools among students and faculty in statistics and data science at a historically Black college or university.…

Read Paper →

AI & Data Science Preprint PDF DOI

AutoPyVerifier: Learning Compact Executable Verifiers for Large Language Model Outputs

Pouya Pezeshkpour, Estevam Hruschka · 2026

Verification is becoming central to both reinforcement-learning-based training and inference-time control of large language models (LLMs). Yet current verifiers face a fundamental trade-off: LLM-based…

Read Paper →

Browse Research Papers

What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design

TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agents

Unpacking Vibe Coding: Help-Seeking Processes in Student-AI Interactions While Programming

Automated Detection of Mutual Gaze and Joint Attention in Dual-Camera Settings via Dual-Stream Transformers

ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python

MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Pythia: Toward Predictability-Driven Agent-Native LLM Serving

Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses

QB-LIF: Learnable-Scale Quantized Burst Neurons for Efficient SNNs

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

FDIM: A Feature-distance-based Generic Video Quality Metric for Versatile Codecs

Fix Initial Codes and Iteratively Refine Textual Directions Toward Safe Multi-Turn Code Correction

Vibe Medicine: Redefining Biomedical Research Through Human-AI Co-Work

BSViT: A Burst Spiking Vision Transformer for Expressive and Efficient Visual Representation Learning

Perceptions and Utilization of GenAI Tools among Data Science Students and Faculty

AutoPyVerifier: Learning Compact Executable Verifiers for Large Language Model Outputs

Browse by Category

Research Type

Publish Your Research