Programming Languages — Research Repository

AI & Data Science Preprint PDF DOI

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

Neemias B da Silva, Rodrigo Minetto, Daniel Silver, Thiago H Silva · 2026

Large Language Models (LLMs) are increasingly used as proxies for human perception in urban analysis, yet it remains unclear whether persona prompting produces meaningful and reproducible behavioral d…

Read Paper →

AI & Data Science Preprint PDF DOI

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

Rahul Ramachandran, Nidhi Jha, Muthukumaran Ramasubramanian · 2026

We present Collaborative Agent Reasoning Engineering (CARE), a disciplined methodology for engineering Large Language Model (LLM) agents in scientific domains. Unlike ad-hoc trial-and-error approaches…

Read Paper →

AI & Data Science Preprint PDF DOI

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Jialu Shen, Han Lyu, Suyang Zhong, Hanzheng Li, Haoyi Tao, Nan Wang, Changhong Chen, Xi Fang · 2026

Spectra are a prevalent yet highly information-dense form of scientific imagery, presenting substantial challenges to multimodal large language models (MLLMs) due to their unstructured and domain-spec…

Read Paper →

AI & Data Science Preprint PDF DOI

Ease of dependency distance minimization in star-like structures

Emilia Garcia-Casademont, Ramon Ferrer-i-Cancho · 2026

The syntactic structure of a sentence can be represented as a tree where edges indicate syntactic dependencies between words. When that structure is a star, it has been demonstrated that the head shou…

Read Paper →

AI & Data Science Preprint PDF DOI

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

Garvin Kruthof · 2026

When researchers iteratively refine ideas with large language models, do the models preserve fidelity to the original objective? We introduce DriftBench, a benchmark for evaluating constraint adherenc…

Read Paper →

AI & Data Science Preprint PDF DOI

Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding

Smit Jivani, Sarvam Maheshwari, Sunita Sarawagi · 2026

Large language models (LLMs) have revolutionized Text-to-SQL generation, allowing users to query structured data using natural language with growing ease. Yet, real-world deployment remains challengin…

Read Paper →

Sociology & Anthropology Preprint PDF DOI

Universal statistical laws governing culinary design

Ganesh Bagler, Gopal Krishna Tewari, Aditya Raj Yadav, Akshat Singh, Pranay Bansal, Ujjval Dargar, Mansi Goel, Madhvi Kumari Sinha · 2026

Cooking is a cultural expression of human creativity that transcends geography and time through the orchestration of ingredients and techniques, much like languages do through words and syntax. Yet, b…

Read Paper →

AI & Data Science Preprint PDF DOI

Cost-Aware Learning

Clara Mohri, Amir Globerson, Haim Kaplan, Tomer Koren, Yishay Mansour · 2026

We consider the problem of Cost-Aware Learning, where sampling different component functions of a finite-sum objective incurs different costs. The objective is to reach a target error while minimizing…

Read Paper →

Engineering Preprint PDF DOI

Design Structure Matrix Modularization with Large Language Models

Shuo Jiang, Jianxi Luo · 2026

Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Traditional methods treat …

Read Paper →

AI & Data Science Preprint PDF DOI

Echo-{\alpha}: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

Jing Zhang, Wentao Jiang, Tao Huang, Zhiwei Wang, Jianxin Liu, Jian Chen, Ping Ye, Gang Wang, Zengmao Wang, Bo Du, Dacheng Tao · 2026

Ultrasound interpretation requires both precise lesion localization and holistic clinical reasoning, yet existing methods typically excel at only one of these capabilities: specialized detectors offer…

Read Paper →

AI & Data Science Preprint PDF DOI

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

Shijin Gong, Kai Ye, Jin Zhu, Xinyu Zhang, Hongyi Zhou, Chengchun Shi · 2026

Recent advances in large language models (LLMs) have increasingly relied on reinforcement learning (RL) to improve their reasoning capabilities. Three approaches have been widely adopted: (i) Proximal…

Read Paper →

AI & Data Science Preprint PDF DOI

A Pattern Language for Resilient Visual Agents

Habtom Kahsay Gidey, Alexander Lenz, Alois Knoll · 2026

Integrating multimodal foundation models into enterprise ecosystems presents a fundamental software architecture challenge. Architects must balance competing quality attributes: the high latency and n…

Read Paper →

AI & Data Science Preprint PDF DOI

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

Jackson Vonderhorst, Kuangshi Ai, Haichao Miao, Shusen Liu, Chaoli Wang · 2026

This paper examines how different types of large language model (LLM) agents perform on scientific visualization (SciVis) tasks, where users generate visualization workflows from natural-language inst…

Read Paper →

Computer Science Preprint PDF DOI

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

Ugo Dal Lago, Guido Fiorillo, Paolo Pistone · 2026

The problem of determining whether a probabilistic program terminates almost surely (i.e.~with probability one) is undecidable, and actually $\Pi^0_2$-complete. For this reason, a growing literature h…

Read Paper →

AI & Data Science Preprint PDF DOI

D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery

Hanane Nour Moussa, Yifei Li, Zhuoyang Li, Yankai Yang, Cheng Tang, Tianshu Zhang, Nesreen K. Ahmed, Ali Payani, Ziru Chen, Huan Sun · 2026

Despite recent progress in language models and agents for scientific data-driven discovery, further advancing their capabilities is held back by the absence of verifiable environments representing rea…

Read Paper →

AI & Data Science Preprint PDF DOI

TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions

Ce Chen, Yi Ren, Yuanming Li, Viktor Goriachko, Zhenhui Ye, Zujin Guo, Zhibin Hong, Mingming Gong · 2026

Traditional Shot Boundary Detection (SBD) inherently struggles with complex transitions by formulating the task around isolated cut points, frequently yielding corrupted video shots. We address this f…

Read Paper →

AI & Data Science Preprint PDF DOI

FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting

Fengxian Ji, Jingpu Yang, Zirui Song, Yuanxi Wang, Zhexuan Cui, Yuke Li, Qian Jiang, Xiuying Chen · 2026

Despite the rapid progress of large vision-language models (LVLMs), fine-grained, state-conditioned GUI interaction remains challenging. Current evaluations offer limited coverage, imprecise target-st…

Read Paper →

AI & Data Science Preprint PDF DOI

From LLM-Driven Trading Card Generation to Procedural Relatedness: A Pok\'emon Case Study

Johannes Pfau, Panagiotis Vrettis · 2026

Since the dawn of Trading Card Games, the genre has grown into a multi-billion-dollar industry engaging millions of analog and digital players worldwide. Popular TCGs rely on regular updates, balance …

Read Paper →

Computer Science Preprint PDF DOI

From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation

Guang Yang, Xing Hu, Xiang Chen, Xin Xi · 2026

Multimodal large language models (MLLMs) are increasingly used to translate visual artifacts into code, from UI mockups into HTML to scientific plots into Python scripts. A circuit diagram can be view…

Read Paper →

AI & Data Science Preprint PDF DOI

Language Models Refine Mechanical Linkage Designs Through Symbolic Reflection and Modular Optimisation

Joao Pedro Gandarela, Thiago Rios, Stefan Menzel, Andre Freitas · 2026

Designing mechanical linkages involves combinatorial topology selection and continuous parameter fitting. We show that language models can systematically improve linkage designs through symbolic repre…

Read Paper →

Browse Research Papers

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images

Ease of dependency distance minimization in star-like structures

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding

Universal statistical laws governing culinary design

Cost-Aware Learning

Design Structure Matrix Modularization with Large Language Models

Echo-{\alpha}: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

A Pattern Language for Resilient Visual Agents

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

On Higher-Order Probabilistic Verification via the Weighted Relational Model of Linear Logic

D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery

TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions

FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting

From LLM-Driven Trading Card Generation to Procedural Relatedness: A Pok\'emon Case Study

From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation

Language Models Refine Mechanical Linkage Designs Through Symbolic Reflection and Modular Optimisation

Browse by Category

Research Type

Publish Your Research