Kui Wu in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Beyond Screenshots: Evaluating VLMs' Understanding of UI Animations

Chen Liang, Xirui Jiang, Naihao Deng, Eytan Adar, Anhong Guo · 2026

AI agents operating on user interfaces must understand how interfaces communicate state and feedback to act reliably. As a core communicative modality, animations are increasingly used in modern inter…

Read Paper →

Computer Science Preprint PDF DOI

Generative UI as an Accessibility Bridge: Lessons from C2C E-Commerce

Bektur Ryskeldiev · 2026

Web accessibility rests on static standards and developer compliance. That model frays in platforms where content is user-generated: photos arrive blurry or off-frame, descriptions skip size and condi…

Read Paper →

Computer Science Preprint PDF DOI

VisualNeo: Bridging the Gap between Visual Query Interfaces and Graph Query Engines

Kai Huang, Houdong Liang, Chongchong Yao, Xi Zhao, Yue Cui, Yao Tian, Ruiyuan Zhang, Xiaofang Zhou · 2026

Visual Graph Query Interfaces (VQIs) empower non-programmers to query graph data by constructing visual queries intuitively. Devising efficient technologies in Graph Query Engines (GQEs) for interacti…

Read Paper →

Computer Science Preprint PDF DOI

"We Wanted to Do Better Than the Law": Exploring UI/UX Designers' Privacy Advocacy in Practice

Keyu Yao, Jinghui Cheng, Jin L.C. Guo · 2026

Designers hold primary responsibility for shaping the user interface (UI) and user experience (UX) of a product. This role goes beyond aesthetics and usability, extending to the privacy outcomes of us…

Read Paper →

Computer Science Preprint PDF DOI

RFID-Based Non-Biometric Classroom Attendance System: Proxy Attendance Detection via Weight Sensor Integration

Furkan Ege, Muhsin Ozdemir · 2026

Attendance tracking in educational institutions, when conducted through traditional methods, leads to structural problems that consume instruction time and threaten academic integrity. Attendance dura…

Read Paper →

Computer Science Preprint PDF DOI

TraceScope: Interactive URL Triage via Decoupled Checklist Adjudication

Haolin Zhang, William Reber, Yuxuan Zhang, Guofei Gu, Jeff Huang · 2026

Modern phishing campaigns increasingly evade snapshot-based URL classifiers using interaction gates (e.g., checkbox/slider challenges), delayed content rendering, and logo-less credential harvesters. …

Read Paper →

Computer Science Preprint PDF DOI

AgentLens: Adaptive Visual Modalities for Human-Agent Interaction in Mobile GUI Agents

Jeonghyeon Kim, Byeongjun Joung, Junwon Lee, Joohyung Lee, Taehoon Min, Sunjae Lee · 2026

Mobile GUI agents can automate smartphone tasks by interacting directly with app interfaces, but how they should communicate with users during execution remains underexplored. Existing systems rely on…

Read Paper →

Computer Science Preprint PDF DOI

ViBR: Automated Bug Replay from Video-based Reports using Vision-Language Models

Sidong Feng, Dingbang Wang, Nikola Tomic, Tingting Yu, Aldeida Aleti, Chunyang Chen · 2026

Bug reports play a critical role in software maintenance by helping users convey encountered issues to developers. Recently, GUI screen capture videos have gained popularity as a bug reporting artifac…

Read Paper →

Computer Science Preprint PDF DOI

PlayCoder: Making LLM-Generated GUI Code Playable

Zhiyuan Peng, Wei Tao, Xin Yin, Chenhao Ying, Yuan Luo, Yiwen Guo · 2026

Large language models (LLMs) have achieved strong results in code generation, but their ability to generate GUI applications, especially games, remains insufficiently studied. Existing benchmarks main…

Read Paper →

Computer Science Preprint PDF DOI

Proactive Detection of GUI Defects in Multi-Window Scenarios via Multimodal Reasoning

Xinyao Zhang, Rui Wang, Jinhao Cui, Haotian Huang, Wei Xue, Wenhua Hu, Jianwen Xiang, Rui Hao · 2026

Multi-window mobile scenarios, such as split-screen and foldable modes, make GUI display defects more likely by forcing applications to adapt to changing window sizes and dynamic layout reflow. Existi…

Read Paper →

Computer Science Preprint PDF DOI

Temporal UI State Inconsistency in Desktop GUI Agents: Formalizing and Defending Against TOCTOU Attacks on Computer-Use Agents

Wenpeng Xu · 2026

GUI agents that control desktop computers via screenshot-and-click loops introduce a new class of vulnerability: the observation-to-action gap (mean 6.51 s on real OSWorld workloads) creates a Time-Of…

Read Paper →

Computer Science Preprint PDF DOI

Analysing Human Interaction with Electronic Displays in Microgravity

Pradipta Biswas, Himanshu Vishwakarma, Mukund Mitra, KamalPreet Singh Saluja, Aumkar Kishore Shah · 2026

Human Space Flight missions often require interaction with touchscreen displays. This paper presents a study of investigating human machine interaction with touchscreen using both finger and stylus in…

Read Paper →

Computer Science Preprint PDF DOI

SkillDroid: Compile Once, Reuse Forever

Qijia Chen, Andrea Bellucci, Zhida Sun, Giulio Jacucci · 2026

LLM-based mobile GUI agents treat every task invocation as an independent reasoning episode, requiring a full LLM inference call at each action step. This per-step dependence makes them stateless: a t…

Read Paper →

Computer Science Preprint PDF DOI

Beyond Chat and Clicks: GUI Agents for In-Situ Assistance via Live Interface Transformation

Pan Hao, Rishi Selvakumaran, Jacob Sun, Qianwen Wang · 2026

Complex visual interfaces are powerful yet have a steep learning curve, as users must navigate feature-rich visual interfaces while reasoning about domain-specific operations. Existing approaches eith…

Read Paper →

Computer Science Preprint PDF DOI

NewsTorch: A PyTorch-based Toolkit for Learner-oriented News Recommendation

Rongyao Wang, Veronica Liesaputra, Zhiyi Huang · 2026

News recommender systems are devised to alleviate the information overload, attracting more and more researchers' attention in recent years. The lack of a dedicated learner-oriented news recommendatio…

Read Paper →

Computer Science Preprint PDF DOI

RecNextEval: A Reference Implementation for Temporal Next-Batch Recommendation Evaluation

Tze-Kean Ng, Joshua Teng-Khing Khoo, Aixin Sun · 2026

A good number of toolkits have been developed in Recommender Systems (RecSys) research to promote fair evaluation and reproducibility. However, recent critical examinations of RecSys evaluation protoc…

Read Paper →

Computer Science Preprint PDF DOI

Agentic Open RAN: A Deterministic and Auditable Framework for Intent-Driven Radio Control

Hengxu Li, Dongkuan Xu, Mingzhe Chen, Yuchen Liu · 2026

Large language models (LLMs) open new possibilities for agentic control in Open RAN, allowing operators to express intents in natural language while delegating low-level execution to autonomous agents…

Read Paper →

Computer Science Preprint PDF DOI

From Words to Widgets for Controllable LLM Generation

Chao Zhang, Yiren Liu, Lunyiu Nie, Jeffrey M. Rzeszotarski, Yun Huang, Tal August · 2026

Natural language remains the predominant way people interact with large language models (LLMs). However, users often struggle to precisely express and control subjective preferences (e.g., tone, style…

Read Paper →

Computer Science Preprint PDF DOI

GALA: Multimodal Graph Alignment for Bug Localization in Automated Program Repair

Zhuoyao Liu, Zhengran Zeng, Shu-Dong Huang, Yang Liu, Shikun Zhang, Wei Ye · 2026

Large Language Model (LLM)-based Automated Program Repair (APR) has shown strong potential on textual benchmarks, yet struggles in multimodal scenarios where bugs are reported with GUI screenshots. Ex…

Read Paper →

Computer Science Preprint PDF DOI

Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems

Maria Movin, Claudia Hauff, Aron Henriksson, Panagiotis Papapetrou · 2026

LLM-driven GUI agents are increasingly used in production systems to automate workflows and simulate users for evaluation and optimization. Yet most GUI-agent evaluations emphasize task success and pr…

Read Paper →

Browse Research Papers

Beyond Screenshots: Evaluating VLMs' Understanding of UI Animations

Generative UI as an Accessibility Bridge: Lessons from C2C E-Commerce

VisualNeo: Bridging the Gap between Visual Query Interfaces and Graph Query Engines

"We Wanted to Do Better Than the Law": Exploring UI/UX Designers' Privacy Advocacy in Practice

RFID-Based Non-Biometric Classroom Attendance System: Proxy Attendance Detection via Weight Sensor Integration

TraceScope: Interactive URL Triage via Decoupled Checklist Adjudication

AgentLens: Adaptive Visual Modalities for Human-Agent Interaction in Mobile GUI Agents

ViBR: Automated Bug Replay from Video-based Reports using Vision-Language Models

PlayCoder: Making LLM-Generated GUI Code Playable

Proactive Detection of GUI Defects in Multi-Window Scenarios via Multimodal Reasoning

Temporal UI State Inconsistency in Desktop GUI Agents: Formalizing and Defending Against TOCTOU Attacks on Computer-Use Agents

Analysing Human Interaction with Electronic Displays in Microgravity

SkillDroid: Compile Once, Reuse Forever

Beyond Chat and Clicks: GUI Agents for In-Situ Assistance via Live Interface Transformation

NewsTorch: A PyTorch-based Toolkit for Learner-oriented News Recommendation

RecNextEval: A Reference Implementation for Temporal Next-Batch Recommendation Evaluation

Agentic Open RAN: A Deterministic and Auditable Framework for Intent-Driven Radio Control

From Words to Widgets for Controllable LLM Generation

GALA: Multimodal Graph Alignment for Bug Localization in Automated Program Repair

Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems

Browse by Category

Research Type

Publish Your Research