Berat Dogan in Computer Science — Research Repository

Computer Science Preprint PDF DOI

Cross-lingual Comparison of Research Funding Projects with Multilingual Sentence-BERT: Evidence from KAKENHI, NIH, NSF, and UKRI

Miki Kimura-Ida · 2026

Cross-national comparison of research funding projects is increasingly important for science policy and strategic planning, but language differences remain a major obstacle. In particular, KAKENHI pro…

Read Paper →

Computer Science Preprint PDF DOI

Identifying and Characterizing Semantic Clones of Solidity Functions

Ermanno Francesco Sannini, Francesco Salzano, Simone Scalabrino, Rocco Oliveto, Remo Pareschi, Corrado Aaron Visaggio, Andrea Di Sorbo · 2026

Smart Contracts are essential blockchain components, mainly written in Solidity. The high availability of public Solidity code leads to frequent reuse and high clone ratios. Since cloning can propagat…

Read Paper →

Computer Science Preprint PDF DOI

SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton

Xuzheng He, Nan Nan, Zhilin Wang, Ziyue Kang, Zhuoru Mo, Ao Li, Yu Pan, Xiaobing Li, Feng Yu, Xiaohong Guan · 2026

Generating symphonic music requires simultaneously managing high-level structural form and dense, multi-track orchestration. Existing symbolic models often struggle with a "complexity-control imbalanc…

Read Paper →

Computer Science Preprint PDF DOI

Geometric Analysis of Self-Supervised Vision Representations for Semantic Image Retrieval

Esteban Rodriguez-Betancourt, Edgar Casasola-Murillo · 2026

Content-based image retrieval (CBIR) systems enable users to search images based on visual content instead of relying on metadata. The text domain has benefited from vector search of representations c…

Read Paper →

Computer Science Preprint PDF DOI

Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

Gautam Kishore Shahi, Oliver Hummel · 2026

The relentless expansion of scientific literature presents significant challenges for navigation and knowledge discovery. Within Research Information Retrieval, established tasks such as text summariz…

Read Paper →

Computer Science Preprint PDF DOI

Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations

Maximilian Wachter, Sebastian Murgul, Michael Heizmann · 2026

Rhythm transcription is a key subtask of notation-level Automatic Music Transcription (AMT). While deep learning models have been extensively used for detecting the metrical grid in audio and MIDI per…

Read Paper →

Computer Science Preprint PDF DOI

White Paper: Human-AI Collaboration in Conflict Analysis: Text Classifier Development with Peacebuilders

Allan Kipyator Kipkemboi Cheboi, Julie Hawke, Hussam Abualfatah, Andrew Sutjahjo, Daniel Burkhardt Cerigo, Rachael Olpengs, William OBrien · 2026

This paper documents a collaborative research process involving peacebuilders and data scientists in Kenya and Sudan to develop AI-based text classifiers for monitoring online polarization and hatespe…

Read Paper →

Computer Science Preprint PDF DOI

Scaling Worst-Case Optimal Datalog to GPUs

Yihao Sun, Kunting Qi, Thomas Gilray, Sidharth Kumar, Kristopher Micinski · 2026

Datalog is a declarative logic-programming language used for complex analytic reasoning workloads such as program analysis and graph analytics. Datalog's popularity is due to its unique price-point, m…

Read Paper →

Computer Science Preprint PDF DOI

BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps

Lekai Qian, Haoyu Gu, Jingwei Zhao, Ziyu Wang · 2026

Tokenizing music to fit the general framework of language models is a compelling challenge, especially considering the diverse symbolic structures in which music can be represented (e.g., sequences, g…

Read Paper →

Computer Science Preprint PDF DOI

Circadian Phase Locking of Epilepsy Seizures in Wearable Data: A Single-Patient Case Study

Berenika Ewart-James, Matthew Wragg, Nawid Keshtmand, Amberly Brigden, Paul Marshall, Raul Santos-Rodriguez (University of Bristol) · 2026

Epilepsy is a common, chronic neurological disorder characterized by recurrent seizures caused by sudden bursts of abnormal electrical activity in the brain. Seizures can often be unpredictable, leadi…

Read Paper →

Computer Science Preprint PDF DOI

On the Robustness of LLM-Based Dense Retrievers: A Systematic Analysis of Generalizability and Stability

Yongkang Li, Panagiotis Eustratiadis, Yixing Fan, Evangelos Kanoulas · 2026

Decoder-only large language models (LLMs) are increasingly replacing BERT-style architectures as the backbone for dense retrieval, achieving substantial performance gains and broad adoption. However, …

Read Paper →

Computer Science Preprint PDF DOI

A Manual Bar-by-Bar Tempo Measurement Protocol for Polyphonic Chamber Music Recordings: Design, Validation, and Application to Beethoven's Piano and Cello Sonatas

Ignasi Sole · 2026

Empirical performance analysis depends on the accurate extraction of tempo data from recordings, yet standard computational tools, designed for monophonic audio or modern studio conditions, fail syste…

Read Paper →

Computer Science Preprint PDF DOI

ToxiShield: Promoting Inclusive Developer Communication through Real-Time Toxicity Filtering

MD Awsaf Alam Anindya, Showvik Biswas, Anindya Iqbal, Jaydeb Sarker, Amiangshu Bosu · 2026

Toxic interactions during code reviews can undermine teamwork and hinder productivity in software engineering (SE) teams. While prior studies explore toxicity detection and empirical investigation, th…

Read Paper →

Computer Science Preprint PDF DOI

Log-based vs Graph-based Approaches to Fault Diagnosis

Mathis Nguyen, Mohamed Ali Lajnef · 2026

Modern distributed systems generate large volumes of logs that can be analyzed to support essential AIOps tasks such as fault diagnosis, which plays a crucial role in maintaining system reliability. M…

Read Paper →

Computer Science Preprint PDF DOI

Max Cut with Small-Dimensional SDP Solutions

Hsien-Chih Chang, Suprovat Ghoshal, Euiwoong Lee · 2026

We study the Max-Cut semidefinite programming (SDP) relaxation in the regime where a near-optimal solution admits a low-dimensional realization. While the Goemans--Williamson hyperplane rounding achie…

Read Paper →

Computer Science Preprint PDF DOI

CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead

Jinpeng Ye, Chongxi Wang, Wenqing Li, Bin Yuan, Shiyi Wang, Fenglu Zhang, Junyu Yue, Jianan Xie, Yunhao Ye, Haoyu Deng, Yingkun Zhou, Xin Cheng, Fuxin Zhang, Jian Wang · 2026

Matrix extensions have emerged as an essential feature in modern CPUs to address the surging demands of AI workloads. However, existing designs often incur substantial hardware and software design ove…

Read Paper →

Computer Science Preprint PDF DOI

Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Angqing Jiang, Jianlyu Chen, Zhe Fang, Yongcan Wang, Xinpeng Li, Keyu Ding, Defu Lian · 2026

Effective medical text retrieval requires both high accuracy and low latency. While LLM-based embedding models possess powerful retrieval capabilities, their prohibitive latency and high computational…

Read Paper →

Computer Science Preprint PDF DOI

Machine Learning-Based Detection of MCP Attacks

Tobias Mattsson, Samuel Nyberg, Anton Borg, Ricardo Britto · 2026

The Model Context Protocol (MCP) is a new and emerging technology that extends the functionality of large language models, improving workflows but also exposing users to a new attack surface. Several …

Read Paper →

Computer Science Preprint PDF DOI

EncFormer: Secure and Efficient Transformer Inference over Encrypted Data

Yufan Zhu, Chao Jin, Khin Mi Mi Aung, Xiaokui Xiao · 2026

Transformer inference in machine-learning-as-a-service (MLaaS) raises privacy concerns for sensitive user inputs. Prior secure solutions that combine fully homomorphic encryption (FHE) and secure mult…

Read Paper →

Computer Science Preprint PDF DOI

Real-Time Toxicity Filtering for Open-Source Code Reviews

Md Awsaf Alam Anindya, Showvik Biswas, Anindya Iqbal, Jaydeb Sarker, Amiangshu Bosu · 2026

Toxic interactions in open-source software development harm community collaboration. To combat this, we propose ToxiShield, a realtime browser extension that identifies and detoxifies toxic code revie…

Read Paper →

Browse Research Papers

Cross-lingual Comparison of Research Funding Projects with Multilingual Sentence-BERT: Evidence from KAKENHI, NIH, NSF, and UKRI

Identifying and Characterizing Semantic Clones of Solidity Functions

SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton

Geometric Analysis of Self-Supervised Vision Representations for Semantic Image Retrieval

Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models

Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations

White Paper: Human-AI Collaboration in Conflict Analysis: Text Classifier Development with Peacebuilders

Scaling Worst-Case Optimal Datalog to GPUs

BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps

Circadian Phase Locking of Epilepsy Seizures in Wearable Data: A Single-Patient Case Study

On the Robustness of LLM-Based Dense Retrievers: A Systematic Analysis of Generalizability and Stability

A Manual Bar-by-Bar Tempo Measurement Protocol for Polyphonic Chamber Music Recordings: Design, Validation, and Application to Beethoven's Piano and Cello Sonatas

ToxiShield: Promoting Inclusive Developer Communication through Real-Time Toxicity Filtering

Log-based vs Graph-based Approaches to Fault Diagnosis

Max Cut with Small-Dimensional SDP Solutions

CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead

Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Machine Learning-Based Detection of MCP Attacks

EncFormer: Secure and Efficient Transformer Inference over Encrypted Data

Real-Time Toxicity Filtering for Open-Source Code Reviews

Browse by Category

Research Type

Publish Your Research