Expertini Research Research

Browse Research Papers

49,469+ open-access research outputs.

โœ• Clear
๐Ÿ” recognition
Showing 49469 results for "recognition"
AI & Data Science Preprint PDF DOI

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images

Bo Zhang, Tzu-Yen Ma, Zichen Tang, Junpeng Ding, Zirui Wang, Yizhuo Zhao, Peilin Gao, Zijie Xi, Zixin Ding, Haiyang Sun, Haocheng Gao, Yuan Liu, Liangjia Wang, Yiling Huang, Yujie Wang, Yuyue Zhang, Ronghui Xi, Yuanze Li, Jiacheng Liu, Zhongjun Yang, Haihong E ยท 2026

We introduce AEGIS, A holistic benchmark for Evaluating forensic analysis of AI-Generated academic ImageS. Compared to existing benchmarks, AEGIS features three key advances: (1) Domain-Specific Complโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Action Motifs: Self-Supervised Hierarchical Representation of Human Body Movements

Genki Kinoshita, Shu Nakamura, Ryo Kawahara, Shohei Nobuhara, Yasutomo Kawanishi, Ko Nishino ยท 2026

Effective human behavior modeling requires a representation of the human body movement that capitalizes on its compositionality. We propose a hierarchical representation consisting of Action Atoms thaโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Normativity and Productivism: Ableist Intelligence? A Degrowth Analysis of AI Sign Language Translation Tools for Deaf People

Nina Seron-Abouelfadil, Poppy Fynes ยท 2026

Sign languages, of any geographical or accentual variation, understandably face continuous scrutiny under the ever present popularity of verbal dictation and audism. Through this, many potential problโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Characterizing the Consistency of the Emergent Misalignment Persona

Anietta Weckauff, Yuchen Zhang, Maksym Andriushchenko ยท 2026

Fine-tuning large language models (LLMs) on narrowly misaligned data generalizes to broadly misaligned behavior, a phenomenon termed emergent misalignment (EM). While prior work has found a correlatioโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

AesRM: Improving Video Aesthetics with Expert-Level Feedback

Yujin Han, Yujie Wei, Yefei He, Xinyu Liu, Tianle Li, Zichao Yu, Andi Han, Shiwei Zhang, Tingyu Weng, Difan Zou ยท 2026

Despite rapid advances in photorealistic video generation, real-world applications such as filmmaking require video aesthetics, e.g., harmonious colors and cinematic lighting, beyond visual fidelity. โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering

An-Yang Ji, Jun-Peng Jiang, De-Chuan Zhan, Han-Jia Ye ยท 2026

Large Language Models (LLMs) have advanced Table Question Answering, where most queries can be answered by extracting information or simple aggregation. However, a common class of real-world queries iโ€ฆ

Read Paper โ†’
Sociology & Anthropology Preprint PDF DOI

Universal statistical laws governing culinary design

Ganesh Bagler, Gopal Krishna Tewari, Aditya Raj Yadav, Akshat Singh, Pranay Bansal, Ujjval Dargar, Mansi Goel, Madhvi Kumari Sinha ยท 2026

Cooking is a cultural expression of human creativity that transcends geography and time through the orchestration of ingredients and techniques, much like languages do through words and syntax. Yet, bโ€ฆ

Read Paper โ†’
Neuroscience Preprint PDF DOI

Multisensory learning recruits visual neurons into an olfactory memory engram

Zeynep Okray, Nils Otto, Anna A. Cook, Clifford Talbot, Ashwin Miriyala, Martin Klappenbach, Ciara Stern, Kieran Desmond, Paola Vargas-Gutierrez, Scott Waddell ยท 2026

Associating multiple sensory cues with a single experience or object is a fundamental process that improves object recognition and memory performance. However, neural mechanisms that bind sensory featโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

LLMs as ASP Programmers: Self-Correction Enables Task-Agnostic Nonmonotonic Reasoning

Adam Ishay, Joohyung Lee ยท 2026

Recent large language models (LLMs) have achieved impressive reasoning milestones but continue to struggle with high computational costs, logical inconsistencies, and sharp performance degradation on โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Real-Time Control of a Virtual Orchestra by Recognition of Conducting Gestures

Mert Mermerci, Emile Pascoe, Fredrik Edstrom, Hedvig Kjellstrom ยท 2026

We present a museum installation in a 180{\deg} dome theater, which gives the museum visitor the experience of conducting a symphony orchestra. We have pre-recorded a short music piece performed by a โ€ฆ

Read Paper โ†’
Computer Science Preprint PDF DOI

Enhancing multimodal affect recognition in healthcare: the robustness of appraisal dimensions over labels within age groups and in cross-age generalisation

Hippolyte Fournier, Sina Alisamir, Safaa Azzakhnini, Isabella Zsoldos, Eleonore Tran, Gerard Bailly, Frederic Elisei, Beatrice Bouchot, Brice Varini, Patrick Constant, Joan Fruitet, Franck Tarpin-Bernard, Solange Rossato, Francois Portet, Olivier Koenig, Hanna Chainay, Fabien Ringeval ยท 2026

The integration of artificial intelligence (AI) into healthcare has advanced significantly, yet affect recognition remains a major challenge, particularly in AI-assisted interventions such as Computerโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances

Matteo Da Pelo, Alessio Donvito, Claudio Frongia, Pietro Salis, Antonio Lieto ยท 2026

We introduce a framework called LAPITHS (Language model Analysis through Paradigm grounded Interpretations of Theses about Human likenesS) and use it to show that several major claims advanced by modeโ€ฆ

Read Paper โ†’
Neuroscience Preprint PDF DOI

On Agentic Behavioral Modeling

Dirk Ostwald, Rasmus Bruckner, Franziska Usee, Belinda Fleischmann, Joram Soch, Sean Mulready ยท 2026

Integrating theoretical neuroscience, decision theory, and probabilistic inference offers a promising route to understanding human cognition, yet concrete methodological bridges between agentic AI modโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Parameter-Efficient Architectural Modifications for Translation-Invariant CNNs

Nuria Alabau-Bosque, Jorge Vila-Tomas, Paula Dauden-Oliver, Valero Laparra, Jesus Malo ยท 2026

Convolutional Neural Networks (CNNs) are widely assumed to be translation-invariant, yet standard architectures exhibit a startling fragility: even a single-pixel shift can drastically degrade performโ€ฆ

Read Paper โ†’
Engineering Preprint PDF DOI

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

Doyeop Kwak, Jeongsoo Choi, Suyeon Lee, Joon Son Chung ยท 2026

We introduce LRS-VoxMM, an in-the-wild benchmark for audio-visual speech recognition (AVSR). The benchmark is derived from VoxMM, a dataset of diverse real-world spoken conversations with human-annotaโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Learning to Reason: Targeted Knowledge Discovery and Fuzzy Logic Update for Robust Image Recognition

Gurucharan Srinivas, Joshua Niemeijer, Frank Koster ยท 2026

Integrating domain knowledge into deep neural networks is a promising way to improve generalization. Existing methods either encode prior knowledge in the loss function or apply post-processing moduleโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Learning from a single labeled face and a stream of unlabeled data

Branislav Kveton, Michal Valko ยท 2026

Face recognition from a single image per person is a challenging problem because the training sample is extremely small. We consider a variation of this problem. In our problem, we recognize only one โ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Online semi-supervised perception: Real-time learning without explicit feedback

Branislav Kveton, Michal Valko, Matthai Phillipose, Ling Huang ยท 2026

This paper proposes an algorithm for real-time learning without explicit feedback. The algorithm combines the ideas of semi-supervised learning on graphs and online learning. In particular, it iteratiโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

HATS: An Open data set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics

Thibault Baneras Roux, Jane Wottawa, Mickael Rouvier, Teva Merlin, Richard Dufour ยท 2026

Conventionally, Automatic Speech Recognition (ASR) systems are evaluated on their ability to correctly recognize each word contained in a speech signal. In this context, the word error rate (WER) metrโ€ฆ

Read Paper โ†’
AI & Data Science Preprint PDF DOI

Self-Supervised Learning of Plant Image Representations

Ilyass Moummad, Kawtar Zaher, Herve Goeau, Jean-Christophe Lombardo, Pierre Bonnet, Alexis Joly ยท 2026

Automated plant recognition plays a crucial role in biodiversity monitoring and conservation, yet current approaches rely heavily on supervised learning, which is limited by the availability of expertโ€ฆ

Read Paper โ†’
Page 1 of 2474 Next โ†’