Ankesh Anand in Computer Science — Research Repository

Computer Science Preprint PDF DOI

NVLLM: A 3D NAND-Centric Architecture Enabling Edge on-Device LLM Inference

Mingbo Hao, Changwei Yan, Haoyu Cui, Zhihao Yan, Yizhi Ding, Zhangrui Qian, Weiwei Shan · 2026

The rapid growth of LLMs demands high-throughput, memory-capacity-intensive inference on resource-constrained edge devices, where single-batch decoding remains fundamentally memory-bound. Existing out…

Read Paper →

Computer Science Preprint PDF DOI

RecFlash: Fast Recommendation System on In-Storage Computing with Frequency-Based Data Mapping

Jangho Baik, Sunghyun Kim, Gisan Ji, Wonbo Shim, Sungju Ryu · 2026

Recommendation system has gained a large popularity for a variety of personalized suggestion tasks, but the ever-increasing number of user data makes real-time processing of recommendation systems dif…

Read Paper →

Computer Science Preprint PDF DOI

Subquadratic Counting via Perfect Marginal Sampling

Xiaoyu Chen, Zongchen Chen, Kuikui Liu, Xinyuan Zhang · 2026

We study the computational complexity of approximately computing the partition function of a spin system. Techniques based on standard counting-to-sampling reductions yield $\tilde{O}(n^2)$-time algor…

Read Paper →

Computer Science Preprint PDF DOI

Disguising Topology and Side-Channel Information through Covert Gate- and ML-Enabled IP Camouflaging

Junling Fan, David Koblah, Domenic Forte · 2026

Semiconductor intellectual property (IP) theft incurs hundreds of billions in annual losses, driven by advanced reverse engineering (RE) techniques. Traditional ``cryptic'' IC camouflaging methods typ…

Read Paper →

Computer Science Preprint PDF DOI

AnkleType: A Hands- and Eyes-free Foot-based Text Entry Technique in Virtual Reality

Xiyun Luo, Weirong Luo, Kening Zhu, Taizhou Chen · 2026

Virtual Reality (VR) emphasizes immersive experiences, while text entry often requires hands or visual attention, which may disrupt the interaction flows in VR. We present AnkleType, a hand- and eye-f…

Read Paper →

Computer Science Preprint PDF DOI

Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models

Jay Sarkar, Vamsi Pavan Rayaprolu, Abhijeet Bhalerao · 2026

Solid-state storage architectures based on NAND or emerging memory devices (SSD), are fundamentally architected and optimized for both reliability and performance. Achieving these simultaneous goals r…

Read Paper →

Computer Science Preprint PDF DOI

HAVEN: High-Bandwidth Flash Augmented Vector Engine for Large-Scale Approximate Nearest-Neighbor Search Acceleration

Po-Kai Hsu, Weihong Xu, Qunyou Liu, Tajana Rosing, Shimeng Yu · 2026

Retrieval-Augmented Generation (RAG) relies on large-scale Approximate Nearest Neighbor Search (ANNS) to retrieve semantically relevant context for large language models. Among ANNS methods, IVF-PQ of…

Read Paper →

Computer Science Preprint PDF DOI

Implementation and Performance Evaluation of CMOS-integrated Memristor-driven Flip-flop Circuits

Paras Tiwari, Narendra Singh Dhakad, Shalu Rani, Sanjay Kumar, Themis Prodromakis · 2026

In this work, we report implementation and performance evaluation of memristor-driven fundamental logic gates, including NOT, AND, NAND, OR, NOR, and XOR, and novel and optimized design of the sequent…

Read Paper →

Computer Science Preprint PDF DOI

Scalable IP Mimicry: End-to-End Deceptive IP Blending to Overcome Rectification and Scale Limitations of IP Camouflage

Junling Fan, George Rushevich, Giorgio Rusconi, Mengdi Zhu, Reiner Dizon-Paradis, Domenic Forte · 2025

Semiconductor intellectual property (IP) theft incurs estimated annual losses ranging from $225 billion to $600 billion. Despite initiatives like the CHIPS Act, many semiconductor designs remain vulne…

Read Paper →

Computer Science Preprint PDF DOI

A Performance Analyzer for a Public Cloud's ML-Augmented VM Allocator

Roozbeh Bostandoost, Pooria Namyar, Siva Kesava Reddy Kakarla, Ryan Beckett, Santiago Segarra, Eli Cortez, Ankur Mallick, Kevin Hsieh, Rodrigo Fonseca, Mohammad Hajiesmaili, Behnaz Arzani · 2025

Many operational cloud systems use one or more machine learning models that help them achieve better efficiency and performance. But operators do not have tools to help them understand how each model …

Read Paper →

Computer Science Preprint PDF DOI

KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing

Lishuo Deng, Shaojie Xu, Jinwu Chen, Changwei Yan, Jiajie Wang, Zhe Jiang, Weiwei Shan · 2025

Deploying large language models (LLMs) on edge devices enables personalized agents with strong privacy and low cost. However, with tens to hundreds of billions of parameters, single-batch autoregressi…

Read Paper →

Computer Science Preprint PDF DOI

Characterizing Off-Chain Influence Proof Transaction Fee Mechanisms

Aadityan Ganesh, Clayton Thomas, S. Matthew Weinberg · 2025

Roughgarden (2020) initiates the study of Transaction Fee Mechanisms (TFMs), and posits that the on-chain game of a ``good'' TFM should be on-chain simple (OnCS), i.e., incentive compatible for users …

Read Paper →

Computer Science Preprint PDF DOI

Drafting and Multi-Input Switching in Digital Dynamic Timing Simulation for Multi-Input Gates

Arman Ferdowsi, Ulrich Schmid, Josef Salzmann · 2025

We present a prototype multi-input gate extension of the publicly available Involution Tool for accurate digital timing simulation and power analysis of integrated circuits introduced by Oehlinger et …

Read Paper →

Computer Science Preprint PDF DOI

A Novel 8T SRAM-Based In-Memory Computing Architecture for MAC-Derived Logical Functions

Amogh K M, Sunita M S · 2025

This paper presents an in-memory computing (IMC) architecture developed on an 8x8 array of 8T SRAM cells. This architecture enables both multi-bit parallel Multiply-Accumulate (MAC) operations and sta…

Read Paper →

Computer Science Preprint PDF DOI

HDDB: Efficient In-Storage SQL Database Search Using Hyperdimensional Computing on Ferroelectric NAND Flash

Quanling Zhao, Yanru Chen, Runyang Tian, Sumukh Pinge, Weihong Xu, Augusto Vega, Steven Holmes, Saransh Gupta, Tajana Rosing · 2025

Hyperdimensional Computing (HDC) encodes information and data into high-dimensional distributed vectors that can be manipulated using simple bitwise operations and similarity searches, offering parall…

Read Paper →

Computer Science Preprint PDF DOI

Dissecting and Re-architecting 3D NAND Flash PIM Arrays for Efficient Single-Batch Token Generation in LLMs

Yongjoo Jang, Sangwoo Hwang, Hojin Lee, Sangwoo Jung, Donghun Lee, Wonbo Shim, Jaeha Kung · 2025

The advancement of large language models has led to models with billions of parameters, significantly increasing memory and compute demands. Serving such models on conventional hardware is challenging…

Read Paper →

Computer Science Preprint PDF DOI

STAR: Improving Lifetime and Performance of High-Capacity Modern SSDs Using State-Aware Randomizer

Omin Kwon, Kyungjun Oh, Jaeyong Lee, Myungsuk Kim, Jihong Kim · 2025

Although NAND flash memory has achieved continuous capacity improvements via advanced 3D stacking and multi-level cell technologies, these innovations introduce new reliability challenges, particularl…

Read Paper →

Computer Science Preprint PDF DOI

From Minutes to Seconds: Redefining the Five-Minute Rule for AI-Era Memory Hierarchies

Tong Zhang, Vikram Sharma Mailthody, Fei Sun, Linsen Ma, Chris J. Newburn, Teresa Zhang, Yang Liu, Jiangpeng Li, Hao Zhong, Wen-Mei Hwu · 2025

In 1987, Jim Gray and Gianfranco Putzolu introduced the five-minute rule, a simple, storage-memory-economics-based heuristic for deciding when data should live in DRAM rather than on storage. Subseque…

Read Paper →

Computer Science Preprint PDF DOI

PDA-LSTM: Knowledge-driven page data arrangement based on LSTM for LCM supression in QLC 3D NAND flash memories

Qianhui Li, Weiya Wang, Qianqi Zhao, Tong Qu, Jing He, Xuhong Qiang, Jingwen Hou, Ke Chen, Bao Zhang, Qi Wang · 2025

Quarter level cell (QLC) 3D NAND flash memory is emerging as the predominant storage solution in the era of artificial intelligence. QLC 3D NAND flash stores 4 bit per cell to expand the storage densi…

Read Paper →

Computer Science Preprint PDF DOI

Shifts in U.S. Social Media Use, 2020-2024: Decline, Fragmentation, and Enduring Polarization

Petter Tornberg · 2025

Using nationally representative data from the 2020 and 2024 American National Election Studies (ANES), this paper traces how the U.S. social media landscape has shifted across platforms, demographics,…

Read Paper →

Browse Research Papers

NVLLM: A 3D NAND-Centric Architecture Enabling Edge on-Device LLM Inference

RecFlash: Fast Recommendation System on In-Storage Computing with Frequency-Based Data Mapping

Subquadratic Counting via Perfect Marginal Sampling

Disguising Topology and Side-Channel Information through Covert Gate- and ML-Enabled IP Camouflaging

AnkleType: A Hands- and Eyes-free Foot-based Text Entry Technique in Virtual Reality

Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models

HAVEN: High-Bandwidth Flash Augmented Vector Engine for Large-Scale Approximate Nearest-Neighbor Search Acceleration

Implementation and Performance Evaluation of CMOS-integrated Memristor-driven Flip-flop Circuits

Scalable IP Mimicry: End-to-End Deceptive IP Blending to Overcome Rectification and Scale Limitations of IP Camouflage

A Performance Analyzer for a Public Cloud's ML-Augmented VM Allocator

KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing

Characterizing Off-Chain Influence Proof Transaction Fee Mechanisms

Drafting and Multi-Input Switching in Digital Dynamic Timing Simulation for Multi-Input Gates

A Novel 8T SRAM-Based In-Memory Computing Architecture for MAC-Derived Logical Functions

HDDB: Efficient In-Storage SQL Database Search Using Hyperdimensional Computing on Ferroelectric NAND Flash

Dissecting and Re-architecting 3D NAND Flash PIM Arrays for Efficient Single-Batch Token Generation in LLMs

STAR: Improving Lifetime and Performance of High-Capacity Modern SSDs Using State-Aware Randomizer

From Minutes to Seconds: Redefining the Five-Minute Rule for AI-Era Memory Hierarchies

PDA-LSTM: Knowledge-driven page data arrangement based on LSTM for LCM supression in QLC 3D NAND flash memories

Shifts in U.S. Social Media Use, 2020-2024: Decline, Fragmentation, and Enduring Polarization

Browse by Category

Research Type

Publish Your Research