LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation

Sachit Kuhar, Wasi Uddin Ahmad, Zijian Wang, Nihal Jain, Haifeng Qian, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras

Computer Science · Non-peer-reviewed Preprint

Published 2024-11-19

Abstract

Recent advances in code completion models have focused primarily on local file contexts. However, these studies do not fully capture the complexity of real-world software development, which often requires the use of rapidly evolving public libraries. To fill this gap, we introduce LibEvolutionEval, a detailed study requiring an understanding of library evolution to perform in-line code completion accurately. LibEvolutionEval provides a version-specific code-completion task comprising eight libraries (torch, torchvision, scipy, pil, tqdm, pyyaml, matplotlib, and pandas) as they evolve over the years, along with a detailed analysis of the evolution of two popular and well-maintained public libraries: PyTorch and Matplotlib. We evaluate popular public models and find that public library evolution significantly influences model performance. We also explore mitigation methods by studying how retrieved version-specific library documentation and prompting can improve a model's ability to handle these fast-evolving packages, pointing to a promising path toward better handling of fast-evolving libraries.
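As an illustration of the mitigation idea mentioned in the abstract (prompting with retrieved version-specific documentation), the following is a minimal sketch, not the authors' pipeline. It assumes a toy in-memory documentation store; the library name `examplelib`, its functions, and the helper names `DocEntry`, `retrieve_docs`, and `build_prompt` are all hypothetical and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DocEntry:
    """One documentation snippet tied to a specific library version."""
    library: str
    version: str
    text: str


# Toy, made-up documentation store. "examplelib" and its API are hypothetical.
DOC_STORE = [
    DocEntry("examplelib", "1.0", "examplelib.load(path) -> Data"),
    DocEntry("examplelib", "2.0", "examplelib.read(path, *, strict=False) -> Data"),
]


def retrieve_docs(library, version, store=DOC_STORE):
    """Return only the snippets whose library and version match exactly."""
    return [d.text for d in store if d.library == library and d.version == version]


def build_prompt(library, version, code_prefix, store=DOC_STORE):
    """Prepend a version hint and matching doc snippets, as comments, to the code prefix."""
    lines = [f"# {library}=={version}"]
    lines += [f"# doc: {text}" for text in retrieve_docs(library, version, store)]
    return "\n".join(lines) + "\n" + code_prefix


if __name__ == "__main__":
    # The completion model would receive this string and continue after "examplelib."
    print(build_prompt("examplelib", "2.0", "import examplelib\ndata = examplelib."))
```

The exact-match lookup is a deliberate simplification; a real retriever would likely rank documentation by similarity to the code context rather than filter on an exact version string.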

Keywords

Computer Science
