Expertini Research Research
Artificial Intelligence And Data Science PDF Available Non-peer-reviewed Preprint

CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model

Tae-Hwan Jung  ·  Published 2021-05-29

Abstract

Commit message is a document that summarizes source code changes in natural language. A good commit message clearly shows the source code changes, so this enhances collaboration between developers. Therefore, our work is to develop a model that automatically writes the commit message. To this end, we release 345K datasets consisting of code modification and commit messages in six programming languages (Python, PHP, Go, Java, JavaScript, and Ruby). Similar to the neural machine translation (NMT) model, using our dataset, we feed the code modification to the encoder input and the commit message to the decoder input and measure the result of the generated commit message with BLEU-4. Also, we propose the following two training methods to improve the result of generating the commit message: (1) A method of preprocessing the input to feed the code modification to the encoder input. (2) A method that uses an initial weight suitable for the code domain to reduce the gap in contextual representation between programming language (PL) and natural language (NL). Training code, dataset, and pre-trained weights are available at https://github.com/graykode/commit-autosuggestions
📄 Full Paper Available as PDF
This paper is available as a downloadable PDF.
📄 Download PDF

✨ AI Plain-English Summary

Get a plain-English summary of this paper generated by AI (5 free per day).

Comments (0)

No comments yet. Be the first to comment.

Related Papers

Artificial Intelligence And Data Science PDF

An Efficient Algorithm for Computing Interventional Distributions in ...

2012
Artificial Intelligence And Data Science PDF

Sparse matrix-variate Gaussian process blockmodels for network modeling

2012
Artificial Intelligence And Data Science PDF

Hierarchical Maximum Margin Learning for Multi-Class Classification

2012
Artificial Intelligence And Data Science PDF

Tightening MRF Relaxations with Planar Subproblems

2012