Task Adaptive Parameter Sharing for Multi-Task Learning

Matthew Wallingford, Hao Li, Alessandro Achille, Avinash Ravichandran, Charless Fowlkes, Rahul Bhotika, Stefano Soatto

Artificial Intelligence And Data Science · Non-peer-reviewed Preprint

Published 2022-03-30

Abstract

Adapting pre-trained models with broad capabilities has become standard practice for learning a wide range of downstream tasks. The typical approach of fine-tuning a separate model for each task performs well but incurs a substantial memory cost. To learn multiple downstream tasks efficiently, we introduce Task Adaptive Parameter Sharing (TAPS), a general method for tuning a base model to a new task by adaptively modifying a small, task-specific subset of layers. This enables multi-task learning while minimizing both the resources used and the competition between tasks. TAPS solves a joint optimization problem that determines which layers to share with the base model and the values of the task-specific weights. Further, a sparsity penalty on the number of active layers encourages weight sharing with the base model. Compared to other methods, TAPS retains high accuracy on downstream tasks while introducing few task-specific parameters. Moreover, TAPS is agnostic to the model architecture and requires only minor changes to the training scheme. We evaluate our method on a suite of fine-tuning tasks and architectures (ResNet, DenseNet, ViT) and show that it achieves state-of-the-art performance while being simple to implement.
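
The layer-sharing idea in the abstract can be illustrated with a small sketch. The abstract does not give the parameterization, so everything here is an assumption made for illustration: each layer carries a hypothetical gate `score`, a layer counts as task-specific when its score reaches a threshold `tau`, the task-specific weights are stored as a residual `delta` on top of the frozen base weights, and an L1 penalty on the scores stands in for the penalty on the number of active layers. The training loop that would learn scores and residuals jointly is omitted.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    base: List[float]   # frozen weights shared with the base model
    delta: List[float]  # task-specific residual (hypothetical parameterization)
    score: float        # gate score for this layer (hypothetical)


def is_active(layer: Layer, tau: float) -> bool:
    """A layer is treated as task-specific only if its score reaches tau."""
    return layer.score >= tau


def effective_weights(layer: Layer, tau: float) -> List[float]:
    """Shared layers reuse the base weights; active layers add their residual."""
    if not is_active(layer, tau):
        return list(layer.base)
    return [b + d for b, d in zip(layer.base, layer.delta)]


def sparsity_penalty(layers: List[Layer], lam: float) -> float:
    """L1 penalty on gate scores, an assumed proxy for the active-layer count."""
    return lam * sum(abs(layer.score) for layer in layers)


def task_specific_param_count(layers: List[Layer], tau: float) -> int:
    """Count only the residual parameters of layers that are active."""
    return sum(len(layer.delta) for layer in layers if is_active(layer, tau))


# Illustrative values only; none of these numbers come from the paper.
TAU = 0.1
LAM = 0.5
layers = [
    Layer(base=[1.0, 2.0], delta=[0.5, -0.5], score=0.9),   # active
    Layer(base=[3.0, 4.0], delta=[0.1, 0.1], score=0.02),   # shared with base
    Layer(base=[5.0], delta=[1.0], score=0.3),              # active
]

for index, layer in enumerate(layers):
    print(index, is_active(layer, TAU), effective_weights(layer, TAU))
print("task-specific parameters:", task_specific_param_count(layers, TAU))
print("penalty:", sparsity_penalty(layers, LAM))
```

Under these assumptions, a layer whose score stays below the threshold adds no stored parameters for the new task, which is the memory saving the abstract describes; the penalty term pushes scores down so that a layer becomes task-specific only when doing so helps the task loss enough to justify it.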

Keywords

Artificial Intelligence & Data Science
