PROFIT: A Specialized Optimizer for Deep Fine Tuning

Anirudh S Chakravarthy, Shuai Kyle Zheng, Xin Huang, Sachithra Hemachandra, Xiao Zhang, Yuning Chai, Zhao Chen

Year of publication: 2024
Access: Open access

Abstract

Fine-tuning pre-trained models has become ubiquitous in generative AI, computer vision, and robotics. Although much attention has been paid to making fine-tuning more efficient, there has been less scholarship on fine-tuning specifically for improved model performance. To close this gap, we present PROFIT, one of the first optimizers designed to incrementally fine-tune converged models on new tasks and/or datasets. Traditional optimizers such as SGD or Adam make minimal assumptions because they are designed for training from random initializations. PROFIT instead explicitly takes the properties of a converged model into account to regularize the optimization process. Employing a temporal gradient-orthogonalization process, PROFIT outperforms fine-tuning methods in various tasks, from image classification to multimodal language model training to large-scale motion prediction. Moreover, PROFIT is encapsulated as a modular optimizer, which makes it easy to integrate directly into any training pipeline with minimal engineering effort.
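
The abstract does not spell out the temporal gradient-orthogonalization step. As a rough illustration only of what orthogonalizing one gradient against another means, and not PROFIT's actual update rule, the sketch below removes from a gradient `g` its component along a reference direction `ref`. The function name `orthogonalize`, its argument names, and the `eps` guard are hypothetical and do not come from the paper.

```python
def orthogonalize(g, ref, eps=1e-12):
    """Return g minus its projection onto ref (a single Gram-Schmidt step).

    Illustrative assumption: g and ref are flat lists of floats of equal
    length. This is NOT the PROFIT update, only the generic projection.
    """
    dot = sum(a * b for a, b in zip(g, ref))
    norm_sq = sum(b * b for b in ref)
    if norm_sq < eps:
        # ref is (numerically) zero, so there is nothing to project out.
        return list(g)
    scale = dot / norm_sq
    return [a - scale * b for a, b in zip(g, ref)]


# Hypothetical usage: project out the x-axis component of g.
g_orth = orthogonalize([1.0, 1.0], [1.0, 0.0])
```

After this step the result has zero inner product with `ref`, which is the defining property of an orthogonalized gradient. How PROFIT chooses the reference direction over time is described in the paper itself and is not reproduced here.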

Keywords

cs.CV
