Vision foundation models
Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel, "Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models", arXiv:2311.18237, 2023. [PDF]
Mohammadreza Salehi, Mehrdad Farajtabar, Maxwell Horton, Fartash Faghri, Hadi Pouransari, Raviteja Vemulapalli, Oncel Tuzel, Ali Farhadi, Mohammad Rastegari, Sachin Mehta, "CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement", arXiv:2310.14108, 2023. [PDF]
Mohammad Samragh, Mehrdad Farajtabar, Sachin Mehta, Raviteja Vemulapalli, Fartash Faghri, Devang Naik, Oncel Tuzel, Mohammad Rastegari, "Weight subcloning: direct initialization of transformers using larger pretrained ones", arXiv:2312.09299, 2023. [PDF]
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training", CVPR, 2024. [PDF]
Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari, "SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding", eLVM workshop, CVPR 2024. [PDF]
Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri, "TiC-CLIP: Continual Training of CLIP Models", ICLR, 2024. [PDF]