多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants

多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第1张图片

论文作者:Chunyuan Li,Zhe Gan,Zhengyuan Yang,Jianwei Yang,Linjie Li,Lijuan Wang,Jianfeng Gao

作者单位:Microsoft Corporation

论文链接:http://arxiv.org/abs/2309.10020v1

项目链接:https://vlp-tutorial.github.io/2023/

内容简介:

这篇论文全面调查了展示视觉和视觉语言能力的多模态基础模型的分类法和演变,重点关注从专业模型向通用助手的过渡。研究领域包括五个核心主题,分为两类。首先,对已经建立的研究领域进行了调查,包括为特定目的预训练的多模态基础模型,涵盖了学习视觉主干以实现视觉理解和文本到图像生成的方法。其次,介绍了最近在探索性、开放性研究领域取得的进展,包括旨在扮演通用助手角色的多模态基础模型,其中包括受到大型语言模型(LLMs)启发的统一视觉模型、多模态LLMs的端到端训练,以及将多模态工具与LLMs链接起来。本文的目标受众是计算机视觉和视觉语言多模态社区的研究人员、研究生和专业人士,他们渴望了解多模态基础模型的基础知识和最新进展。多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第2张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第3张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第4张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第5张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第6张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第7张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第8张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第9张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第10张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第11张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第12张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第13张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第14张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第15张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第16张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第17张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第18张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第19张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第20张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第21张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第22张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第23张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第24张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第25张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第26张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第27张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第28张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第29张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第30张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第31张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第32张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第33张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第34张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第35张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第36张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第37张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第38张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第39张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第40张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第41张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第42张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第43张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第44张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第45张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第46张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第47张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第48张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第49张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第50张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第51张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第52张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第53张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第54张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第55张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第56张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第57张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第58张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第59张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第60张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第61张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第62张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第63张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第64张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第65张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第66张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第67张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第68张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第69张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第70张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第71张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第72张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第73张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第74张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第75张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第76张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第77张图片多模态大模型:Multimodal Foundation Models: From Specialists to General-Purpose Assistants_第78张图片

你可能感兴趣的:(大模型,多模态大模型,人工智能,计算机视觉)