Multimodal Machine Learning:A Survey and Taxonomy
Abstractaim:buildmodelsthatcanprocessandrelateinformationfrommultiplemodalitiesnewtaxonomy:representation,translation,alignment,fusion,andco-learning.INTRODUCTIONthreemodalities:(nlp)naturallanguagewh