The revolution of multimodal large language models: a survey
Published in Findings of the Association for Computational Linguistics (ACL F), 2024
This paper provides a comprehensive survey of the rapidly evolving field of multimodal large language models, examining their architectures, capabilities, and applications.
Recommended citation: D. Caffagni, F. Cocchi, L. Barsellotti, N. Moratelli, S. Sarto, L. Baraldi, et al. (2024). "The revolution of multimodal large language models: a survey." arXiv preprint arXiv:2402.12451.
Download Paper