Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Please cite with the following BibTeX:

@article{caffagni2025seeing,
  title={{Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models}},
  author={Caffagni, Davide and Sarto, Sara and Cornia, Marcella and Baraldi, Lorenzo and Dovesi, Pier Luigi and Roohi, Shaghayegh and Granroth-Wilding, Mark and Cucchiara, Rita},
  journal={arXiv preprint arXiv:X.X},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
model.png		model.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Coming Soon

About

Uh oh!

Releases

Packages

Uh oh!

aimagelab/JARVIS

Folders and files

Latest commit

History

Repository files navigation

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Coming Soon

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages