AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding
ServiceNow
·
Jun 02, 2025
·
video
View original source
https://www.youtube.com/watch?v=9GGsU8l3gYU