JALAL, Hafiza Dua; ASLAM, Saba; SULTAN, Muhammad Hasnain; RAEE, Ghulam Muhy Ud Deen; AZAM, Muhammad; MALIK, Mubasher Hussain. Cross-Modal Knowledge Mining Leveraging Multimodal Large Language Models for Automated Video Scene Understanding and Event Detection. NextGen AI & Computing Journal, [S. l.], v. 1, n. 1, p. 102–131, 2026. DOI: 10.5281/zenodo.20461727. Available at: https://scientia-nexus.org/index.php/nac/article/view/16. Accessed: 4 Jun. 2026.