(1)

Jalal, H. D.; Aslam, S.; Sultan, M. H.; Raee, G. M. U. D.; Azam, M.; Malik, M. H. Cross-Modal Knowledge Mining Leveraging Multimodal Large Language Models for Automated Video Scene Understanding and Event Detection. NAC 2026, 1 (1), 102-131. https://doi.org/10.5281/zenodo.20461727.