Aiding Complex Multimodal Reasoning with Contextual and Structural InformationAyyubi, Hammad Abdullah
Multimodal Reasoning with Fine-grained Knowledge RepresentationWang, Zhecan