From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion | Yuchen Xian et al. | ResearchPod