From 2D Grids to 1D Tokens: Reforming Shared Representations for Multimodal Image Fusion | ResearchPod