Text-Vision Co-Instructed Image Editing | Chenxi Xie et al. | ResearchPod