Text-Vision Co-Instructed Image Editing | ResearchPod