VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? | ResearchPod