arXiv
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
Yufeng Zhong, Lei Chen, Xuanle Zhao
Jan 30, 2026·05:43·
00:0005:43
Holistic OCRText-centric OCRVision-centric OCROCRVerseData EngineeringTwo-stage SFT-RL Training
Turn any paper into a podcast
ResearchPod turns research papers into podcasts you can actually follow.
Download on the App Store