Policy and World Modeling Co-Training for Language Agents | ResearchPod