Toward Training Superintelligent Software Agents through Self-Play SWE-RL | Yuxiang Wei; Zhiqing Sun; Emily McMilin; Jonas Gehring; David Zhang; Gabriel Synnaeve; Daniel Fried; Lingming Zhang; Sida Wang | ResearchPod