Learning from the Self-future: On-policy Self-distillation for dLLMs
Yifu Luo et al.
4 min
On-policy Self-distillation (opsd)
0:00 / 4:20
Listen on ResearchPod
→