Learning from the Self-future: On-policy Self-distillation for dLLMs | Yifu Luo et al. | ResearchPod