Learning from the Self-future: On-policy Self-distillation for dLLMs | ResearchPod
