SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment | Sihang Jiang et al. | ResearchPod