PilotBench: A Benchmark for General Aviation Agents with Safety Constraints | Yalun Wu et al. | ResearchPod