When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning? | Xuanfei Ren, Tengyang Xie | ResearchPod