EarlyTom: Early Token Compression Completes Fast Video Understanding | Hesong Wang et al. | ResearchPod