CAUSALMIX: Data Mixture as Causal Inference for Language Model Training | ResearchPod