Attention Please: What Transformer Models Really Learn for Process Prediction | Martin Käppel et al. | ResearchPod