Reinforcement Learning with LLM-Guided Action Spaces for Synthesizable Lead Optimization | Tao Li et al. | ResearchPod