arXiv
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
Tao Liu, Taiqiang Wu, Runming Yang
Jan 19, 2026 · 08:57
Supervised Fine-Tuning (SFT) · Single-Reference Overfitting · Token Probability and Semantic Importance · ProFit Method · High-Value Signals