“Algorithms such as DAPO, GSPO from the Qwen lab, and Syspo represent significant improvements upon the original GRPO algorithm.”