Brandon Foodi claims that many types of Supervised Fine-Tuning (SFT) and Reinforcement Learning f..., Sonic AI
“Brandon Foodi claims that many types of Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) data are no longer as useful as they once were, and corporate budgets for them are decreasing.”