“Minimax utilizes its in-house expert developers as a source for reward models in its training pipeline.”