Sonic AI
AlphaGo's policy network is trained to imitate the entire MCTS visit count distribution, which is...