“GitHub's key metric for evaluating new Copilot models is 'accepted and retained characters,' tracked through offline and online testing.”