“Google began applying deep neural networks to mathematics problems as early as 2016, prior to the development of the Transformer architecture.”