“Edwin Chen estimates that data spend as a percentage of total model training cost varies widely among labs, from as low as 1% to around 10%.”