“The Protein Data Bank (PDB), the dataset used to train AlphaFold, is relatively small, containing less than a terabyte of protein structure data.”