PyTorch CUDA benchmark. yaml and store the results in runs/cuda_pytorch_bert.