On four GLUE tasks and text-normalization, we observe evidence of capacity limitations and interference