Ultrascale Playbook - Tensor and Sequence Parallelism
Notes on training LLMs using tensor and sequence parallelism