Ultrascale Playbook - Expert Parallelism
Notes on training LLMs using expert parallelism
Notes on training LLMs using expert parallelism
Notes on training LLMs using context parallelism
Notes on training LLMs using tensor and sequence parallelism
Notes on training LLMs using pipeline parallelism
My talk on training LLMs at Pydata MCR