Agent Skills: deepspeed
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
UncategorizedID: davila7/claude-code-templates/deepspeed
19,6461,834
Install this agent skill to your local
Skill Files
Browse the full folder contents for deepspeed.
Loading file tree…
Select a file to preview its contents.