TorchAO: A PyTorch-Native Shortcut To Smaller, Faster Models

TorchAO is PyTorch's native toolkit for model efficiency: it unifies post-training quantization (PTQ), quantization-aware training (QAT), float8 (FP8) training, and structured sparsity in one coherent...

Tags: deep learning, FP8, model efficiency, open source, PyTorch, QAT, quantization, sparsity, TorchAO
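As a minimal sketch of the PTQ workflow the post refers to, assuming a recent torchao release (exact config names vary across versions, and the toy model here is purely illustrative):

```python
# Minimal post-training quantization sketch with torchao.
# Assumes a recent torchao release; config names differ between versions.
import torch
import torch.nn as nn
from torchao.quantization import quantize_, int8_weight_only

# Toy model standing in for a real network (illustrative only).
model = nn.Sequential(
    nn.Linear(1024, 4096),
    nn.ReLU(),
    nn.Linear(4096, 1024),
).eval()

# Swap eligible layers (here, the Linear weights) to int8 in place.
quantize_(model, int8_weight_only())

# Inference proceeds as usual; weights are now stored in int8.
x = torch.randn(8, 1024)
with torch.no_grad():
    out = model(x)
print(out.shape)  # torch.Size([8, 1024])
```

The same `quantize_` entry point accepts other configs (int4 weight-only, int8 dynamic activation, and so on), which is what makes the PTQ, QAT, and sparsity paths feel like one API.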
How MXFP8, TorchAO, and TorchTitan Boost Large-Scale AI Training on Crusoe B200

Modern AI models are growing larger and more complex, demanding new solutions to speed up training without compromising accuracy. Recent experiments on the Crusoe B200 cluster, using 1,856 GPUs, show...

Tags: AI acceleration, Crusoe B200, float8, large-scale training, MXFP8, PyTorch, quantization, TorchAO
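A hedged sketch of the low-precision training entry point in torchao that runs like this build on. MXFP8 itself lives in prototype modules and targets Blackwell-class GPUs, so the stable `convert_to_float8_training` API is shown instead; the model, loss, and hyperparameters are illustrative assumptions, not the configuration used in the experiments:

```python
# Sketch: enabling float8 training with torchao.
# Requires a CUDA GPU with FP8 support (e.g., H100 or newer).
import torch
import torch.nn as nn
from torchao.float8 import convert_to_float8_training

# Toy model (illustrative); float8 linears want dims divisible by 16.
model = nn.Sequential(
    nn.Linear(4096, 4096),
    nn.ReLU(),
    nn.Linear(4096, 4096),
).cuda()

# Swap nn.Linear modules for float8 variants: matmuls run in FP8 while
# master weights and optimizer state stay in higher precision.
convert_to_float8_training(model)

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(16, 4096, device="cuda")
loss = model(x).pow(2).mean()  # toy loss for illustration
loss.backward()
optimizer.step()
```

In a TorchTitan setup the same conversion is driven by the training config rather than called by hand, which is how the recipe scales out to thousands of GPUs.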