Post

FP8 Quantization using TensorRT

FP8 Quantization using TensorRT

FP8 Quantization with TensorRT

Model Optimization

Calibration Process

Attention Fusion Verification

Verify Fusion in Profiler Output

This post is licensed under CC BY 4.0 by the author.