FP8 Quantization using TensorRT
FP8 Quantization using TensorRT
FP8 Quantization with TensorRT
Model Optimization
Calibration Process
Attention Fusion Verification
Verify Fusion in Profiler Output
This post is licensed under CC BY 4.0 by the author.