-
at inference, we can technically stick with static scaling
-
need to call
precompute_float8_dynamic_scale_for_fsdp
for correct dynamic scaling
Jul 27, 20241 min read
at inference, we can technically stick with static scaling
need to call precompute_float8_dynamic_scale_for_fsdp
for correct dynamic scaling