Quantization in depth

Quantization:

The method for representing the weights of models into lower precision in order to maximize the usability and lower the resources consumption.

Untitled

method of mapping the large set to the smaller set of values.

Untitled

Untitled

higher precision range to the lower percision range.

$$ r = s(q-z) $$