package tensorflow.quantization

Mouse Melon logoGet desktop application:
View/edit binary Protocol Buffers messages

enum OpSet

quantization_options.proto:101

List of supported opsets to deploy the quantized model. The quantized model contains different set of ops depending on the opset.

Used in: QuantizationOptions

message QuantizationMethod

quantization_options.proto:26

Model quantization method for optimization. Various techniques for model quantization are defined within this message along with a field that specifies a method to be used for a particular quantization request.

Used in: QuantizationOptions

enum QuantizationMethod.ExperimentalMethod

quantization_options.proto:36

Experimental quantization methods. These methods are either not implemented or provided with an unstable behavior.

Used in: QuantizationMethod

enum QuantizationMethod.Method

quantization_options.proto:28

Quantization methods that are supported as a stable API.

Used in: QuantizationMethod

message QuantizationOptions

quantization_options.proto:119

Defines various options to specify and control the behavior of the quantizer. It consists of 1) Model-wise quantization configuration as a default configuration. If it is None, the default configuration is "do not quantize the model". 2) A set of supported operations. 3) Unit wise quantization precision. 4) Target hardware name.

enum QuantizationPrecision

quantization_options.proto:59

Quantization precisions. If the specified quantization precision is not available, our quantizer needs to raise an error.

Used in: QuantizationOptions, UnitWiseQuantizationPrecision

message UnitWiseQuantizationPrecision

quantization_options.proto:74

Unit (either nodes or ops at this moment) wise quantization method for mixed bit precision quantization. It contains the name of the unit, the granularity of the unit, and the quantization method for each unit.

Used in: QuantizationOptions

enum UnitWiseQuantizationPrecision.UnitType

quantization_options.proto:76

Quantization unit granularity.

Used in: UnitWiseQuantizationPrecision