package tflite.proto

An error that occurred during benchmarking. Used with event type ERROR.

optional BenchmarkStage stage = 1
How far benchmarking got.
optional int32 exit_code = 2
Process exit code.
optional int32 signal = 3
Signal the process received.
repeated ErrorCode error_code = 4
Handled tflite error.
optional int32 mini_benchmark_error_code = 5
Mini-benchmark error code.

Top-level benchmarking event stored on-device. All events for a model are parsed to detect the status.

Used in: BestAccelerationDecision, MiniBenchmarkEvent

optional TFLiteSettings tflite_settings = 1
Which settings were used for benchmarking.
optional BenchmarkEventType event_type = 2
Type of the event.
optional BenchmarkResult result = 3
Result of benchmark, used when type is END.
optional BenchmarkError error = 4
Error during benchmark, used when type is ERROR.
optional int64 boottime_us = 5
Start timestamps. These are used for:
1. Checking whether a test was started but not completed within a given deadline.
2. Optionally, telemetry timestamps.
optional int64 wallclock_us = 6
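
The deadline check described for the start timestamps could look roughly like the sketch below. It is plain Python, not the TFLite implementation: the `Event` dataclass, the `timed_out` function, the string event types, and the assumption that events arrive in chronological order are all hypothetical stand-ins for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Event:
    # Hypothetical stand-in for BenchmarkEvent; only the fields used here.
    event_type: str   # "START", "END" or "ERROR"
    boottime_us: int  # start timestamp in microseconds


def timed_out(events: List[Event], now_boottime_us: int, deadline_us: int) -> bool:
    """Return True if the latest START has no later END/ERROR and the deadline passed.

    Assumes `events` is ordered oldest to newest.
    """
    open_start: Optional[Event] = None
    for event in events:
        if event.event_type == "START":
            open_start = event
        elif event.event_type in ("END", "ERROR"):
            open_start = None  # the run that was started has finished
    if open_start is None:
        return False
    return now_boottime_us - open_start.boottime_us > deadline_us
```

A run with only a START event recorded at 0 and checked at 2,000,000 us against a 1,000,000 us deadline would be reported as timed out.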

Which stage of benchmarking the event is for. There might be multiple events with the same type, if a benchmark is run multiple times.

Used in: BenchmarkEvent

UNDEFINED_BENCHMARK_EVENT_TYPE = 0
START = 1
Benchmark start. A start without an end can be interpreted as a test that has crashed or hung.
END = 2
Benchmarking completion. A model was successfully loaded, acceleration configured and inference run without errors. There may still be an issue with correctness of results, or with performance.
ERROR = 3
Benchmark was not completed due to an error. The error may be a handled error (e.g., failure in a delegate), or a crash.
LOGGED = 4
Benchmark data has been sent for logging.
RECOVERED_ERROR = 5
Benchmark encountered an error but was able to continue. The error is not related to the model execution but to the mini-benchmark logic. An example of error is a failure when trying to set the CPU affinity of the benchmark runner process.
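
One way to read the event types above when deciding the status of a single run is sketched below. The function name and the returned status strings are hypothetical, and the real parsing logic may weigh events differently (for example across repeated runs).

```python
from typing import List


def run_status(event_types: List[str]) -> str:
    """Summarise one benchmark run from the names of its recorded event types."""
    if "ERROR" in event_types:
        return "failed"           # handled error or crash was reported
    if "END" in event_types:
        return "completed"        # model loaded and inference ran
    if "START" in event_types:
        return "crashed_or_hung"  # a start without an end
    return "not_started"
```

RECOVERED_ERROR and LOGGED do not change the outcome in this sketch, since the descriptions above say the benchmark continues after a recovered error and LOGGED only marks that data was sent.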

Represent a failure during the initialization of the mini-benchmark.

Used in: MiniBenchmarkEvent

optional int32 initialization_status = 1
Status code returned by the mini-benchmark initialization function.

A correctness metric from a benchmark, for example KL-divergence between known-good CPU output and on-device output. These are primarily used for telemetry and monitored server-side.

Used in: BenchmarkResult

optional string name = 1
repeated float values = 2

Outcome of a successfully completed benchmark run. This information is intended both to be used on-device to select the best compute configuration and to be sent to the server for monitoring. Used with event type END.

Used in: BenchmarkEvent

repeated int64 initialization_time_us = 1
Time to load model and apply acceleration. Initialization may get run multiple times to get information on variance.
repeated int64 inference_time_us = 2
Time to run inference (call Invoke()). Inference may get run multiple times to get information on variance.
optional int32 max_memory_kb = 3
Maximum memory used. Measures size of application heap (does not necessarily take into account driver-side allocation).
optional bool ok = 4
Whether the inference produced correct results (validation graph output 'ok' for all test inputs). Used on-device to disallow configurations that produce incorrect results (e.g., due to OpenCL driver bugs).
repeated BenchmarkMetric metrics = 5
Metrics that were used to determine the 'ok' status.
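
Because initialization_time_us and inference_time_us are repeated to expose variance, a consumer might reduce them to a mean and a spread. The sketch below assumes population standard deviation as the spread measure; the function name is hypothetical and the schema does not prescribe any particular statistic.

```python
import statistics
from typing import List, Tuple


def summarise_times(times_us: List[int]) -> Tuple[float, float]:
    """Return (mean, population standard deviation) of repeated timings in microseconds."""
    if not times_us:
        raise ValueError("no timings recorded")
    return statistics.mean(times_us), statistics.pstdev(times_us)
```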

When during benchmark execution an error occurred.

Used in: BenchmarkError

UNKNOWN = 0
INITIALIZATION = 1
During model loading or delegation.
INFERENCE = 2
During inference.

Where to store mini-benchmark state.

Used in: MinibenchmarkSettings

optional string storage_file_path = 1
Base path to the files used to store benchmark results in. Two files will be generated: one with the given path and an extra file to store events related to best acceleration results at path storage_file_path + ".extra.fb". Must be specific to the model. Note on Android, this should be the code cache directory.
optional string data_directory_path = 2
Path to a directory for intermediate files (lock files, extracted binaries). Note on Android, this typically is the data cache directory (i.e. the one returned by `getCacheDir()`).
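
The two result files derived from storage_file_path can be illustrated with a small helper. The function name and the example path are hypothetical; only the ".extra.fb" suffix comes from the description above.

```python
from typing import Tuple


def benchmark_storage_files(storage_file_path: str) -> Tuple[str, str]:
    """Return (main results file, extra file for best-acceleration events)."""
    return storage_file_path, storage_file_path + ".extra.fb"
```

For example, a base path of "/code_cache/model_a.fb" yields "/code_cache/model_a.fb" and "/code_cache/model_a.fb.extra.fb".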

Represent the decision on the best acceleration from the mini-benchmark.

Used in: MiniBenchmarkEvent

optional int32 number_of_source_events = 1
Number of events used to take the decision. Only the size is stored, instead of the full list of events, to save space.
optional BenchmarkEvent min_latency_event = 2
Event with min latency in the source ones.
optional int64 min_inference_time_us = 3
Min latency as read from min_latency_event.
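
A minimal sketch of how such a decision could be derived from completed benchmark events follows. The `EndEvent` dataclass and `pick_min_latency` function are hypothetical. Two choices in it are assumptions rather than documented behaviour: the latency of an event is taken as the integer mean of its repeated inference times, and only events with ok set to true count as source events.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class EndEvent:
    # Hypothetical stand-in for a BenchmarkEvent of type END.
    settings_name: str
    ok: bool
    inference_time_us: List[int] = field(default_factory=list)


def latency_us(event: EndEvent) -> int:
    # Assumption: integer mean of the repeated inference timings.
    return sum(event.inference_time_us) // len(event.inference_time_us)


def pick_min_latency(events: List[EndEvent]) -> Optional[Tuple[int, EndEvent, int]]:
    """Return (number_of_source_events, min_latency_event, min_inference_time_us)."""
    candidates = [e for e in events if e.ok and e.inference_time_us]
    if not candidates:
        return None
    best = min(candidates, key=latency_us)
    return len(candidates), best, latency_us(best)
```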

Used in: TFLiteSettings

optional int32 num_threads = 1
Set to -1 to let the interpreter choose. Otherwise, must be > 0.

Indicates the type and a human readable text for an error in an operation.

Used in: OpCompatibilityResult

optional CompatibilityFailureType failure_type = 1
Type of the errors.
optional string description = 2
Human readable message explaining the error.

Used in: CompatibilityFailure

DCC_UNSUPPORTED_QUANTIZATION_PARAMETERS = 0
Quantization scale and/or zero point are not in the supported value(s) for the accelerated operation. Applied DDC(s): NNAPI
DCC_INVALID_ARGUMENT = 1
Indicates that the caller specified an invalid argument, such as incorrect stride values. Applied DDC(s): GPU
DCC_INTERNAL_ERROR = 2
Indicates an internal error has occurred and some invariants expected by the underlying system have not been satisfied, such as expecting a different number of input or output tensors. Applied DDC(s): GPU
DCC_UNIMPLEMENTED_ERROR = 3
Indicates the operation is not implemented or supported in this service. In this case, the operation should not be re-attempted. Applied DDC(s): GPU
DCC_OUT_OF_RANGE = 4
Indicates the operation was attempted past the valid range, such as requesting an index that goes beyond the array size. Applied DDC(s): GPU
DCC_UNSUPPORTED_OPERATOR = 5
The operator is not supported by the Delegate. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_VERSION = 6
The given operation or operands are not supported on the specified runtime feature level. The min supported version is specified in the compatibility failure message. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OPERATOR_VERSION = 7
The version of the operator (value of OpSignature.version) for the given op is not supported. The max supported version is specified in the compatibility failure message. For more details on each operator version see the GetBuiltinOperatorVersion function in third_party/tensorflow/lite/tools/versioning/op_version.cc. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_INPUT_TYPE = 8
The given input operand type is not supported for the current combination of operator type and runtime feature level. Applied DDC(s): NNAPI
DCC_NOT_RESTRICTED_SCALE_COMPLIANT = 9
When using NN API version 1.0 or 1.1, the condition input_scale * filter_scale < output_scale must be true for quantized versions of the following ops:
* CONV_2D
* DEPTHWISE_CONV_2D
* FULLY_CONNECTED (where filter actually stands for weights)
The condition is relaxed and no longer required since version 1.2. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OUTPUT_TYPE = 10
The given output operand type is not supported for the current combination of operator type and runtime feature level. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OPERAND_SIZE = 11
The size of the operand tensor is too large. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OPERAND_VALUE = 12
The value of one of the operands or of a combination of operands is not supported. Details are provided in the compatibility failure message. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_HYBRID_OPERATOR = 13
The combination of float inputs and quantized weights or filters is not supported. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_QUANTIZATION_TYPE = 14
The quantization type (for example per-channel quantization) is not supported. Applied DDC(s): NNAPI
DCC_MISSING_REQUIRED_OPERAND = 15
The accelerated version of operation requires a specific operand to be specified. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OPERAND_RANK = 16
The rank of the operand is not supported. Details in the compatibility failure message. Applied DDC(s): NNAPI
DCC_INPUT_TENSOR_SHOULD_HAVE_CONSTANT_SHAPE = 17
The input tensor cannot be dynamically-sized. Applied DDC(s): NNAPI
DCC_UNSUPPORTED_OPERATOR_VARIANT = 18
The operator has a different number of inputs than the one or ones that are supported by NNAPI. Applied DDC(s): NNAPI
DCC_NO_ACTIVATION_EXPECTED = 19
The accelerated version of the operator cannot specify an activation function. Applied DDC(s): NNAPI
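
The condition behind DCC_NOT_RESTRICTED_SCALE_COMPLIANT can be expressed directly. The sketch below is only an illustration of the inequality stated above; the function name and the tuple encoding of the NN API version are hypothetical.

```python
from typing import Tuple


def is_restricted_scale_compliant(
    input_scale: float,
    filter_scale: float,
    output_scale: float,
    nnapi_version: Tuple[int, int] = (1, 1),
) -> bool:
    """Check input_scale * filter_scale < output_scale for NN API 1.0 and 1.1.

    From version 1.2 onwards the condition is no longer required.
    """
    if nnapi_version >= (1, 2):
        return True
    return input_scale * filter_scale < output_scale
```

With input and filter scales of 0.5 the product is 0.25, so an output scale of 0.3 passes on version 1.1 while an output scale of 0.2 does not.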

repeated OpCompatibilityResult compatibility_results = 1
One result for each operation.

One possible acceleration configuration.

optional ExecutionPreference preference = 1
Which preference to use this accelerator for.
optional TFLiteSettings tflite_settings = 2
How to configure TFLite
optional string model_namespace_for_statistics = 3
Identifiers to use for instrumentation and telemetry.
optional string model_identifier_for_statistics = 4
optional MinibenchmarkSettings settings_to_test_locally = 5
'Maybe' acceleration: use mini-benchmark to select settings.

Coral Dev Board / USB accelerator delegate settings. See https://github.com/google-coral/edgetpu/blob/master/libedgetpu/edgetpu_c.h

Used in: TFLiteSettings

optional string device = 1
The Edge Tpu device to be used. See https://github.com/google-coral/libcoral/blob/982426546dfa10128376d0c24fd8a8b161daac97/coral/tflite_utils.h#L131-L137
optional CoralSettings.Performance performance = 2
The desired performance level. This setting adjusts the internal clock rate to achieve different performance / power balance. Higher performance values improve speed, but increase power usage.
optional bool usb_always_dfu = 3
If true, always perform device firmware update (DFU) after reset. DFU is usually only necessary after power cycle.
optional int32 usb_max_bulk_in_queue_length = 4
The maximum bulk in queue length. Larger queue length may improve USB performance on the direction from device to host. When not specified (or zero), `usb_max_bulk_in_queue_length` will default to 32 according to the current EdgeTpu Coral implementation.

Used in: CoralSettings

UNDEFINED = 0
MAXIMUM = 1
HIGH = 2
MEDIUM = 3
LOW = 4

CoreML Delegate settings. See https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/coreml/coreml_delegate.h

Used in: TFLiteSettings

optional CoreMLSettings.EnabledDevices enabled_devices = 1
Only create delegate when Neural Engine is available on the device.
optional int32 coreml_version = 2
Specifies target Core ML version for model conversion. Core ML 3 comes with a lot more ops, but some ops (e.g. reshape) are not delegated due to input rank constraints. If not set to one of the valid versions, the delegate will use the highest version possible on the platform. Valid versions: (2, 3)
optional int32 max_delegated_partitions = 3
This sets the maximum number of Core ML delegates created. Each graph corresponds to one delegated node subset in the TFLite model. Set this to 0 to delegate all possible partitions.
optional int32 min_nodes_per_partition = 4
This sets the minimum number of nodes per partition delegated with Core ML delegate. Defaults to 2.

Note the enum order change from the above header for better proto practice.

Used in: CoreMLSettings

DEVICES_ALL = 0
Always create Core ML delegate.
DEVICES_WITH_NEURAL_ENGINE = 1
Create Core ML delegate only on devices with Apple Neural Engine.

TFLite accelerator to use.

Used in: ErrorCode, TFLiteSettings

NONE = 0
NNAPI = 1
GPU = 2
HEXAGON = 3
XNNPACK = 4
EDGETPU = 5
The EdgeTpu in Pixel devices.
EDGETPU_CORAL = 6
The Coral EdgeTpu Dev Board / USB accelerator.
CORE_ML = 7
Apple CoreML.

EdgeTPU device spec.

Used in: EdgeTpuSettings

optional EdgeTpuDeviceSpec.PlatformType platform_type = 1
Execution platform for the EdgeTPU device.
optional int32 num_chips = 2
Number of chips to use for the EdgeTPU device.
repeated string device_paths = 3
Paths to the EdgeTPU devices.
optional int32 chip_family = 4
Chip family used by the EdgeTpu device.

EdgeTPU platform types.

Used in: EdgeTpuDeviceSpec

MMIO = 0
REFERENCE = 1
SIMULATOR = 2
REMOTE_SIMULATOR = 3

Used in: EdgeTpuSettings

optional EdgeTpuPowerState inactive_power_state = 1
Inactive power states between inferences.
optional int64 inactive_timeout_us = 2
Inactive timeout in microseconds between inferences.

Generic definitions of EdgeTPU power states.

Used in: EdgeTpuInactivePowerConfig, EdgeTpuSettings

UNDEFINED_POWERSTATE = 0
Undefined power state.
TPU_CORE_OFF = 1
TPU core is off but control cluster is on.
READY = 2
A non-active low-power state that has much smaller transition time to active compared to off.
ACTIVE_MIN_POWER = 3
Minimum power active state.
ACTIVE_VERY_LOW_POWER = 4
Very low performance, very low power.
ACTIVE_LOW_POWER = 5
Low performance, low power.
ACTIVE = 6
The normal performance and power. This setting usually provides the optimal perf/power trade-off for the average use-case.
OVER_DRIVE = 7
Maximum performance level. Potentially higher power and thermal. This setting may not be allowed in production depending on the system.

EdgeTPU Delegate settings.

Used in: TFLiteSettings

optional EdgeTpuPowerState inference_power_state = 1
Target inference power state for running the model.
repeated EdgeTpuInactivePowerConfig inactive_power_configs = 2
Inactive power states between inferences.
optional int32 inference_priority = 3
Priority for the inference request.
optional EdgeTpuDeviceSpec edgetpu_device_spec = 4
Device spec for creating the EdgeTpu device.
optional string model_token = 5
A unique identifier of the input TfLite model.
optional EdgeTpuSettings.FloatTruncationType float_truncation_type = 6
Float truncation type for EdgeTPU.
optional EdgeTpuSettings.QosClass qos_class = 7
QoS class to determine chunking size for PRO onward.

Float truncation types for EdgeTPU.

Used in: EdgeTpuSettings

UNSPECIFIED = 0
NO_TRUNCATION = 1
BFLOAT16 = 2
HALF = 3

Used in: EdgeTpuSettings

QOS_UNDEFINED = 0
BEST_EFFORT = 1
REALTIME = 2

A handled error.

Used in: BenchmarkError

optional Delegate source = 1
Which delegate the error comes from (or NONE, if it comes from the tflite framework).
optional int32 tflite_error = 2
What the tflite level error is.
optional int64 underlying_api_error = 3
What the underlying error is (e.g., NNAPI or OpenGL error).

ExecutionPreference is used to match accelerators against the preferences of the current application or usecase. Some of the values here can appear both in the compatibility list and as input, some only as input. These are separate from NNAPIExecutionPreference - the compatibility list design doesn't assume a one-to-one mapping between which usecases compatibility list entries have been developed for and what settings are used for NNAPI.

Used in: ComputeSettings

ANY = 0
Match any selected preference. Allowlist (semantically - value is same as on input).
LOW_LATENCY = 1
Match low latency preference. Both compatibility list and input.
LOW_POWER = 2
Match low power preference. Both compatibility list and input.
FORCE_CPU = 3
Never accelerate. Can be used for input to compatibility list or for standalone Acceleration configuration.

Whether to automatically fallback to TFLite CPU path on delegation errors. Typically fallback is enabled in production use but disabled in tests and benchmarks to ensure they test the intended path.

Used in: NNAPISettings, TFLiteSettings

optional bool allow_automatic_fallback_on_compilation_error = 7
Whether to allow automatically falling back to TfLite CPU path on compilation failure. Default is not allowing automatic fallback. This is useful in naive production usecases where the caller would prefer for the model to run even if it's not accelerated. More advanced users will implement fallback themselves; e.g., by using a different model on CPU. Note that compilation errors may occur either at initial ModifyGraphWithDelegate() time, or when calling AllocateTensors() after resizing.
optional bool allow_automatic_fallback_on_execution_error = 8
Whether to allow automatically falling back to TfLite CPU path on execution error. Default is not allowing automatic fallback. Experimental, use with care (only when you have complete control over the client code). The caveat above for compilation error holds. Additionally, execution-time errors are harder to handle automatically as they require invalidating the TfLite interpreter which most client code has not been designed to deal with.

Which GPU backend to select. Default behaviour on Android is to try OpenCL and if it's not available fall back to OpenGL.

Used in: GPUSettings

UNSET = 0
OPENCL = 1
OPENGL = 2
Not yet supported. VULKAN = 3; METAL = 4;

GPU inference priorities define relative priorities given by the GPU delegate to different client needs. Corresponds to TfLiteGpuInferencePriority.

Used in: GPUSettings

GPU_PRIORITY_AUTO = 0
GPU_PRIORITY_MAX_PRECISION = 1
GPU_PRIORITY_MIN_LATENCY = 2
GPU_PRIORITY_MIN_MEMORY_USAGE = 3

GPU inference preference for initialization time vs. inference time. Corresponds to TfLiteGpuInferenceUsage.

Used in: GPUSettings

GPU_INFERENCE_PREFERENCE_FAST_SINGLE_ANSWER = 0
Delegate will be used only once, therefore, bootstrap/init time should be taken into account.
GPU_INFERENCE_PREFERENCE_SUSTAINED_SPEED = 1
Prefer maximizing the throughput. Same delegate will be used repeatedly on multiple inputs.

GPU Delegate settings. See https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/gpu/delegate.h

Used in: TFLiteSettings

optional bool is_precision_loss_allowed = 1
Ignored if inference_priority1/2/3 are set.
optional bool enable_quantized_inference = 2
optional GPUBackend force_backend = 3
optional GPUInferencePriority inference_priority1 = 4
Ordered priorities provide better control over desired semantics, where priority(n) is more important than priority(n+1). Therefore, each time inference engine needs to make a decision, it uses ordered priorities to do so. Default values correspond to GPU_PRIORITY_AUTO. AUTO priority can only be used when higher priorities are fully specified. For example:
VALID: priority1 = MIN_LATENCY, priority2 = AUTO, priority3 = AUTO
VALID: priority1 = MIN_LATENCY, priority2 = MAX_PRECISION, priority3 = AUTO
INVALID: priority1 = AUTO, priority2 = MIN_LATENCY, priority3 = AUTO
INVALID: priority1 = MIN_LATENCY, priority2 = AUTO, priority3 = MAX_PRECISION
Invalid priorities will result in error. For more information, see TfLiteGpuDelegateOptionsV2.
optional GPUInferencePriority inference_priority2 = 5
optional GPUInferencePriority inference_priority3 = 6
optional GPUInferenceUsage inference_preference = 7
Whether to optimize for compilation+execution time or execution time only.
optional string cache_directory = 8
Model serialization. Setting both of these fields will also set the TFLITE_GPU_EXPERIMENTAL_FLAGS_ENABLE_SERIALIZATION flag on the delegate. GPU model serialization directory passed in TfLiteGpuDelegateOptionsV2. This should be set to the application's code cache directory so that it cannot be accessed by other apps and is correctly deleted on app updates.
optional string model_token = 9
Normally, the model name with version number should be provided here, since each model needs an unique ID to avoid cache collision.
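
The ordering rule for inference_priority1/2/3 can be checked with a short helper. This sketch implements only the rule as worded above (a non-AUTO priority may not follow an AUTO one); the function name and string constants are hypothetical, and the GPU delegate may apply further checks that are not modelled here.

```python
AUTO = "GPU_PRIORITY_AUTO"
MAX_PRECISION = "GPU_PRIORITY_MAX_PRECISION"
MIN_LATENCY = "GPU_PRIORITY_MIN_LATENCY"


def priorities_ordered(p1: str, p2: str = AUTO, p3: str = AUTO) -> bool:
    """Return True if no specific priority appears after an AUTO priority."""
    seen_auto = False
    for priority in (p1, p2, p3):
        if priority == AUTO:
            seen_auto = True
        elif seen_auto:
            return False  # a specific priority follows AUTO
    # Note: the all-AUTO case passes this ordering check; whether the
    # delegate accepts it is outside the scope of this sketch.
    return True
```

The four VALID/INVALID combinations listed for inference_priority1 give True, True, False and False respectively.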

Hexagon Delegate settings. See https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/hexagon/hexagon_delegate.h

Used in: TFLiteSettings

optional int32 debug_level = 1
optional int32 powersave_level = 2
optional bool print_graph_profile = 3
optional bool print_graph_debug = 4

Events generated by the mini-benchmark before and after triggering the different configuration-specific benchmarks.

Not using oneof because of the way the cpp code is generated. See comment above on TfLite settings for details.

optional bool is_log_flushing_event = 1
If set to true, this event is used to mark all previous events in the mini-benchmark internal storage as read and one of the other fields in this message will have a value.
optional BestAccelerationDecision best_acceleration_decision = 2
Event generated when a best acceleration decision is taken.
optional BenchmarkInitializationFailure initialization_failure = 3
Reports a failure during mini-benchmark initialization.
optional BenchmarkEvent benchmark_event = 4
Event generated while benchmarking the different settings to test locally.

How to run a minibenchmark. Next ID: 5

Used in: ComputeSettings

repeated TFLiteSettings settings_to_test = 1
Which settings to test. This would typically be filled in from an allowlist.
optional ModelFile model_file = 2
How to access the model. This would typically be set dynamically, as it depends on the application folder and/or runtime state.
optional BenchmarkStoragePaths storage_paths = 3
Where to store state. This would typically be set dynamically, as it depends on the application folder.
optional ValidationSettings validation_settings = 4
Validation test related settings.

How to access the model for mini-benchmark. Since mini-benchmark runs in a separate process, it can not access an in-memory model. It can read the model either from a file or from a file descriptor. The file descriptor typically comes from the Android asset manager. Users should set either filename, or all of fd, offset and length.

Used in: MinibenchmarkSettings

optional string filename = 1
Filename for reading model from.
optional int64 fd = 2
File descriptor to read model from.
optional int64 offset = 3
Offset for model in file descriptor.
optional int64 length = 4
Length of model in file descriptor.
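
The rule that either filename or all of fd, offset and length should be set can be validated as sketched below. The function name is hypothetical, and rejecting a message that sets both a filename and file descriptor fields is an assumption; the description above does not say how that case is handled.

```python
from typing import Optional


def model_file_is_valid(
    filename: Optional[str] = None,
    fd: Optional[int] = None,
    offset: Optional[int] = None,
    length: Optional[int] = None,
) -> bool:
    """Accept either a filename alone, or fd, offset and length together."""
    fd_fields = (fd, offset, length)
    has_any_fd_field = any(v is not None for v in fd_fields)
    has_all_fd_fields = all(v is not None for v in fd_fields)
    if filename:
        # Assumption: mixing both access styles is treated as invalid.
        return not has_any_fd_field
    return has_all_fd_fields
```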

Used in: NNAPISettings

UNDEFINED = 0
Undefined.
NNAPI_LOW_POWER = 1
Prefer executing in a way that minimizes battery drain.
NNAPI_FAST_SINGLE_ANSWER = 2
Prefer returning a single answer as fast as possible, even if this causes more power consumption.
NNAPI_SUSTAINED_SPEED = 3
Prefer maximizing the throughput of successive frames, for example when processing successive frames coming from the camera.

Used in: NNAPISettings

NNAPI_PRIORITY_UNDEFINED = 0
NNAPI_PRIORITY_LOW = 1
NNAPI_PRIORITY_MEDIUM = 2
NNAPI_PRIORITY_HIGH = 3

NNAPI delegate settings.

Used in: TFLiteSettings

optional string accelerator_name = 1
Which instance (NNAPI accelerator) to use. One driver may provide several accelerators (though a driver may also hide several back-ends behind one name, at the choice of the driver vendor). Note that driver introspection is only available in Android Q and later.
optional string cache_directory = 2
NNAPI model compilation caching settings to be passed to tflite::StatefulNnApiDelegate
optional string model_token = 3
optional NNAPIExecutionPreference execution_preference = 4
NNAPI execution preference to pass. See https://developer.android.com/ndk/reference/group/neural-networks.html
optional int32 no_of_nnapi_instances_to_cache = 5
Number of instances to cache for the same model (for input size changes). This is mandatory for getting reasonable performance in that case.
optional FallbackSettings fallback_settings = 6
Deprecated; use the fallback_settings in TFLiteSettings. Whether to automatically fall back to TFLite CPU path.
optional bool allow_nnapi_cpu_on_android_10_plus = 7
Whether to allow use of NNAPI CPU (nnapi-reference accelerator) on Android 10+ when an accelerator name is not specified. The NNAPI CPU typically performs less well than the TfLite built-in kernels, but allowing it lets a model be partially accelerated, which may be a win.
optional NNAPIExecutionPriority execution_priority = 8
optional bool allow_dynamic_dimensions = 9
Whether to allow dynamic dimension sizes without re-compilation. A tensor with dynamic dimensions must have a valid dims_signature defined. Only supported in NNAPI 1.1 and newer versions. WARNING: Setting this flag to true may result in the model being rejected by the accelerator. This should only be enabled if the target device supports dynamic dimensions of the model. By default this is set to false.
optional bool allow_fp16_precision_for_fp32 = 10
Whether to allow the NNAPI accelerator to optionally use lower-precision float16 (16-bit floating point) arithmetic when doing calculations on float32 (32-bit floating point).
optional bool use_burst_computation = 11
Whether to use NNAPI Burst mode. Burst mode allows accelerators to efficiently manage resources, which would significantly reduce overhead especially if the same delegate instance is to be used for multiple inferences.
optional int64 support_library_handle = 12
Optional pointer, provided by the NNAPI Support Library, to an NnApiSLDriverImplFL5 which can be used to construct the NNAPI delegate.

Result for one operation of the given model and stores if the operation is supported. If it is supported, validation_failures will not have a value. If it is not supported, validation_failures will contain all the errors for that operation. Also saves the subgraph index inside the model and the operator index inside the subgraph.

Used in: CompatibilityResult

optional bool is_supported = 1
True if the operation is supported for the required DCC.
optional int32 subgraph_index_in_model = 2
Index of the subgraph where this operation is contained.
optional int32 operator_index_in_subgraph = 3
Index of the operator inside the subgraph.
repeated CompatibilityFailure compatibility_failures = 4
Type of the errors.
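
A consumer of these per-operation results might collect the unsupported operations together with their failure types. The sketch below uses plain dictionaries keyed by the field names above as stand-ins for the generated message classes; the function name is hypothetical.

```python
from typing import Dict, List, Tuple


def unsupported_operations(results: List[Dict]) -> List[Tuple[int, int, List[str]]]:
    """Return (subgraph index, operator index, failure types) for each unsupported op."""
    report = []
    for result in results:
        if result.get("is_supported"):
            continue
        failure_types = [
            failure["failure_type"]
            for failure in result.get("compatibility_failures", [])
        ]
        report.append(
            (
                result["subgraph_index_in_model"],
                result["operator_index_in_subgraph"],
                failure_types,
            )
        )
    return report
```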

How to configure TFLite.

Used in: BenchmarkEvent, ComputeSettings, MinibenchmarkSettings

optional Delegate delegate = 1
Which delegate to use.
optional NNAPISettings nnapi_settings = 2
How to configure the chosen delegate. (In principle we would like to use 'oneof', but flatc turns that into a nested anonymous table rather than a union. See https://github.com/google/flatbuffers/issues/4628).
optional GPUSettings gpu_settings = 3
optional HexagonSettings hexagon_settings = 4
optional XNNPackSettings xnnpack_settings = 5
optional CoreMLSettings coreml_settings = 11
optional CPUSettings cpu_settings = 6
How to configure CPU execution.
optional int32 max_delegated_partitions = 7
Shared delegation settings.
optional EdgeTpuSettings edgetpu_settings = 8
For configuring the EdgeTpuDelegate.
optional CoralSettings coral_settings = 10
For configuring the Coral EdgeTpu Delegate.
optional FallbackSettings fallback_settings = 9
Whether to automatically fall back to TFLite CPU path.
optional bool disable_default_delegates = 12
Whether to disable default delegates (XNNPack).

Validation related settings. Next ID: 2

Used in: MinibenchmarkSettings

optional int64 per_test_timeout_ms = 1
Timeout for one setting under test. If the test does not finish within this timeout, the setting is considered to be hanging.

XNNPack Delegate settings. See https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/delegates/xnnpack/xnnpack_delegate.h

Used in: XNNPackSettings

TFLITE_XNNPACK_DELEGATE_NO_FLAGS = 0
These flags match the flags in xnnpack_delegate.h.
TFLITE_XNNPACK_DELEGATE_FLAG_QS8 = 1
Enable fast signed integer XNNpack kernels.
TFLITE_XNNPACK_DELEGATE_FLAG_QU8 = 2
Enable fast unsigned integer XNNpack kernels.
TFLITE_XNNPACK_DELEGATE_FLAG_QS8_QU8 = 3
Enable both signed and unsigned integer XNNpack kernels.
TFLITE_XNNPACK_DELEGATE_FLAG_FORCE_FP16 = 4
Force 16-bit floating point inference.
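
The numeric values above behave as bit flags: QS8_QU8 (3) is the bitwise OR of QS8 (1) and QU8 (2). The sketch below mirrors the values in a Python IntFlag with shortened, hypothetical member names; it is not a generated binding for this schema.

```python
from enum import IntFlag


class XNNPackFlags(IntFlag):
    # Values copied from the enum above; names shortened for the sketch.
    NO_FLAGS = 0
    QS8 = 1
    QU8 = 2
    QS8_QU8 = 3
    FORCE_FP16 = 4


def has_flag(flags: int, flag: XNNPackFlags) -> bool:
    """Return True if every bit of `flag` is set in `flags`."""
    return (flags & flag) == flag
```

Combining QS8_QU8 with FORCE_FP16 gives the integer value 7, and has_flag(7, XNNPackFlags.QU8) is true.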

Used in: TFLiteSettings

optional int32 num_threads = 1
optional XNNPackFlags flags = 2

package tflite.proto

message BenchmarkError

optional BenchmarkStage stage = 1

optional int32 exit_code = 2

optional int32 signal = 3

repeated ErrorCode error_code = 4

optional int32 mini_benchmark_error_code = 5

message BenchmarkEvent

optional TFLiteSettings tflite_settings = 1

optional BenchmarkEventType event_type = 2

optional BenchmarkResult result = 3

optional BenchmarkError error = 4

optional int64 boottime_us = 5

optional int64 wallclock_us = 6

enum BenchmarkEventType

UNDEFINED_BENCHMARK_EVENT_TYPE = 0

START = 1

END = 2

ERROR = 3

LOGGED = 4

RECOVERED_ERROR = 5

message BenchmarkInitializationFailure

optional int32 initialization_status = 1

message BenchmarkMetric

optional string name = 1

repeated float values = 2

message BenchmarkResult

repeated int64 initialization_time_us = 1

repeated int64 inference_time_us = 2

optional int32 max_memory_kb = 3

optional bool ok = 4

repeated BenchmarkMetric metrics = 5

enum BenchmarkStage

UNKNOWN = 0

INITIALIZATION = 1

INFERENCE = 2

message BenchmarkStoragePaths

optional string storage_file_path = 1

optional string data_directory_path = 2

message BestAccelerationDecision

optional int32 number_of_source_events = 1

optional BenchmarkEvent min_latency_event = 2

optional int64 min_inference_time_us = 3

message CPUSettings

optional int32 num_threads = 1

message CompatibilityFailure

optional CompatibilityFailureType failure_type = 1

optional string description = 2

enum CompatibilityFailureType

DCC_UNSUPPORTED_QUANTIZATION_PARAMETERS = 0

DCC_INVALID_ARGUMENT = 1

DCC_INTERNAL_ERROR = 2

DCC_UNIMPLEMENTED_ERROR = 3

DCC_OUT_OF_RANGE = 4

DCC_UNSUPPORTED_OPERATOR = 5

DCC_UNSUPPORTED_VERSION = 6

DCC_UNSUPPORTED_OPERATOR_VERSION = 7

DCC_UNSUPPORTED_INPUT_TYPE = 8

DCC_NOT_RESTRICTED_SCALE_COMPLIANT = 9

DCC_UNSUPPORTED_OUTPUT_TYPE = 10

DCC_UNSUPPORTED_OPERAND_SIZE = 11

DCC_UNSUPPORTED_OPERAND_VALUE = 12

DCC_UNSUPPORTED_HYBRID_OPERATOR = 13

DCC_UNSUPPORTED_QUANTIZATION_TYPE = 14

DCC_MISSING_REQUIRED_OPERAND = 15

DCC_UNSUPPORTED_OPERAND_RANK = 16

DCC_INPUT_TENSOR_SHOULD_HAVE_CONSTANT_SHAPE = 17

DCC_UNSUPPORTED_OPERATOR_VARIANT = 18

DCC_NO_ACTIVATION_EXPECTED = 19

message CompatibilityResult

repeated OpCompatibilityResult compatibility_results = 1

message ComputeSettings

optional ExecutionPreference preference = 1

optional TFLiteSettings tflite_settings = 2

optional string model_namespace_for_statistics = 3

optional string model_identifier_for_statistics = 4

optional MinibenchmarkSettings settings_to_test_locally = 5

message CoralSettings

optional string device = 1

optional CoralSettings.Performance performance = 2